DouBao: The video generation model "VideoWorld" can recognize the world by sight alone, now open source

Feb 10

On February 10, the video generation experimental model "VideoWorld" was jointly proposed by the Doubao Large Model team, Beijing Jiaotong University and the University of Science and Technology of China. Unlike mainstream multimodal models such as Sora, DALL-E, and Midjourney, VideoWorld is the first in the industry to realize the world without relying on language models. At present, the project code and model are open source

Comments

数据猿

DouBao: The video generation model "VideoWorld" can recognize the world by sight alone, now open source