On February 10, the video generation experimental model "VideoWorld" was jointly proposed by the Doubao Large Model team, Beijing Jiaotong University and the University of Science and Technology of China. Unlike mainstream multimodal models such as Sora, DALL-E, and Midjourney, VideoWorld is the first in the industry to realize the world without relying on language models. At present, the project code and model are open source
Share this post
DouBao: The video generation model "VideoWorld" can recognize the world by sight alone, now open source
Share this post
On February 10, the video generation experimental model "VideoWorld" was jointly proposed by the Doubao Large Model team, Beijing Jiaotong University and the University of Science and Technology of China. Unlike mainstream multimodal models such as Sora, DALL-E, and Midjourney, VideoWorld is the first in the industry to realize the world without relying on language models. At present, the project code and model are open source
.