no code implementations • 13 Nov 2024 • Qiang Zhou, Shaofeng Zhang, Nianzu Yang, Ye Qian, Hao Li
Furthermore, MVideo supports motion condition editing and composition, facilitating the generation of videos with more complex actions.
1 code implementation • 21 May 2024 • Zhiyu Tan, Mengping Yang, Luozheng Qin, Hao Yang, Ye Qian, Qiang Zhou, Cheng Zhang, Hao Li
Moreover, the model capacity of the text encoder from CLIP is relatively limited compared to Large Language Models (LLMs), which offer multilingual input, accommodate longer context, and achieve superior text representation.
no code implementations • ICCV 2021 • Yue Shi, Bingbing Ni, Jinxian Liu, Dingyi Rong, Ye Qian, Wenjun Zhang
Pixel-to-mesh has wide applications, especially in virtual or augmented reality, animation and game industry.