no code implementations • 2 Jul 2024 • Huanzhang Dou, Ruixiang Li, Wei Su, Xi Li
Text-to-video (T2V) generation has attracted significant attention, yet unifying discrete and continuous grounding conditions in T2V generation remains under-explored.
no code implementations • 25 Jul 2023 • Yiming Wu, Ruixiang Li, Zequn Qin, Xinhai Zhao, Xi Li
In this work, we propose to explicitly model heights in the BEV space; compared with modeling depths, this needs no extra data such as LiDAR and can fit arbitrary camera rigs and types.