no code implementations • 12 Mar 2024 • Chuanqi Zang, Jiji Tang, Rongsheng Zhang, Zeng Zhao, Tangjie Lv, Mingtao Pei, Wei Liang
Storytelling aims to generate reasonable and vivid narratives based on an ordered image stream.
no code implementations • CVPR 2023 • Chuanqi Zang, Hanqing Wang, Mingtao Pei, Wei Liang
For textual data, the model prefers local phrase semantics, which may deviate from the global semantics in long sentences.
no code implementations • 29 Sep 2021 • Chuanqi Zang, Mingtao Pei
Video prediction is an essential task in the computer vision community, helping to solve many downstream vision tasks by predicting and modeling future motion dynamics and appearance.