Search Results for author: Chuanqi Zang

Found 3 papers, 0 papers with code

Let Storytelling Tell Vivid Stories: An Expressive and Fluent Multimodal Storyteller

no code implementations • 12 Mar 2024 • Chuanqi Zang, Jiji Tang, Rongsheng Zhang, Zeng Zhao, Tangjie Lv, Mingtao Pei, Wei Liang

Storytelling aims to generate reasonable and vivid narratives based on an ordered image stream.

Story Generation

Paper
Add Code

Discovering the Real Association: Multimodal Causal Reasoning in Video Question Answering

no code implementations • CVPR 2023 • Chuanqi Zang, Hanqing Wang, Mingtao Pei, Wei Liang

For textual data, the model prefers the local phrase semantics which may deviate from the global semantics in long sentences.

Question Answering Video Question Answering

Paper
Add Code

CDNet: A cascaded decoupling architecture for video prediction

no code implementations • 29 Sep 2021 • Chuanqi Zang, Mingtao Pei

Video prediction is an essential task in the computer vision community, helping to solve many downstream vision tasks by predicting and modeling future motion dynamics and appearance.

Video Prediction

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.