Search Results for author: Pengjun Fang

Found 1 papers, 1 papers with code

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

1 code implementation30 Jul 2024 Xiaowei Chi, Yatian Wang, Aosong Cheng, Pengjun Fang, Zeyue Tian, Yingqing He, Zhaoyang Liu, Xingqun Qi, Jiahao Pan, Rongyu Zhang, Mengfei Li, Ruibin Yuan, Yanbing Jiang, Wei Xue, Wenhan Luo, Qifeng Chen, Shanghang Zhang, Qifeng Liu, Yike Guo

To fulfill this gap, we present MMTrail, a large-scale multi-modality video-language dataset incorporating more than 20M trailer clips with visual captions, and 2M high-quality clips with multimodal captions.

Audio Generation Image to Video Generation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.