no code implementations • 12 Mar 2024 • Chuanqi Zang, Jiji Tang, Rongsheng Zhang, Zeng Zhao, Tangjie Lv, Mingtao Pei, Wei Liang
Storytelling aims to generate reasonable and vivid narratives based on an ordered image stream.
no code implementations • CVPR 2023 • Chuanqi Zang, Hanqing Wang, Mingtao Pei, Wei Liang
For textual data, the model prefers local phrase semantics, which may deviate from the global semantics in long sentences.
no code implementations • 29 Sep 2021 • Chuanqi Zang, Mingtao Pei
Video prediction is an essential task in the computer vision community, helping to solve many downstream vision tasks by predicting and modeling future motion dynamics and appearance.
no code implementations • 24 Nov 2020 • Qing Gao, Mingtao Pei, Hongyu Shen
In contrast to current lifelogging/egocentric datasets, our dataset is suitable for lifestyle analysis: images are taken at short intervals to capture activities of short duration, and they are taken continuously from morning to evening to record all the activities performed by a user.
no code implementations • 15 Mar 2019 • Yanmei Dong, Mingtao Pei, Lijia Zhang, Bin Xu, Yuwei Wu, Yunde Jia
In this paper, we propose to stitch videos from the FF-camera with a wide-angle lens and the DF-camera with a fisheye lens for telepresence robots.
3 code implementations • 13 Mar 2019 • Huiling Hao, Mingtao Pei
Previous face liveness detection methods do not utilize client identity information.
no code implementations • 22 Mar 2016 • Zhen Dong, Su Jia, Chi Zhang, Mingtao Pei
To fully exploit the useful information contained in face videos, we present a novel network architecture called the input aggregated network, which is able to learn fixed-length representations for variable-length face videos.