no code implementations • 20 Dec 2023 • Chengxiang Yin, Zhengping Che, Kun Wu, Zhiyuan Xu, Jian Tang
Visual Question Answering (VQA) has emerged as one of the most challenging tasks in artificial intelligence due to its multi-modal nature.
no code implementations • 20 Dec 2023 • Chengxiang Yin, Zhengping Che, Kun Wu, Zhiyuan Xu, Qinru Qiu, Jian Tang
Video Question Answering (VideoQA) is a very attractive and challenging research direction aiming to understand complex semantics of heterogeneous data from two domains, i. e., the spatio-temporal video content and the word sequence in question.
no code implementations • 10 Jul 2022 • Kun Wu, Chengxiang Yin, Jian Tang, Zhiyuan Xu, Yanzhi Wang, Dejun Yang
In this paper, we define a new problem called continual few-shot learning, in which tasks arrive sequentially and each task is associated with a few training samples.
no code implementations • 23 Jul 2021 • Kun Wu, Chengxiang Yin, Zhengping Che, Bo Jiang, Jian Tang, Zheng Guan, Gangyi Ding
Deep generative models have made great progress in synthesizing images with arbitrary human poses and transferring poses of one person to others.
no code implementations • ICCV 2021 • Chengxiang Yin, Kun Wu, Zhengping Che, Bo Jiang, Zhiyuan Xu, Jian Tang
Deep learning has made tremendous success in computer vision, natural language processing and even visual-semantic learning, which requires a huge amount of labeled training data.
no code implementations • 8 Jun 2018 • Chengxiang Yin, Jian Tang, Zhiyuan Xu, Yanzhi Wang
Meta-learning enables a model to learn from very limited data to undertake a new task.