1 code implementation • 4 Apr 2024 • Yuting He, Fuxiang Huang, Xinrui Jiang, Yuxiang Nie, Minghao Wang, Jiguang Wang, Hao Chen
To answer these questions, a comprehensive and deep survey of the challenges, opportunities, and future directions of HFMs is presented in this survey.
1 code implementation • 25 Mar 2024 • Han Wang, Yanjie Wang, YongJie Ye, Yuxiang Nie, Can Huang
Multi-modal Large Language Models (MLLMs) have demonstrated their ability to perceive objects in still images, but their application in video-related tasks, such as object tracking, remains understudied.
Ranked #1 on Zero-Shot Single Object Tracking on LaSOT
1 code implementation • 25 Aug 2019 • Yong Hu, He-Yan Huang, Tian Lan, Xiaochi Wei, Yuxiang Nie, Jiarui Qi, Liner Yang, Xian-Ling Mao
Second language acquisition (SLA) modeling is to predict whether second language learners could correctly answer the questions according to what they have learned.
1 code implementation • COLING 2022 • Yuxiang Nie, Heyan Huang, Zewen Chi, Xian-Ling Mao
Previous works usually make use of heuristic rules as well as pre-trained models to construct data and train QA models.
1 code implementation • 11 Oct 2022 • Yuxiang Nie, Heyan Huang, Wei Wei, Xian-Ling Mao
The proposed model mainly focuses on the evidence selection phase of long document question answering.
1 code implementation • 3 May 2023 • Yuxiang Nie, Heyan Huang, Wei Wei, Xian-Ling Mao
To alleviate the problem, it might be possible to generate long-document QA pairs via unsupervised question answering (UQA) methods.
no code implementations • 25 Jun 2023 • Xiao Zhang, Heqi Zheng, Yuxiang Nie, Heyan Huang, Xian-Ling Mao
However, the dataset has ignored the fact that different readers may have different levels of understanding of the text, and only includes single-perspective question-answer pairs, leading to a lack of consideration of different perspectives.
no code implementations • 26 Mar 2024 • Yuxiang Nie, Heyan Huang, Xian-Ling Mao, Lizi Liao
Specifically, IDPT decouples initiative factors into different prefix parameters and uses the attention mechanism to adjust the selection of initiatives in guiding generation dynamically.
no code implementations • 23 Apr 2024 • Sunan He, Yuxiang Nie, Zhixuan Chen, Zhiyuan Cai, Hongmei Wang, Shu Yang, Hao Chen
The rapid advancement of large-scale vision-language models has showcased remarkable capabilities across various tasks.