1 code implementation • arXiv 2024 • Lin Xu, Yilin Zhao, Daquan Zhou⋆†, Zhijie Lin, See Kiong Ng, Jiashi Feng
PLLaVA achieves new state-of-the-art performance on modern benchmark datasets for both video question-answer and captioning tasks.
Ranked #1 on Zero-Shot Video Question Answer on TGIF-QA
Video-based Generative Performance Benchmarking (Consistency) Video-based Generative Performance Benchmarking (Contextual Understanding) +4
no code implementations • 12 Nov 2023 • Yilin Zhao, Xinbin Yuan, ShangHua Gao, Zhijie Lin, Qibin Hou, Jiashi Feng, Daquan Zhou
For MoV, we utilize the text-to-speech (TTS) algorithms with a variety of pre-defined tones and select the most matching one based on the user-provided text description automatically.
no code implementations • 27 Oct 2023 • Yilin Zhao, Hai Zhao, Sufeng Duan
Multi-choice Machine Reading Comprehension (MRC) is a major and challenging task for machines to answer questions according to provided options.
1 code implementation • ACL 2022 • Yilin Zhao, Hai Zhao, Libin Shen, Yinggong Zhao
As a broad and major category in machine reading comprehension (MRC), the generalized goal of discriminative MRC is answer prediction from the given materials.
1 code implementation • Findings (ACL) 2021 • Kuicai Dong, Yilin Zhao, Aixin Sun, Jung-jae Kim, XiaoLi Li
Both DocOIE dataset and DocIE model are released for public.
Ranked #1 on Open Information Extraction on DocOIE-transportation
1 code implementation • 7 Dec 2020 • Yilin Zhao, Zhuosheng Zhang, Hai Zhao
Thus we propose a novel reference-based knowledge enhancement model called Reference Knowledgeable Network (RekNet), which simulates human reading strategies to refine critical information from the passage and quote explicit knowledge in necessity.