no code implementations • 9 Mar 2025 • Yihong Luo, Tianyang Hu, YiFan Song, Jiacheng Sun, Zhenguo Li, Jing Tang
This asymmetric distillation scheme enables our one-step student to handle controls unknown to the teacher model and facilitates improved classifier-free guidance (CFG) usage and seamless integration of human feedback learning (HFL).
1 code implementation • 4 Mar 2025 • Weimin Xiong, YiFan Song, Qingxiu Dong, Bingchan Zhao, Feifan Song, Xun Wang, Sujian Li
Recent advancements in large language models (LLMs) have enabled LLM-based agents to successfully tackle interactive planning tasks.
1 code implementation • 25 Feb 2025 • Yuhan Chen, Yihong Luo, YiFan Song, Pengwen Dai, Jing Tang, Xiaochun Cao
Despite extensive research efforts focused on OOD detection on images, OOD detection on nodes in graph learning remains underexplored.
no code implementations • 17 Dec 2024 • Jiebin Zhang, Dawei Zhu, YiFan Song, Wenhao Wu, Chuqiao Kuang, Xiaoguang Li, Lifeng Shang, Qun Liu, Sujian Li
As large language models (LLMs) process increasing context windows, the memory usage of KV cache has become a critical bottleneck during inference.
no code implementations • 26 Nov 2024 • Lei LI, Yuancheng Wei, Zhihui Xie, Xuqing Yang, YiFan Song, Peiyi Wang, Chenxin An, Tianyu Liu, Sujian Li, Bill Yuchen Lin, Lingpeng Kong, Qi Liu
Vision-language generative reward models (VL-GenRMs) play a crucial role in aligning and evaluating multimodal AI systems, yet their own evaluation remains under-explored.
no code implementations • 17 Oct 2024 • Jinjie Ni, YiFan Song, Deepanway Ghosal, Bo Li, David Junhao Zhang, Xiang Yue, Fuzhao Xue, Zian Zheng, Kaichen Zhang, Mahir Shah, Kabir Jain, Yang You, Michael Shieh
Perceiving and generating diverse modalities are crucial for AI models to effectively learn from and engage with real-world signals, necessitating reliable evaluations for their development.
no code implementations • 17 Oct 2024 • Junpeng Liu, Tianyue Ou, YiFan Song, Yuxiao Qu, Wai Lam, Chenyan Xiong, Wenhu Chen, Graham Neubig, Xiang Yue
Text-rich visual understanding, the ability to process environments where dense textual content is integrated with visuals, is crucial for multimodal large language models (MLLMs) to interact effectively with structured environments.
no code implementations • 10 Oct 2024 • YiFan Song, Weimin Xiong, Xiutian Zhao, Dawei Zhu, Wenhao Wu, Ke Wang, Cheng Li, Wei Peng, Sujian Li
Furthermore, we fine-tune LLMs on AgentBank to get a series of agent models, Samoyed.
1 code implementation • 12 Aug 2024 • Jiahui Jin, YiFan Song, Dong Kan, Haojia Zhu, Xiangguo Sun, Zhicheng Li, Xigang Sun, Jinghui Zhang
Urban region representation is crucial for various urban downstream tasks.
1 code implementation • 15 Jul 2024 • YiFan Song, Guoyin Wang, Sujian Li, Bill Yuchen Lin
Current evaluations of large language models (LLMs) often overlook non-determinism, typically focusing on a single output per example.
1 code implementation • 17 Jun 2024 • Weimin Xiong, YiFan Song, Xiutian Zhao, Wenhao Wu, Xun Wang, Ke Wang, Cheng Li, Wei Peng, Sujian Li
Large language model agents have exhibited exceptional performance across a range of complex interactive tasks.
1 code implementation • 18 Apr 2024 • Dawei Zhu, Liang Wang, Nan Yang, YiFan Song, Wenhao Wu, Furu Wei, Sujian Li
This paper explores context window extension of existing embedding models, pushing the limit to 32k without requiring additional training.
1 code implementation • 9 Apr 2024 • Junpeng Liu, YiFan Song, Bill Yuchen Lin, Wai Lam, Graham Neubig, Yuanzhi Li, Xiang Yue
Multimodal large language models (MLLMs) have shown promise in web-related tasks, but evaluating their performance in the web domain remains a challenge due to the lack of comprehensive benchmarks.
1 code implementation • 31 Mar 2024 • Dawei Zhu, Wenhao Wu, YiFan Song, Fangwei Zhu, Ziqiang Cao, Sujian Li
Due to the scarcity of annotated data, data augmentation is commonly used for training coherence evaluation models.
2 code implementations • 4 Mar 2024 • YiFan Song, Da Yin, Xiang Yue, Jie Huang, Sujian Li, Bill Yuchen Lin
This iterative cycle of exploration and training fosters continued improvement in the agents.
no code implementations • 16 Dec 2023 • Jiarui Yang, Songpengcheng Xia, YiFan Song, Qi Wu, Ling Pei
Human body reconstruction with Millimeter Wave (mmWave) radar point clouds has gained significant interest due to its ability to work in adverse environments and its capacity to mitigate privacy concerns associated with traditional camera-based solutions.
1 code implementation • 10 Oct 2023 • YiFan Song, Peiyi Wang, Weimin Xiong, Dawei Zhu, Tianyu Liu, Zhifang Sui, Sujian Li
Continual learning (CL) aims to constantly learn new knowledge over time while avoiding catastrophic forgetting on old tasks.
1 code implementation • 10 Oct 2023 • Weimin Xiong, YiFan Song, Peiyi Wang, Sujian Li
Continual relation extraction (CRE) aims to solve the problem of catastrophic forgetting when learning a sequence of newly emerging relations.
no code implementations • 1 Oct 2023 • YiFan Song, Keyang Yu, Seth Young
This work leverages the U.S. Federal Aviation Administration's Traffic Flow Management System dataset and DV8, a recently developed tool for highly interactive visualization of air traffic data, to develop clustering algorithms for categorizing air traffic by their varying flight paths.
2 code implementations • 19 Sep 2023 • Dawei Zhu, Nan Yang, Liang Wang, YiFan Song, Wenhao Wu, Furu Wei, Sujian Li
To decouple train length from target length for efficient context window extension, we propose Positional Skip-wisE (PoSE) training that smartly simulates long inputs using a fixed context window.
no code implementations • 5 Sep 2023 • YiFan Song, Mengkun She, Kevin Köser
To validate the effectiveness of our approach, we conducted extensive experiments on simulated and real-world datasets.
no code implementations • 11 Aug 2023 • Mengkun She, YiFan Song, David Nakath, Kevin Köser
Despite impressive results achieved by many on-land visual mapping algorithms in recent decades, transferring these methods from land to the deep sea remains a challenge due to harsh environmental conditions.
no code implementations • 11 Jun 2023 • YiFan Song, Weimin Xiong, Dawei Zhu, Wenhao Wu, Han Qian, Mingbo Song, Hailiang Huang, Cheng Li, Ke Wang, Rong Yao, Ye Tian, Sujian Li
To address the practical challenges of tackling complex instructions, we propose RestGPT, which exploits the power of LLMs and conducts a coarse-to-fine online planning mechanism to enhance the abilities of task decomposition and API selection.
no code implementations • 12 May 2023 • YiFan Song, Peiyi Wang, Dawei Zhu, Tianyu Liu, Zhifang Sui, Sujian Li
Continual learning (CL) aims to constantly learn new knowledge over time while avoiding catastrophic forgetting on old tasks.
1 code implementation • 20 Mar 2023 • Hongbo Wang, Weimin Xiong, YiFan Song, Dawei Zhu, Yu Xia, Sujian Li
Joint entity and relation extraction (JERE) is one of the most important tasks in information extraction.
no code implementations • 16 Mar 2023 • Jaromir Savelka, Arav Agarwal, Christopher Bogart, YiFan Song, Majd Sakr
We evaluated the capability of generative pre-trained transformers (GPT) to pass assessments in introductory and intermediate Python programming courses at the postsecondary level.
1 code implementation • 10 Oct 2022 • Peiyi Wang, YiFan Song, Tianyu Liu, Binghuai Lin, Yunbo Cao, Sujian Li, Zhifang Sui
In this paper, through empirical studies we argue that this assumption may not hold, and an important reason for catastrophic forgetting is that the learned representations do not have good robustness against the appearance of analogous relations in the subsequent learning process.
1 code implementation • 7 Oct 2022 • Qingxiu Dong, Damai Dai, YiFan Song, Jingjing Xu, Zhifang Sui, Lei LI
However, we find that facts stored in the PLMs are not always correct.
1 code implementation • COLING 2022 • Dawei Zhu, Qiusi Zhan, Zhejian Zhou, YiFan Song, Jiebin Zhang, Sujian Li
Different from previous token-level or sentence-level counterparts, ConFiguRe aims at extracting a figurative unit from discourse-level context, and classifying the figurative unit into the right figure type.
no code implementations • 1 Sep 2022 • Peiyi Wang, YiFan Song, Tianyu Liu, Rundong Gao, Binghuai Lin, Yunbo Cao, Zhifang Sui
2) Balanced Tuning (BT) finetunes the model on the balanced memory data.
no code implementations • 26 Jul 2022 • Ziqiao Ao, Gergely Horvath, Chunyuan Sheng, YiFan Song, Yutong Sun
In this paper, we compare different methods to extract skill requirements from job advertisements.
1 code implementation • 2 May 2022 • Shoujie Tong, Qingxiu Dong, Damai Dai, YiFan Song, Tianyu Liu, Baobao Chang, Zhifang Sui
For each instance in a batch, we involve other instances in the same batch to interact with it.
no code implementations • 14 Dec 2021 • Mengkun She, Tim Weiß, YiFan Song, Peter Urban, Jens Greinert, Kevin Köser
Besides reporting the steps to make bubble characterization robust and autonomous, we carefully evaluate the reachable accuracy to be in the range of 1-2% of the bubble radius and propose a novel auto-calibration procedure that, due to the lack of point correspondences, uses only the silhouettes of bubbles.
no code implementations • 1 Oct 2021 • Kevin Köser, YiFan Song, Lasse Petersen, Emanuel Wenzlaff, Felix Woelk
The majority of Earth's surface lies deep in the oceans, where no surface light reaches.
no code implementations • 14 Aug 2021 • Mengkun She, David Nakath, YiFan Song, Kevin Köser
Underwater cameras are typically placed behind glass windows to protect them from the water.