no code implementations • 22 Jul 2024 • Xiao Liu, Liangzhi Li, Tong Xiang, Fuying Ye, Lu Wei, Wangyue Li, Noa Garcia
Unlike conventional methods that target explicit malicious responses, our approach delves deeper into the nature of the information provided in responses.
1 code implementation • 26 Mar 2024 • Wangyue Li, Liangzhi Li, Tong Xiang, Xiao Liu, Wei Deng, Noa Garcia
Additionally, we propose two methods to quantify the consistency and confidence of LLMs' output, which can be generalized to other QA evaluation benchmarks.
1 code implementation • 2 Nov 2023 • Zhouqiang Jiang, Bowen Wang, Tong Xiang, Zhaofeng Niu, Hong Tang, Guangshun Li, Liangzhi Li
Learning representations from videos requires understanding continuous motion and visual correspondences between frames.
no code implementations • 24 Oct 2023 • Junyi Liu, Liangzhi Li, Tong Xiang, Bowen Wang, Yiming Qian
Our summarization compression can reduce 65% of the retrieval token size with further 0. 3% improvement on the accuracy; semantic compression provides a more flexible way to trade-off the token size with performance, for which we can reduce the token size by 20% with only 1. 6% of accuracy drop.
1 code implementation • NeurIPS 2023 • Tong Xiang, Liangzhi Li, Wangyue Li, Mingbai Bai, Lu Wei, Bowen Wang, Noa Garcia
In an effort to minimize the reliance on human resources for performance evaluation, we offer off-the-shelf judgment models for automatically assessing the LF output of LLMs given benchmark questions.
1 code implementation • 12 Oct 2021 • Jiayuan Ding, Tong Xiang, Zijing Ou, Wangyang Zuo, Ruihui Zhao, Chenghua Lin, Yefeng Zheng, Bang Liu
In this paper, we introduce a new task named Reading Path Generation (RPG) which aims at automatically producing a path of papers to read for a given query.
no code implementations • 30 Aug 2021 • Jacob Beel, Tong Xiang, Sandeep Soni, Diyi Yang
As public discourse continues to move and grow online, conversations about divisive topics on social media platforms have also increased.
no code implementations • EACL (WASSA) 2021 • Tong Xiang, Sean MacAvaney, Eugene Yang, Nazli Goharian
Despite the recent successes of transformer-based models in terms of effectiveness on a variety of tasks, their decisions often remain opaque to humans.
no code implementations • SEMEVAL 2020 • Sajad Sotudeh, Tong Xiang, Hao-Ren Yao, Sean MacAvaney, Eugene Yang, Nazli Goharian, Ophir Frieder
Offensive language detection is an important and challenging task in natural language processing.