Search Results for author: Tong Xiang

Found 8 papers, 4 papers with code

Can multiple-choice questions really be useful in detecting the abilities of LLMs?

1 code implementation26 Mar 2024 Wangyue Li, Liangzhi Li, Tong Xiang, Xiao Liu, Wei Deng, Noa Garcia

Additionally, we propose two methods to quantify the consistency and confidence of LLMs' output, which can be generalized to other QA evaluation benchmarks.

Multiple-choice Question Answering

Concatenated Masked Autoencoders as Spatial-Temporal Learner

1 code implementation2 Nov 2023 Zhouqiang Jiang, Bowen Wang, Tong Xiang, Zhaofeng Niu, Hong Tang, Guangshun Li, Liangzhi Li

Learning representations from videos requires understanding continuous motion and visual correspondences between frames.

Action Recognition Data Augmentation +3

TCRA-LLM: Token Compression Retrieval Augmented Large Language Model for Inference Cost Reduction

no code implementations24 Oct 2023 Junyi Liu, Liangzhi Li, Tong Xiang, Bowen Wang, Yiming Qian

Our summarization compression can reduce 65% of the retrieval token size with further 0. 3% improvement on the accuracy; semantic compression provides a more flexible way to trade-off the token size with performance, for which we can reduce the token size by 20% with only 1. 6% of accuracy drop.

Food recommendation In-Context Learning +3

CARE-MI: Chinese Benchmark for Misinformation Evaluation in Maternity and Infant Care

1 code implementation NeurIPS 2023 Tong Xiang, Liangzhi Li, Wangyue Li, Mingbai Bai, Lu Wei, Bowen Wang, Noa Garcia

In an effort to minimize the reliance on human resources for performance evaluation, we offer off-the-shelf judgment models for automatically assessing the LF output of LLMs given benchmark questions.

Misinformation

Tell Me How to Survey: Literature Review Made Simple with Automatic Reading Path Generation

1 code implementation12 Oct 2021 Jiayuan Ding, Tong Xiang, Zijing Ou, Wangyang Zuo, Ruihui Zhao, Chenghua Lin, Yefeng Zheng, Bang Liu

In this paper, we introduce a new task named Reading Path Generation (RPG) which aims at automatically producing a path of papers to read for a given query.

Linguistic Characterization of Divisive Topics Online: Case Studies on Contentiousness in Abortion, Climate Change, and Gun Control

no code implementations30 Aug 2021 Jacob Beel, Tong Xiang, Sandeep Soni, Diyi Yang

As public discourse continues to move and grow online, conversations about divisive topics on social media platforms have also increased.

ToxCCIn: Toxic Content Classification with Interpretability

no code implementations EACL (WASSA) 2021 Tong Xiang, Sean MacAvaney, Eugene Yang, Nazli Goharian

Despite the recent successes of transformer-based models in terms of effectiveness on a variety of tasks, their decisions often remain opaque to humans.

Classification General Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.