Search Results for author: Shijue Huang

Found 9 papers, 8 papers with code

CGIM: A Cycle Guided Interactive Learning Model for Consistency Identification in Task-oriented Dialogue

1 code implementation COLING 2022 Libo Qin, Qiguang Chen, Tianbao Xie, Qian Liu, Shijue Huang, Wanxiang Che, Zhou Yu

Consistency identification in task-oriented dialog (CI-ToD) usually consists of three subtasks, aiming to identify inconsistency between current system response and current user response, dialog history and the corresponding knowledge base.

CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models

1 code implementation6 Mar 2024 Zexuan Qiu, Jingjing Li, Shijue Huang, Wanjun Zhong, Irwin King

Developing Large Language Models (LLMs) with robust long-context capabilities has been the recent research focus, resulting in the emergence of long-context LLMs proficient in Chinese.

Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios

1 code implementation30 Jan 2024 Shijue Huang, Wanjun Zhong, Jianqiao Lu, Qi Zhu, Jiahui Gao, Weiwen Liu, Yutai Hou, Xingshan Zeng, Yasheng Wang, Lifeng Shang, Xin Jiang, Ruifeng Xu, Qun Liu

The recent trend of using Large Language Models (LLMs) as tool agents in real-world applications underscores the necessity for comprehensive evaluations of their capabilities, particularly in complex scenarios involving planning, creating, and using tools.

Benchmarking

SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection

1 code implementation31 Dec 2023 Shijue Huang, Libo Qin, Bingbing Wang, Geng Tu, Ruifeng Xu

The two core challenges for multi-modal intent detection are (1) how to effectively align and fuse different features of modalities and (2) the limited labeled multi-modal intent training data.

Data Augmentation Intent Detection +2

Cross-lingual Prompting: Improving Zero-shot Chain-of-Thought Reasoning across Languages

1 code implementation23 Oct 2023 Libo Qin, Qiguang Chen, Fuxuan Wei, Shijue Huang, Wanxiang Che

The cross-lingual alignment prompting is responsible for aligning representations across different languages, whereas the task-specific solver prompting is used to generate the final chain of thoughts and results for the reasoning task.

Improving Few-shot and Zero-shot Entity Linking with Coarse-to-Fine Lexicon-based Retriever

no code implementations7 Aug 2023 Shijue Huang, Bingbing Wang, Libo Qin, Qin Zhao, Ruifeng Xu

Few-shot and zero-shot entity linking focus on the tail and emerging entities, which are more challenging but closer to real-world scenarios.

Entity Linking Retrieval

Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

1 code implementation23 Sep 2021 Libo Qin, Tianbao Xie, Shijue Huang, Qiguang Chen, Xiao Xu, Wanxiang Che

Consistency Identification has obtained remarkable success on open-domain dialogue, which can be used for preventing inconsistent response generation.

Benchmarking Response Generation

Cannot find the paper you are looking for? You can Submit a new open access paper.