Search Results for author: Zujie Wen

Found 17 papers, 6 papers with code

Strength Lies in Differences! Towards Effective Non-collaborative Dialogues via Tailored Strategy Planning

no code implementations • 11 Mar 2024 • Tong Zhang, Chen Huang, Yang Deng, Hongru Liang, Jia Liu, Zujie Wen, Wenqiang Lei, Tat-Seng Chua

We investigate non-collaborative dialogue agents that must engage in tailored strategic planning for diverse users to secure a favorable agreement.

AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback

no code implementations • 2 Feb 2024 • Jian Guan, Wei Wu, Zujie Wen, Peng Xu, Hongning Wang, Minlie Huang

We present AMOR, an agent framework based on open-source LLMs, which reasons with external knowledge bases and adapts to specific domains through human supervision to the reasoning process.

Multi-granularity Correspondence Learning from Long-term Noisy Videos

1 code implementation • 30 Jan 2024 • Yijie Lin, Jie Zhang, Zhenyu Huang, Jia Liu, Zujie Wen, Xi Peng

Existing video-language studies mainly focus on learning short video clips, leaving long-term temporal dependencies rarely explored due to over-high computational cost of modeling long videos.

Action Segmentation, Long Video Retrieval (Background Removed), +2

Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems

no code implementations • 11 Jan 2024 • Tianyu Cui, Yanling Wang, Chuanpu Fu, Yong Xiao, Sijia Li, Xinhao Deng, Yunpeng Liu, Qinglin Zhang, Ziyi Qiu, Peiyang Li, Zhixing Tan, Junwu Xiong, Xinyu Kong, Zujie Wen, Ke Xu, Qi Li

Based on this, we propose a comprehensive taxonomy, which systematically analyzes potential risks associated with each module of an LLM system and discusses the corresponding mitigation strategies.

Language Modelling, Large Language Model

Multi-view Hypergraph Contrastive Policy Learning for Conversational Recommendation

1 code implementation • 26 Jul 2023 • Sen Zhao, Wei Wei, Xian-Ling Mao, Shuai Zhu, Minghui Yang, Zujie Wen, Dangyang Chen, Feida Zhu

Specifically, MHCPL timely chooses useful social information according to the interactive history and builds a dynamic hypergraph with three types of multiplex relations from different views.

Recommendation Systems

Towards Hierarchical Policy Learning for Conversational Recommendation with Hypergraph-based Reinforcement Learning

1 code implementation • 4 May 2023 • Sen Zhao, Wei Wei, Yifan Liu, Ziyang Wang, Wendi Li, Xian-Ling Mao, Shuai Zhu, Minghui Yang, Zujie Wen

Conversational recommendation systems (CRS) aim to timely and proactively acquire user dynamic preferred attributes through conversations for item recommendation.

Attribute, Decision Making, +2

Robust Domain Adaptation for Machine Reading Comprehension

no code implementations • 23 Sep 2022 • Liang Jiang, Zhenyu Huang, Jia Liu, Zujie Wen, Xi Peng

Such a process will inevitably introduce mismatched pairs (i.e., noisy correspondence) due to i) the unavailable QA pairs in target documents, and ii) the domain shift during applying the QA construction model to the target domain.

Domain Adaptation, Machine Reading Comprehension

AdaCoach: A Virtual Coach for Training Customer Service Agents

no code implementations • 27 Apr 2022 • Shuang Peng, Shuai Zhu, Minghui Yang, Haozhou Huang, Dan Liu, Zujie Wen, Xuelian Li, Biao Fan

With the development of online business, customer service agents gradually play a crucial role as an interface between the companies and their customers.

Dialogue Evaluation

A Dialogue-based Information Extraction System for Medical Insurance Assessment

no code implementations • Findings (ACL) 2021 • Shuang Peng, Mengdi Zhou, Minghui Yang, Haitao Mi, Shaosheng Cao, Zujie Wen, Teng Xu, Hongbin Wang, Lei Liu

In the Chinese medical insurance industry, the assessor's role is essential and requires significant efforts to converse with the claimant.

R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling

1 code implementation • ACL 2021 • Xiang Hu, Haitao Mi, Zujie Wen, Yafang Wang, Yi Su, Jing Zheng, Gerard de Melo

Human language understanding operates at multiple levels of granularity (e.g., words, phrases, and sentences) with increasing levels of abstraction that can be hierarchically combined.

Language Modelling

SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising

1 code implementation • Findings (NAACL) 2022 • Kuan Xu, Yongbo Wang, Yongliang Wang, Zujie Wen, Yang Dong

In text-to-SQL task, seq-to-seq models often lead to sub-optimal performance due to limitations in their architecture.

Denoising, slot-filling, +2

Query Distillation: BERT-based Distillation for Ensemble Ranking

no code implementations • COLING 2020 • Wangshu Zhang, Junhong Liu, Zujie Wen, Yafang Wang, Gerard de Melo

We present a novel two-stage distillation method for ranking problems that allows a smaller student model to be trained while benefitting from the better performance of the teacher model, providing better control of the inference latency and computational burden.

Knowledge Distillation

Long Short-Term Sample Distillation

no code implementations • 2 Mar 2020 • Liang Jiang, Zujie Wen, Zhongping Liang, Yafang Wang, Gerard de Melo, Zhe Li, Liangzhuang Ma, Jiaxing Zhang, Xiaolong Li, Yuan Qi

The long-term teacher draws on snapshots from several epochs ago in order to provide steadfast guidance and to guarantee teacher-student differences, while the short-term one yields more up-to-date cues with the goal of enabling higher-quality updates.
