Search Results for author: Shen Gao

Found 40 papers, 20 papers with code

DRE: Generating Recommendation Explanations by Aligning Large Language Models at Data-level

no code implementations • 9 Apr 2024 • Shen Gao, Yifan Wang, Jiabao Fang, Lisi Chen, Peng Han, Shuo Shang

Recommendation systems play a crucial role in various domains, suggesting items based on user behavior. However, the lack of transparency in presenting recommendations can lead to user confusion.

Recommendation Systems

360°REA: Towards A Reusable Experience Accumulation with 360° Assessment for Multi-Agent System

no code implementations • 8 Apr 2024 • Shen Gao, Hao Li, Zhengliang Shi, Chengrui Huang, Quan Tu, Zhiliang Tian, Minlie Huang, Shuo Shang

The framework employs a novel 360° performance assessment method for multi-perspective performance evaluation with fine-grained assessment.

Language Modelling, Large Language Model

Selecting Query-bag as Pseudo Relevance Feedback for Information-seeking Conversations

no code implementations • 22 Mar 2024 • Xiaoqing Zhang, Xiuying Chen, Shen Gao, Shuqi Li, Xin Gao, Ji-Rong Wen, Rui Yan

Given the user query, information-seeking dialogue systems first retrieve a subset of response candidates, then select the best response from the candidate set through re-ranking.

Contrastive Learning, Re-Ranking

Characteristic AI Agents via Large Language Models

1 code implementation • 19 Mar 2024 • Xi Wang, Hongliang Dai, Shen Gao, Piji Li

In response to this research gap, we create a benchmark for the characteristic AI agents task, including dataset, techniques, and evaluation metrics.

Chatbot

Generative News Recommendation

1 code implementation • 6 Mar 2024 • Shen Gao, Jiabao Fang, Quan Tu, Zhitao Yao, Zhumin Chen, Pengjie Ren, Zhaochun Ren

In this paper, we propose a novel generative news recommendation paradigm that includes two steps: (1) Leveraging the internal knowledge and reasoning capabilities of the Large Language Model (LLM) to perform high-level matching between candidate news and user representation; (2) Generating a coherent and logically structured narrative based on the associations between related news and user interests, thus engaging users in further reading of the news.

Language Modelling, Large Language Model, +1

Learning to Use Tools via Cooperative and Interactive Agents

no code implementations • 5 Mar 2024 • Zhengliang Shi, Shen Gao, Xiuyi Chen, Lingyong Yan, Haibo Shi, Dawei Yin, Zhumin Chen, Pengjie Ren, Suzan Verberne, Zhaochun Ren

Tool learning empowers large language models (LLMs) as agents to use external tools to extend their capability.

Lightweight Deep Learning Based Channel Estimation for Extremely Large-Scale Massive MIMO Systems

1 code implementation • 14 Feb 2024 • Shen Gao, Peihao Dong, Zhiwen Pan, Xiaohu You

Extremely large-scale massive multiple-input multiple-output (XL-MIMO) systems introduce much higher channel dimensionality and incur an additional near-field propagation effect, increasing the computational load and the difficulty of acquiring prior knowledge for channel estimation.

Quantization

A Multi-Agent Conversational Recommender System

no code implementations • 2 Feb 2024 • Jiabao Fang, Shen Gao, Pengjie Ren, Xiuying Chen, Suzan Verberne, Zhaochun Ren

First, we design a multi-agent act planning framework, which can control the dialogue flow based on four LLM-based agents.

Recommendation Systems

Multi-Defendant Legal Judgment Prediction via Hierarchical Reasoning

1 code implementation • 10 Dec 2023 • Yougang Lyu, Jitai Hao, Zihan Wang, Kai Zhao, Shen Gao, Pengjie Ren, Zhumin Chen, Fang Wang, Zhaochun Ren

Multiple defendants in a criminal fact description generally exhibit complex interactions, and cannot be well handled by existing Legal Judgment Prediction (LJP) methods, which focus on predicting judgment results (e.g., law articles, charges, and terms of penalty) for single-defendant cases.

Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum

1 code implementation • 27 Aug 2023 • Shen Gao, Zhengliang Shi, Minghang Zhu, Bowen Fang, Xin Xin, Pengjie Ren, Zhumin Chen, Jun Ma, Zhaochun Ren

Augmenting large language models (LLMs) with external tools has emerged as a promising approach to extending the capability of LLMs.

UMSE: Unified Multi-scenario Summarization Evaluation

1 code implementation • 26 May 2023 • Shen Gao, Zhitao Yao, Chongyang Tao, Xiuying Chen, Pengjie Ren, Zhaochun Ren, Zhumin Chen

Experimental results across three typical scenarios on the benchmark dataset SummEval indicate that our UMSE can achieve comparable performance with several existing strong methods which are specifically designed for each scenario.

Text Summarization

A Topic-aware Summarization Framework with Different Modal Side Information

no code implementations • 19 May 2023 • Xiuying Chen, Mingzhe Li, Shen Gao, Xin Cheng, Qiang Yang, Qishen Zhang, Xin Gao, Xiangliang Zhang

To address these two challenges, we first propose a unified topic encoder, which jointly discovers latent topics from the document and various kinds of side information.

Contrastive Learning

Follow the Timeline! Generating Abstractive and Extractive Timeline Summary in Chronological Order

1 code implementation • 2 Jan 2023 • Xiuying Chen, Mingzhe Li, Shen Gao, Zhangming Chan, Dongyan Zhao, Xin Gao, Xiangliang Zhang, Rui Yan

Nowadays, time-stamped web documents related to a general news query flood the Internet, and timeline summarization aims to concisely summarize the evolution trajectory of events along the timeline.

Document Summarization, Timeline Summarization, +1

Contrastive Learning Reduces Hallucination in Conversations

1 code implementation • 20 Dec 2022 • Weiwei Sun, Zhengliang Shi, Shen Gao, Pengjie Ren, Maarten de Rijke, Zhaochun Ren

MixCL effectively reduces the hallucination of LMs in conversations and achieves the highest performance among LM-based dialogue agents in terms of relevancy and factuality.

Contrastive Learning, Hallucination

Scientific Paper Extractive Summarization Enhanced by Citation Graphs

no code implementations • 8 Dec 2022 • Xiuying Chen, Mingzhe Li, Shen Gao, Rui Yan, Xin Gao, Xiangliang Zhang

We first propose a Multi-granularity Unsupervised Summarization model (MUS) as a simple and low-cost solution to the task.

Extractive Summarization, Link Prediction, +1

Neural Machine Translation with Contrastive Translation Memories

1 code implementation • 6 Dec 2022 • Xin Cheng, Shen Gao, Lemao Liu, Dongyan Zhao, Rui Yan

Retrieval-augmented Neural Machine Translation models have been successful in many translation scenarios.

Contrastive Learning, Machine Translation, +4

Target-aware Abstractive Related Work Generation with Contrastive Learning

1 code implementation • 26 May 2022 • Xiuying Chen, Hind Alamro, Mingzhe Li, Shen Gao, Rui Yan, Xin Gao, Xiangliang Zhang

The related work section is an important component of a scientific paper, which highlights the contribution of the target paper in the context of the reference papers.

Contrastive Learning, TAG

Capturing Relations between Scientific Papers: An Abstractive Model for Related Work Section Generation

1 code implementation • ACL 2021 • Xiuying Chen, Hind Alamro, Mingzhe Li, Shen Gao, Xiangliang Zhang, Dongyan Zhao, Rui Yan

Hence, in this paper, we propose a Relation-aware Related work Generator (RRG), which generates an abstractive related work from the given multiple scientific papers in the same research area.

Relation

Learning to Organize a Bag of Words into Sentences with Neural Networks: An Empirical Study

no code implementations • NAACL 2021 • Chongyang Tao, Shen Gao, Juntao Li, Yansong Feng, Dongyan Zhao, Rui Yan

Sequential information, a.k.a. orders, is assumed to be essential for processing a sequence with recurrent neural network or convolutional neural network based encoders.

Sentence

Deep Multi-Stage CSI Acquisition for Reconfigurable Intelligent Surface Aided MIMO Systems

no code implementations • 23 Apr 2021 • Shen Gao, Peihao Dong, Zhiwen Pan, Geoffrey Ye Li

This article aims to reduce the huge pilot overhead incurred when estimating the reconfigurable intelligent surface (RIS) relayed wireless channel.

The Style-Content Duality of Attractiveness: Learning to Write Eye-Catching Headlines via Disentanglement

no code implementations • 14 Dec 2020 • Mingzhe Li, Xiuying Chen, Min Yang, Shen Gao, Dongyan Zhao, Rui Yan

In this paper, we propose a Disentanglement-based Attractive Headline Generator (DAHG) that generates headlines which capture the attractive content while following the attractive style.

Disentanglement

Meaningful Answer Generation of E-Commerce Question-Answering

no code implementations • 14 Nov 2020 • Shen Gao, Xiuying Chen, Zhaochun Ren, Dongyan Zhao, Rui Yan

To generate more meaningful answers, in this paper, we propose a novel generative neural model, called the Meaningful Product Answer Generator (MPAG), which alleviates the safe answer problem by taking product reviews, product attributes, and a prototype answer into consideration.

Answer Generation, Question Answering, +1

Learning to Respond with Your Favorite Stickers: A Framework of Unifying Multi-Modality and User Preference in Multi-Turn Dialog

no code implementations • 5 Nov 2020 • Shen Gao, Xiuying Chen, Li Liu, Dongyan Zhao, Rui Yan

Hence, in this paper, we propose to recommend an appropriate sticker to the user based on the multi-turn dialog context and the user's sticker usage history.

VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles

1 code implementation • EMNLP 2020 • Mingzhe Li, Xiuying Chen, Shen Gao, Zhangming Chan, Dongyan Zhao, Rui Yan

Hence, in this paper, we propose the task of Video-based Multimodal Summarization with Multimodal Output (VMSMO) to tackle such a problem.

From Standard Summarization to New Tasks and Beyond: Summarization with Manifold Information

no code implementations • 10 May 2020 • Shen Gao, Xiuying Chen, Zhaochun Ren, Dongyan Zhao, Rui Yan

Text summarization is the research area aiming at creating a short and condensed version of the original document, which conveys the main idea of the document in a few words.

Text Summarization

Learning to Respond with Stickers: A Framework of Unifying Multi-Modality in Multi-Turn Dialog

1 code implementation • 10 Mar 2020 • Shen Gao, Xiuying Chen, Chang Liu, Li Liu, Dongyan Zhao, Rui Yan

Stickers with vivid and engaging expressions are becoming increasingly popular in online messaging apps, and some works are dedicated to automatically selecting a sticker response by matching text labels of stickers with previous utterances.

Reinforcement Learning Based Cooperative Coded Caching under Dynamic Popularities in Ultra-Dense Networks

no code implementations • 8 Mar 2020 • Shen Gao, Peihao Dong, Zhiwen Pan, Geoffrey Ye Li

For ultra-dense networks with wireless backhaul, caching strategy at small base stations (SBSs), usually with limited storage, is critical to meet massive high data rate requests.

Q-Learning, Reinforcement Learning (RL)

RPM-Oriented Query Rewriting Framework for E-commerce Keyword-Based Sponsored Search

no code implementations • 28 Oct 2019 • Xiuying Chen, Daorui Xiao, Shen Gao, Guojun Liu, Wei Lin, Bo Zheng, Dongyan Zhao, Rui Yan

Sponsored search optimizes revenue and relevance, which is estimated by Revenue Per Mille (RPM).

How to Write Summaries with Patterns? Learning towards Abstractive Summarization through Prototype Editing

1 code implementation • IJCNLP 2019 • Shen Gao, Xiuying Chen, Piji Li, Zhangming Chan, Dongyan Zhao, Rui Yan

There are two main challenges in this task: (1) the model needs to incorporate learned patterns from the prototype, but (2) should avoid copying contents other than the patternized words (such as irrelevant facts) into the generated summaries.

Abstractive Text Summarization

Learning towards Abstractive Timeline Summarization

1 code implementation • IJCAI 2019 • Xiuying Chen, Zhangming Chan, Shen Gao, Meng-Hsuan Yu, Dongyan Zhao, Rui Yan

Timeline summarization aims to concisely summarize the evolution trajectory along the timeline, and existing timeline summarization approaches are all based on extractive methods. In this paper, we propose the task of abstractive timeline summarization, which aims to concisely paraphrase the information in the time-stamped events. Unlike traditional document summarization, timeline summarization needs to model the time series information of the input events and summarize important events in chronological order. To tackle this challenge, we propose a memory-based timeline summarization model (MTS). Concretely, we propose a time-event memory to establish a timeline, and use the time position of events on this timeline to guide the generation process. Besides, in each decoding step, we incorporate event-level information into word-level attention to avoid confusion between events. Extensive experiments are conducted on a large-scale real-world dataset, and the results show that MTS achieves state-of-the-art performance in terms of both automatic and human evaluations.

Document Summarization, Timeline Summarization, +2

Product-Aware Answer Generation in E-Commerce Question-Answering

1 code implementation • 23 Jan 2019 • Shen Gao, Zhaochun Ren, Yihong Eric Zhao, Dongyan Zhao, Dawei Yin, Rui Yan

In this paper, we propose the task of product-aware answer generation, which aims to generate an accurate and complete answer from large-scale unlabeled e-commerce reviews and product attributes.

Answer Generation, Question Answering

Abstractive Text Summarization by Incorporating Reader Comments

no code implementations • 13 Dec 2018 • Shen Gao, Xiuying Chen, Piji Li, Zhaochun Ren, Lidong Bing, Dongyan Zhao, Rui Yan

To tackle this problem, we propose the task of reader-aware abstractive summary generation, which utilizes the reader comments to help the model produce a better summary of the main aspect.

Reader-Aware Summarization

Iterative Document Representation Learning Towards Summarization with Polishing

1 code implementation • EMNLP 2018 • Xiuying Chen, Shen Gao, Chongyang Tao, Yan Song, Dongyan Zhao, Rui Yan

In this paper, we introduce Iterative Text Summarization (ITS), an iteration-based model for supervised extractive text summarization, inspired by the observation that it is often necessary for a human to read an article multiple times in order to fully understand and summarize its contents.

Extractive Text Summarization, Representation Learning, +1
