Search Results for author: Jian-Yun Nie

Found 58 papers, 31 papers with code

A Survey of Large Language Models

4 code implementations 31 Mar 2023 Wayne Xin Zhao, Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, YiFan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-Yun Nie, Ji-Rong Wen

To discriminate the difference in parameter scale, the research community has coined the term large language models (LLM) for the PLMs of significant size.

Language Modelling

Retrieve Anything To Augment Large Language Models

1 code implementation 11 Oct 2023 Peitian Zhang, Shitao Xiao, Zheng Liu, Zhicheng Dou, Jian-Yun Nie

On the other hand, the task-specific retrievers lack the required versatility, hindering their performance across the diverse retrieval augmentation scenarios.

Knowledge Distillation Retrieval

TextBox 2.0: A Text Generation Library with Pre-trained Language Models

1 code implementation 26 Dec 2022 Tianyi Tang, Junyi Li, Zhipeng Chen, Yiwen Hu, Zhuohao Yu, Wenxun Dai, Zican Dong, Xiaoxue Cheng, Yuhao Wang, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen

To facilitate research on text generation, this paper presents a comprehensive and unified library, TextBox 2.0, focusing on the use of pre-trained language models (PLMs).

Abstractive Text Summarization Data-to-Text Generation +7

A General SIMD-based Approach to Accelerating Compression Algorithms

1 code implementation 6 Feb 2015 Wayne Xin Zhao, Xu-Dong Zhang, Daniel Lemire, Dongdong Shan, Jian-Yun Nie, Hongfei Yan, Ji-Rong Wen

Compression algorithms are important for data oriented tasks, especially in the era of Big Data.

HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models

2 code implementations 19 May 2023 Junyi Li, Xiaoxue Cheng, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen

Large language models (LLMs), such as ChatGPT, are prone to generate hallucinations, i.e., content that conflicts with the source or cannot be verified by the factual knowledge.

Hallucination Hallucination Evaluation

A Hierarchical Recurrent Encoder-Decoder For Generative Context-Aware Query Suggestion

4 code implementations 8 Jul 2015 Alessandro Sordoni, Yoshua Bengio, Hossein Vahabi, Christina Lioma, Jakob G. Simonsen, Jian-Yun Nie

Our novel hierarchical recurrent encoder-decoder architecture allows the model to be sensitive to the order of queries in the context while avoiding data sparsity.

VGCN-BERT: Augmenting BERT with Graph Embedding for Text Classification

1 code implementation 12 Apr 2020 Zhibin Lu, Pan Du, Jian-Yun Nie

Much progress has been made recently on text classification with methods based on neural networks.

General Classification Graph Embedding +3

An End-to-End Neighborhood-based Interaction Model for Knowledge-enhanced Recommendation

1 code implementation 12 Aug 2019 Yanru Qu, Ting Bai, Wei-Nan Zhang, Jian-Yun Nie, Jian Tang

This paper studies graph-based recommendation, where an interaction graph is constructed from historical records and is leveraged to alleviate data sparsity and cold start problems.

Click-Through Rate Prediction Knowledge Graphs

WebBrain: Learning to Generate Factually Correct Articles for Queries by Grounding on Large Web Corpus

1 code implementation 10 Apr 2023 Hongjing Qian, Yutao Zhu, Zhicheng Dou, Haoqi Gu, Xinyu Zhang, Zheng Liu, Ruofei Lai, Zhao Cao, Jian-Yun Nie, Ji-Rong Wen

In this paper, we introduce a new NLP task -- generating short factual articles with references for queries by mining supporting evidence from the Web.

Retrieval Text Generation

ScriptWriter: Narrative-Guided Script Generation

1 code implementation ACL 2020 Yutao Zhu, Ruihua Song, Zhicheng Dou, Jian-Yun Nie, Jin Zhou

In dialogue systems, it would also be useful to drive dialogues by a dialogue plan.

The Dawn After the Dark: An Empirical Study on Factuality Hallucination in Large Language Models

1 code implementation 6 Jan 2024 Junyi Li, Jie Chen, Ruiyang Ren, Xiaoxue Cheng, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen

To tackle the LLM hallucination, three key questions should be well studied: how to detect hallucinations (detection), why do LLMs hallucinate (source), and what can be done to mitigate them (mitigation).

Hallucination

Neural Sentence Ordering Based on Constraint Graphs

1 code implementation 27 Jan 2021 Yutao Zhu, Kun Zhou, Jian-Yun Nie, Shengchao Liu, Zhicheng Dou

Our experiments on five benchmark datasets show that our method outperforms all the existing baselines significantly, achieving a new state-of-the-art performance.

Sentence Sentence Ordering

Learning to Transfer Prompts for Text Generation

1 code implementation NAACL 2022 Junyi Li, Tianyi Tang, Jian-Yun Nie, Ji-Rong Wen, Wayne Xin Zhao

First, PTG learns a set of source prompts for various source generation tasks and then transfers these prompts as target prompts to perform target generation tasks.

Text Generation

Jointly Optimizing State Operation Prediction and Value Generation for Dialogue State Tracking

2 code implementations 24 Oct 2020 Yan Zeng, Jian-Yun Nie

However, in such a stacked encoder-decoder structure, the operation prediction objective only affects the BERT encoder and the value generation objective mainly affects the RNN decoder.

Dialogue State Tracking Multi-domain Dialogue State Tracking

Contrastive Learning of User Behavior Sequence for Context-Aware Document Ranking

1 code implementation 24 Aug 2021 Yutao Zhu, Jian-Yun Nie, Zhicheng Dou, Zhengyi Ma, Xinyu Zhang, Pan Du, Xiaochen Zuo, Hao Jiang

To learn a more robust representation of the user behavior sequence, we propose a method based on contrastive learning, which takes into account the possible variations in user's behavior sequences.

Contrastive Learning Data Augmentation +1

From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking

1 code implementation 22 Aug 2022 Yutao Zhu, Jian-Yun Nie, Yixuan Su, Haonan Chen, Xinyu Zhang, Zhicheng Dou

In this work, we propose a curriculum learning framework for context-aware document ranking, in which the ranking model learns matching signals between the search context and the candidate document in an easy-to-hard manner.

Document Ranking

ConvGQR: Generative Query Reformulation for Conversational Search

1 code implementation 25 May 2023 Fengran Mo, Kelong Mao, Yutao Zhu, Yihong Wu, Kaiyu Huang, Jian-Yun Nie

In this paper, we propose ConvGQR, a new framework to reformulate conversational queries based on generative pre-trained language models (PLMs), one for query rewriting and another for generating potential answers.

Conversational Search Retrieval

Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals

1 code implementation 18 Jul 2021 Yutao Zhu, Jian-Yun Nie, Kun Zhou, Pan Du, Hao Jiang, Zhicheng Dou

The final response is selected according to the predicted knowledge, the goal to achieve, and the context.

Multi-Task Learning Retrieval

MoqaGPT : Zero-Shot Multi-modal Open-domain Question Answering with Large Language Model

1 code implementation 20 Oct 2023 Le Zhang, Yihong Wu, Fengran Mo, Jian-Yun Nie, Aishwarya Agrawal

To enable LLMs to tackle the task in a zero-shot manner, we introduce MoqaGPT, a straightforward and flexible framework.

Language Modelling Large Language Model +2

The Web Can Be Your Oyster for Improving Large Language Models

1 code implementation 18 May 2023 Junyi Li, Tianyi Tang, Wayne Xin Zhao, Jingyuan Wang, Jian-Yun Nie, Ji-Rong Wen

In order to further improve the capacity of LLMs for knowledge-intensive tasks, we consider augmenting LLMs with the large-scale web using search engine.

Retrieval World Knowledge

Content Selection Network for Document-grounded Retrieval-based Chatbots

1 code implementation 21 Jan 2021 Yutao Zhu, Jian-Yun Nie, Kun Zhou, Pan Du, Zhicheng Dou

It is thus crucial to select the part of document content relevant to the current conversation context.

Retrieval

Collaboration and Transition: Distilling Item Transitions into Multi-Query Self-Attention for Sequential Recommendation

1 code implementation 2 Nov 2023 Tianyu Zhu, Yansong Shi, Yuan Zhang, Yihong Wu, Fengran Mo, Jian-Yun Nie

Second, we develop a transition-aware embedding distillation module that distills global item-to-item transition patterns into item embeddings, which enables the model to memorize and leverage transitional signals and serves as a calibrator for collaborative signals.

Sequential Recommendation

History-Aware Conversational Dense Retrieval

1 code implementation 30 Jan 2024 Fengran Mo, Chen Qu, Kelong Mao, Tianyu Zhu, Zhan Su, Kaiyu Huang, Jian-Yun Nie

To address the aforementioned issues, we propose a History-Aware Conversational Dense Retrieval (HAConvDR) system, which incorporates two ideas: context-denoised query reformulation and automatic mining of supervision signals based on the actual impact of historical turns.

Conversational Search Information Retrieval +1

ConvSDG: Session Data Generation for Conversational Search

1 code implementation 17 Mar 2024 Fengran Mo, Bole Yi, Kelong Mao, Chen Qu, Kaiyu Huang, Jian-Yun Nie

Conversational search provides a more convenient interface for users to search by allowing multi-turn interaction with the search engine.

Conversational Search Retrieval +1

A Long-Short Demands-Aware Model for Next-Item Recommendation

no code implementations 12 Feb 2019 Ting Bai, Pan Du, Wayne Xin Zhao, Ji-Rong Wen, Jian-Yun Nie

Recommending the right products is the central problem in recommender systems, but the right products should also be recommended at the right time to meet the demands of users, so as to maximize their values.

Recommendation Systems

DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases

no code implementations 19 May 2019 Zhiqing Sun, Jian Tang, Pan Du, Zhi-Hong Deng, Jian-Yun Nie

Furthermore, we propose a diversified point network to generate a set of diverse keyphrases out of the word graph in the decoding process.

Document Summarization Information Retrieval +2

Personalizing Search Results Using Hierarchical RNN with Query-aware Attention

no code implementations 20 Aug 2019 Songwei Ge, Zhicheng Dou, Zhengbao Jiang, Jian-Yun Nie, Ji-Rong Wen

Our analysis reveals that the attention model is able to attribute higher weights to more related past sessions after fine training.

Attribute

Multi-Domain Dialogue State Tracking based on State Graph

no code implementations 21 Oct 2020 Yan Zeng, Jian-Yun Nie

Existing approaches usually concatenate previous dialogue state with dialogue history as the input to a bi-directional Transformer encoder.

Dialogue State Tracking Multi-domain Dialogue State Tracking

Meta-Learning for Neural Relation Classification with Distant Supervision

no code implementations 26 Oct 2020 Zhenzhen Li, Jian-Yun Nie, Benyou Wang, Pan Du, Yuhan Zhang, Lixin Zou, Dongsheng Li

Distant supervision provides a means to create a large number of weakly labeled data at low cost for relation classification.

Classification General Classification +3

Open-Domain Dialogue Generation Based on Pre-trained Language Models

no code implementations 24 Oct 2020 Yan Zeng, Jian-Yun Nie

These results show that discrepancies are an important factor to consider when we use a pre-trained model, and a reduction in discrepancies can lead to improved performance.

Dialogue Generation Language Modelling +1

The DELICES project: Indexing scientific literature through semantic expansion

no code implementations 28 Jun 2021 Florian Boudin, Béatrice Daille, Evelyne Jacquey, Jian-Yun Nie

Scientific digital libraries play a critical role in the development and dissemination of scientific literature.

Learning Syntactic Dense Embedding with Correlation Graph for Automatic Readability Assessment

no code implementations ACL 2021 Xinying Qiu, Yuan Chen, Hanwu Chen, Jian-Yun Nie, Yuming Shen, Dawei Lu

Deep learning models for automatic readability assessment generally discard linguistic features traditionally used in machine learning models for the task.

Empirical Study of Multi-level Convolution Models for IR Based on Representations and Interactions

no code implementations 20 Dec 2018 Yifan Nie, Yanling Li, Jian-Yun Nie

Deep learning models have been employed to perform IR tasks and have shown competitive results.

Pretrained Language Models for Text Generation: A Survey

no code implementations 14 Jan 2022 Junyi Li, Tianyi Tang, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen

We begin with introducing three key aspects of applying PLMs to text generation: 1) how to encode the input into representations preserving input semantics which can be fused into PLMs; 2) how to design an effective PLM to serve as the generation model; and 3) how to effectively optimize PLMs given the reference text and to ensure that the generated texts satisfy special text properties.

Text Generation

Self-supervised Context-aware Style Representation for Expressive Speech Synthesis

no code implementations 25 Jun 2022 Yihan Wu, Xi Wang, Shaofei Zhang, Lei He, Ruihua Song, Jian-Yun Nie

In this paper, we propose a novel framework for learning style representation from abundant plain text in a self-supervised manner.

Contrastive Learning Deep Clustering +2

PReGAN: Answer Oriented Passage Ranking with Weakly Supervised GAN

no code implementations 5 Jul 2022 Pan Du, Jian-Yun Nie, Yutao Zhu, Hao Jiang, Lixin Zou, Xiaohui Yan

Beyond topical relevance, passage ranking for open-domain factoid question answering also requires a passage to contain an answer (answerability).

Passage Ranking Question Answering

Tapping the Potential of Coherence and Syntactic Features in Neural Models for Automatic Essay Scoring

no code implementations 24 Nov 2022 Xinying Qiu, Shuxuan Liao, Jiajun Xie, Jian-Yun Nie

In this paper, we propose a novel approach to extract and represent essay coherence features with prompt-learning NSP that shows to match the state-of-the-art AES coherence model, and achieves the best performance for long essays.

CoSPLADE: Contextualizing SPLADE for Conversational Information Retrieval

no code implementations 11 Jan 2023 Nam Le Hai, Thomas Gerald, Thibault Formal, Jian-Yun Nie, Benjamin Piwowarski, Laure Soulier

Conversational search is a difficult task as it aims at retrieving documents based not only on the current user query but also on the full conversation history.

Conversational Search Information Retrieval +2

GlyphDiffusion: Text Generation as Image Generation

no code implementations 25 Apr 2023 Junyi Li, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen

In this way, conditional text generation can be cast as a glyph image generation task, and it is then natural to apply continuous diffusion models to discrete texts.

Conditional Text Generation Glyph Image Generation +2

Word Grounded Graph Convolutional Network

1 code implementation 10 May 2023 Zhibin Lu, Qianqian Xie, Benyou Wang, Jian-Yun Nie

An inductive Word-grounded Graph Convolutional Network (WGCN) is proposed to learn word and document representations based on WGraph in a supervised manner.

Text Classification

Augmenting Ad-Hoc IR Dataset for Interactive Conversational Search

no code implementations 10 Nov 2023 Pierre Erbacher, Jian-Yun Nie, Philippe Preux, Laure Soulier

The only two datasets known to us that contain both document relevance judgments and the associated clarification interactions are Qulac and ClariQ.

Conversational Search

Metacognitive Retrieval-Augmented Large Language Models

1 code implementation 18 Feb 2024 Yujia Zhou, Zheng Liu, Jiajie Jin, Jian-Yun Nie, Zhicheng Dou

Drawing from cognitive psychology, metacognition allows an entity to self-reflect and critically evaluate its cognitive processes.

Response Generation Retrieval
