Search Results for author: Ming Zhou

Found 225 papers, 74 papers with code

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

15 code implementations • 31 Dec 2019 • Yiheng Xu, Minghao Li, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou

In this paper, we propose the \textbf{LayoutLM} to jointly model interactions between text and layout information across scanned document images, which is beneficial for a great number of real-world document image understanding tasks such as information extraction from scanned documents.

Ranked #7 on Relation Extraction on FUNSD

Document AI Document Image Classification +3

124,457

Paper
Code

ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training

4 code implementations • 13 Jan 2020 • Weizhen Qi, Yu Yan, Yeyun Gong, Dayiheng Liu, Nan Duan, Jiusheng Chen, Ruofei Zhang, Ming Zhou

This paper presents a new sequence-to-sequence pre-training model called ProphetNet, which introduces a novel self-supervised objective named future n-gram prediction and the proposed n-stream self-attention mechanism.

Ranked #6 on Question Generation on SQuAD1.1 (using extra training data)

Abstractive Text Summarization Question Generation +1

124,457

Paper
Code

Unified Language Model Pre-training for Natural Language Understanding and Generation

9 code implementations • NeurIPS 2019 • Li Dong, Nan Yang, Wenhui Wang, Furu Wei, Xiaodong Liu, Yu Wang, Jianfeng Gao, Ming Zhou, Hsiao-Wuen Hon

This paper presents a new Unified pre-trained Language Model (UniLM) that can be fine-tuned for both natural language understanding and generation tasks.

Ranked #2 on Generative Question Answering on CoQA (using extra training data)

Abstractive Text Summarization Document Summarization +7

18,262

Paper
Code

MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers

1 code implementation • NeurIPS 2020 • Wenhui Wang, Furu Wei, Li Dong, Hangbo Bao, Nan Yang, Ming Zhou

The small model (student) is trained by deeply mimicking the self-attention module, which plays a vital role in Transformer networks, of the large model (teacher).

Ranked #8 on Zero-shot Text Search on BEIR

Zero-shot Text Search

18,262

Paper
Code

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

3 code implementations • 28 Feb 2020 • Hangbo Bao, Li Dong, Furu Wei, Wenhui Wang, Nan Yang, Xiaodong Liu, Yu Wang, Songhao Piao, Jianfeng Gao, Ming Zhou, Hsiao-Wuen Hon

We propose to pre-train a unified language model for both autoencoding and partially autoregressive language modeling tasks using a novel training procedure, referred to as a pseudo-masked language model (PMLM).

Ranked #4 on Question Generation on SQuAD1.1 (using extra training data)

Abstractive Text Summarization Language Modelling +3

18,262

Paper
Code

Pseudo-Masked Language Models for Unified Language Model Pre-Training

1 code implementation • ICML 2020 • Hangbo Bao, Li Dong, Furu Wei, Wenhui Wang, Nan Yang, Xiaodong Liu, Yu Wang, Jianfeng Gao, Songhao Piao, Ming Zhou, Hsiao-Wuen Hon

Language Modelling Natural Language Understanding +1

18,262

Paper
Code

InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training

4 code implementations • NAACL 2021 • Zewen Chi, Li Dong, Furu Wei, Nan Yang, Saksham Singhal, Wenhui Wang, Xia Song, Xian-Ling Mao, He-Yan Huang, Ming Zhou

In this work, we present an information-theoretic framework that formulates cross-lingual language model pre-training as maximizing mutual information between multilingual-multi-granularity texts.

Ranked #16 on Zero-Shot Cross-Lingual Transfer on XTREME

Contrastive Learning Cross-Lingual Transfer +2

18,261

Paper
Code

MIND: A Large-scale Dataset for News Recommendation

2 code implementations • ACL 2020 • Fangzhao Wu, Ying Qiao, Jiun-Hung Chen, Chuhan Wu, Tao Qi, Jianxun Lian, Danyang Liu, Xing Xie, Jianfeng Gao, Winnie Wu, Ming Zhou

News recommendation is an important technique for personalized news service.

News Recommendation

17,916

Paper
Code

Neural Speech Synthesis with Transformer Network

6 code implementations • 19 Sep 2018 • Naihan Li, Shujie Liu, Yanqing Liu, Sheng Zhao, Ming Liu, Ming Zhou

Although end-to-end neural text-to-speech (TTS) methods (such as Tacotron2) are proposed and achieve state-of-the-art performance, they still suffer from two problems: 1) low efficiency during training and inference; 2) hard to model long dependency using current recurrent neural networks (RNNs).

Ranked #9 on Text-To-Speech Synthesis on LJSpeech (using extra training data)

Machine Translation NMT +2

10,084

Paper
Code

CodeBLEU: a Method for Automatic Evaluation of Code Synthesis

2 code implementations • 22 Sep 2020 • Shuo Ren, Daya Guo, Shuai Lu, Long Zhou, Shujie Liu, Duyu Tang, Neel Sundaresan, Ming Zhou, Ambrosio Blanco, Shuai Ma

Evaluation metrics play a vital role in the growth of an area as it defines the standard of distinguishing between good and bad models.

Code Translation Translation

7,744

Paper
Code

CodeBERT: A Pre-Trained Model for Programming and Natural Languages

8 code implementations • Findings of the Association for Computational Linguistics 2020 • Zhangyin Feng, Daya Guo, Duyu Tang, Nan Duan, Xiaocheng Feng, Ming Gong, Linjun Shou, Bing Qin, Ting Liu, Daxin Jiang, Ming Zhou

Results show that CodeBERT achieves state-of-the-art performance on both natural language code search and code documentation generation tasks.

Ranked #1 on Code Documentation Generation on CodeSearchNet - Go

Code Documentation Generation Code Search +1

2,586

Paper
Code

GraphCodeBERT: Pre-training Code Representations with Data Flow

1 code implementation • ICLR 2021 • Daya Guo, Shuo Ren, Shuai Lu, Zhangyin Feng, Duyu Tang, Shujie Liu, Long Zhou, Nan Duan, Alexey Svyatkovskiy, Shengyu Fu, Michele Tufano, Shao Kun Deng, Colin Clement, Dawn Drain, Neel Sundaresan, Jian Yin, Daxin Jiang, Ming Zhou

Instead of taking syntactic-level structure of code like abstract syntax tree (AST), we use data flow in the pre-training stage, which is a semantic-level structure of code that encodes the relation of "where-the-value-comes-from" between variables.

Ranked #3 on Type prediction on ManyTypes4TypeScript

Clone Detection Code Completion +7

1,971

Paper
Code

UniXcoder: Unified Cross-Modal Pre-training for Code Representation

2 code implementations • ACL 2022 • Daya Guo, Shuai Lu, Nan Duan, Yanlin Wang, Ming Zhou, Jian Yin

Furthermore, we propose to utilize multi-modal contents to learn representation of code fragment with contrastive learning, and then align representations among programming languages using a cross-modal generation task.

Code Completion Code Search +1

1,971

Paper
Code

CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation

4 code implementations • 9 Feb 2021 • Shuai Lu, Daya Guo, Shuo Ren, JunJie Huang, Alexey Svyatkovskiy, Ambrosio Blanco, Colin Clement, Dawn Drain, Daxin Jiang, Duyu Tang, Ge Li, Lidong Zhou, Linjun Shou, Long Zhou, Michele Tufano, Ming Gong, Ming Zhou, Nan Duan, Neel Sundaresan, Shao Kun Deng, Shengyu Fu, Shujie Liu

Benchmark datasets have a significant impact on accelerating research in programming language tasks.

Ranked #1 on Cloze Test on CodeXGLUE - CT-maxmin

BIG-bench Machine Learning Clone Detection +9

1,405

Paper
Code

TableBank: A Benchmark Dataset for Table Detection and Recognition

2 code implementations • LREC 2020 • Minghao Li, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou, Zhoujun Li

We present TableBank, a new image-based table detection and recognition dataset built with novel weak supervision from Word and Latex documents on the internet.

Table Detection

966

Paper
Code

TableBank: Table Benchmark for Image-based Table Detection and Recognition

1 code implementation • LREC 2020 • Minghao Li, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou, Zhoujun Li

We present TableBank, a new image-based table detection and recognition dataset built with novel weak supervision from Word and Latex documents on the internet.

Table Detection

966

Paper
Code

SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving

3 code implementations • 19 Oct 2020 • Ming Zhou, Jun Luo, Julian Villella, Yaodong Yang, David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, Iman Fadakar, Zheng Chen, Aurora Chongxi Huang, Ying Wen, Kimia Hassanzadeh, Daniel Graves, Dong Chen, Zhengbang Zhu, Nhat Nguyen, Mohamed Elsayed, Kun Shao, Sanjeevan Ahilan, Baokuan Zhang, Jiannan Wu, Zhengang Fu, Kasra Rezaee, Peyman Yadmellat, Mohsen Rohani, Nicolas Perez Nieves, Yihan Ni, Seyedershad Banijamali, Alexander Cowen Rivers, Zheng Tian, Daniel Palenicek, Haitham Bou Ammar, Hongbo Zhang, Wulong Liu, Jianye Hao, Jun Wang

We open-source the SMARTS platform and the associated benchmark tasks and evaluation metrics to encourage and empower research on multi-agent learning for autonomous driving.

Autonomous Driving Multi-agent Reinforcement Learning +2

876

Paper
Code

Sequential Matching Network: A New Architecture for Multi-turn Response Selection in Retrieval-based Chatbots

3 code implementations • ACL 2017 • Yu Wu, Wei Wu, Chen Xing, Ming Zhou, Zhoujun Li

Existing work either concatenates utterances in context or matches a response with a highly abstract context vector finally, which may lose relationships among utterances or important contextual information.

Ranked #7 on Conversational Response Selection on RRS

Conversational Response Selection Retrieval

711

Paper
Code

Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese

1 code implementation • 13 Oct 2021 • Zhuosheng Zhang, Hanqing Zhang, Keming Chen, Yuhang Guo, Jingyun Hua, Yulong Wang, Ming Zhou

Although pre-trained models (PLMs) have achieved remarkable improvements in a wide range of NLP tasks, they are expensive in terms of time and resources.

533

Paper
Code

DocBank: A Benchmark Dataset for Document Layout Analysis

2 code implementations • COLING 2020 • Minghao Li, Yiheng Xu, Lei Cui, Shaohan Huang, Furu Wei, Zhoujun Li, Ming Zhou

DocBank is constructed using a simple yet effective way with weak supervision from the \LaTeX{} documents available on the arXiv. com.

Document Layout Analysis

514

Paper
Code

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning

1 code implementation • 5 Jun 2021 • Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Weinan Zhang, Jun Wang

Our framework is comprised of three key components: (1) a centralized task dispatching model, which supports the self-generated tasks and scalable training with heterogeneous policy combinations; (2) a programming architecture named Actor-Evaluator-Learner, which achieves high parallelism for both training and sampling, and meets the evaluation requirement of auto-curriculum learning; (3) a higher-level abstraction of MARL training paradigms, which enables efficient code reuse and flexible deployments on different distributed computing paradigms.

Atari Games Distributed Computing +3

460

Paper
Code

Mean Field Multi-Agent Reinforcement Learning

3 code implementations • ICML 2018 • Yaodong Yang, Rui Luo, Minne Li, Ming Zhou, Wei-Nan Zhang, Jun Wang

Existing multi-agent reinforcement learning methods are limited typically to a small number of agents.

Multi-agent Reinforcement Learning Q-Learning +2

363

Paper
Code

UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation

2 code implementations • 15 Feb 2020 • Huaishao Luo, Lei Ji, Botian Shi, Haoyang Huang, Nan Duan, Tianrui Li, Jason Li, Taroon Bharti, Ming Zhou

However, most of the existing multimodal models are pre-trained for understanding tasks, leading to a pretrain-finetune discrepancy for generation tasks.

Ranked #2 on Action Segmentation on COIN (using extra training data)

Action Segmentation Language Modelling +2

327

Paper
Code

BERT-of-Theseus: Compressing BERT by Progressive Module Replacing

1 code implementation • EMNLP 2020 • Canwen Xu, Wangchunshu Zhou, Tao Ge, Furu Wei, Ming Zhou

Our approach first divides the original BERT into several modules and builds their compact substitutes.

Knowledge Distillation Model Compression

308

Paper
Code

MuTual: A Dataset for Multi-Turn Dialogue Reasoning

1 code implementation • ACL 2020 • Leyang Cui, Yu Wu, Shujie Liu, Yue Zhang, Ming Zhou

Non-task oriented dialogue systems have achieved great success in recent years due to largely accessible conversation data and the development of deep learning techniques.

Task-Oriented Dialogue Systems

269

Paper
Code

K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters

2 code implementations • Findings (ACL) 2021 • Ruize Wang, Duyu Tang, Nan Duan, Zhongyu Wei, Xuanjing Huang, Jianshu ji, Guihong Cao, Daxin Jiang, Ming Zhou

We study the problem of injecting knowledge into large pre-trained models like BERT and RoBERTa.

Ranked #1 on Entity Typing on Open Entity

Dependency Parsing Entity Typing +2

151

Paper
Code

Neural Document Summarization by Jointly Learning to Score and Select Sentences

1 code implementation • ACL 2018 • Qingyu Zhou, Nan Yang, Furu Wei, Shaohan Huang, Ming Zhou, Tiejun Zhao

In this paper, we present a novel end-to-end neural network framework for extractive document summarization by jointly learning to score and select sentences.

Ranked #9 on Extractive Text Summarization on CNN / Daily Mail

Document Summarization Extractive Document Summarization +3

150

Paper
Code

Neural Question Generation from Text: A Preliminary Study

6 code implementations • 6 Apr 2017 • Qingyu Zhou, Nan Yang, Furu Wei, Chuanqi Tan, Hangbo Bao, Ming Zhou

Automatic question generation aims to generate questions from a text passage where the generated questions can be answered by certain sub-spans of the given passage.

Ranked #13 on Question Generation on SQuAD1.1

Position Question Generation +2

142

Paper
Code

Reinforced Mnemonic Reader for Machine Reading Comprehension

3 code implementations • 8 May 2017 • Minghao Hu, Yuxing Peng, Zhen Huang, Xipeng Qiu, Furu Wei, Ming Zhou

In this paper, we introduce the Reinforced Mnemonic Reader for machine reading comprehension tasks, which enhances previous attentive readers in two aspects.

Ranked #17 on Question Answering on SQuAD1.1 dev

Machine Reading Comprehension Question Answering +2

135

Paper
Code

On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective

1 code implementation • 24 Dec 2022 • Ying Wen, Ziyu Wan, Ming Zhou, Shufang Hou, Zhe Cao, Chenyang Le, Jingxiao Chen, Zheng Tian, Weinan Zhang, Jun Wang

The pervasive uncertainty and dynamic nature of real-world environments present significant challenges for the widespread implementation of machine-driven Intelligent Decision-Making (IDM) systems.

Decision Making Image Captioning +2

118

Paper
Code

Topic Aware Neural Response Generation

1 code implementation • 21 Jun 2016 • Chen Xing, Wei Wu, Yu Wu, Jie Liu, YaLou Huang, Ming Zhou, Wei-Ying Ma

We consider incorporating topic information into the sequence-to-sequence framework to generate informative and interesting responses for chatbots.

Response Generation

111

Paper
Code

Continuous Speech Separation with Conformer

1 code implementation • 13 Aug 2020 • Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Jinyu Li, Takuya Yoshioka, Chengyi Wang, Shujie Liu, Ming Zhou

Continuous speech separation plays a vital role in complicated speech related tasks such as conversation transcription.

Ranked #1 on Speech Separation on LibriCSS (using extra training data)

Speech Separation

103

Paper
Code

XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation

2 code implementations • 3 Apr 2020 • Yaobo Liang, Nan Duan, Yeyun Gong, Ning Wu, Fenfei Guo, Weizhen Qi, Ming Gong, Linjun Shou, Daxin Jiang, Guihong Cao, Xiaodong Fan, Ruofei Zhang, Rahul Agrawal, Edward Cui, Sining Wei, Taroon Bharti, Ying Qiao, Jiun-Hung Chen, Winnie Wu, Shuguang Liu, Fan Yang, Daniel Campos, Rangan Majumder, Ming Zhou

In this paper, we introduce XGLUE, a new benchmark dataset that can be used to train large-scale cross-lingual pre-trained models using multilingual and bilingual corpora and evaluate their performance across a diverse set of cross-lingual tasks.

Natural Language Understanding XLM-R

Paper
Code

Compare to The Knowledge: Graph Neural Fake News Detection with External Knowledge

1 code implementation • ACL 2021 • Linmei Hu, Tianchi Yang, Luhao Zhang, Wanjun Zhong, Duyu Tang, Chuan Shi, Nan Duan, Ming Zhou

Specifically, we first construct a \textit{directed heterogeneous document graph} for each news incorporating topics and entities.

Fake News Detection Graph Attention

Paper
Code

Response Generation by Context-aware Prototype Editing

3 code implementations • 19 Jun 2018 • Yu Wu, Furu Wei, Shaohan Huang, Yunli Wang, Zhoujun Li, Ming Zhou

Open domain response generation has achieved remarkable progress in recent years, but sometimes yields short and uninformative responses.

Informativeness Response Generation +1

Paper
Code

Achieving Human Parity on Automatic Chinese to English News Translation

2 code implementations • 15 Mar 2018 • Hany Hassan, Anthony Aue, Chang Chen, Vishal Chowdhary, Jonathan Clark, Christian Federmann, Xuedong Huang, Marcin Junczys-Dowmunt, William Lewis, Mu Li, Shujie Liu, Tie-Yan Liu, Renqian Luo, Arul Menezes, Tao Qin, Frank Seide, Xu Tan, Fei Tian, Lijun Wu, Shuangzhi Wu, Yingce Xia, Dong-dong Zhang, Zhirui Zhang, Ming Zhou

Machine translation has made rapid advances in recent years.

Ranked #3 on Machine Translation on WMT 2017 English-Chinese

Machine Translation Translation

Paper
Code

Document Modeling with Graph Attention Networks for Multi-grained Machine Reading Comprehension

1 code implementation • ACL 2020 • Bo Zheng, Haoyang Wen, Yaobo Liang, Nan Duan, Wanxiang Che, Daxin Jiang, Ming Zhou, Ting Liu

Natural Questions is a new challenging machine reading comprehension benchmark with two-grained answers, which are a long answer (typically a paragraph) and a short answer (one or more entities inside the long answer).

Graph Attention Machine Reading Comprehension +1

Paper
Code

BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation

2 code implementations • NAACL 2022 • Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Jian Yang, Haoyang Huang, Rico Sennrich, Ryan Cotterell, Mrinmaya Sachan, Ming Zhou

Standard automatic metrics, e. g. BLEU, are not reliable for document-level MT evaluation.

Document Level Machine Translation Machine Translation +2

Paper
Code

A Tensorized Transformer for Language Modeling

1 code implementation • NeurIPS 2019 • Xindian Ma, Peng Zhang, Shuai Zhang, Nan Duan, Yuexian Hou, Dawei Song, Ming Zhou

In this paper, based on the ideas of tensor decomposition and parameters sharing, we propose a novel self-attention model (namely Multi-linear attention) with Block-Term Tensor Decomposition (BTD).

Language Modelling Machine Translation +2

Paper
Code

Graph Neural News Recommendation with Unsupervised Preference Disentanglement

1 code implementation • ACL 2020 • Linmei Hu, Siyong Xu, Chen Li, Cheng Yang, Chuan Shi, Nan Duan, Xing Xie, Ming Zhou

Furthermore, the learned representations are disentangled with latent preference factors by a neighborhood routing algorithm, which can enhance expressiveness and interpretability.

Disentanglement News Recommendation

Paper
Code

Hierarchical Recurrent Attention Network for Response Generation

1 code implementation • 25 Jan 2017 • Chen Xing, Wei Wu, Yu Wu, Ming Zhou, YaLou Huang, Wei-Ying Ma

With the word level attention, hidden vectors of a word level encoder are synthesized as utterance vectors and fed to an utterance level encoder to construct hidden representations of the context.

Response Generation

Paper
Code

GLGE: A New General Language Generation Evaluation Benchmark

1 code implementation • Findings (ACL) 2021 • Dayiheng Liu, Yu Yan, Yeyun Gong, Weizhen Qi, Hang Zhang, Jian Jiao, Weizhu Chen, Jie Fu, Linjun Shou, Ming Gong, Pengcheng Wang, Jiusheng Chen, Daxin Jiang, Jiancheng Lv, Ruofei Zhang, Winnie Wu, Ming Zhou, Nan Duan

Multi-task benchmarks such as GLUE and SuperGLUE have driven great progress of pretraining and transfer learning in Natural Language Processing (NLP).

Natural Language Understanding Text Generation +1

Paper
Code

Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEncoder

1 code implementation • ACL 2020 • Daya Guo, Duyu Tang, Nan Duan, Jian Yin, Daxin Jiang, Ming Zhou

Generating inferential texts about an event in different perspectives requires reasoning over different contexts that the event occurs.

Ranked #1 on Common Sense Reasoning on Event2Mind test (BLEU metric)

Common Sense Reasoning Text Generation

Paper
Code

Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text

2 code implementations • Findings (ACL) 2022 • Siyuan Wang, Wanjun Zhong, Duyu Tang, Zhongyu Wei, Zhihao Fan, Daxin Jiang, Ming Zhou, Nan Duan

Logical reasoning of text requires understanding critical logical information in the text and performing inference over them.

Ranked #7 on Reading Comprehension on ReClor

Data Augmentation Logical Reasoning +1

Paper
Code

CoSQA: 20,000+ Web Queries for Code Search and Question Answering

1 code implementation • ACL 2021 • JunJie Huang, Duyu Tang, Linjun Shou, Ming Gong, Ke Xu, Daxin Jiang, Ming Zhou, Nan Duan

Finding codes given natural language query isb eneficial to the productivity of software developers.

Code Search Contrastive Learning +2

Paper
Code

Constraint-Based Question Answering with Knowledge Graph

1 code implementation • COLING 2016 • Junwei Bao, Nan Duan, Zhao Yan, Ming Zhou, Tiejun Zhao

WebQuestions and SimpleQuestions are two benchmark data-sets commonly used in recent knowledge-based question answering (KBQA) work.

Question Answering

Paper
Code

Dialog-to-Action: Conversational Question Answering Over a Large-Scale Knowledge Base

1 code implementation • NeurIPS 2018 • Daya Guo, Duyu Tang, Nan Duan, Ming Zhou, Jian Yin

We present an approach to map utterances in conversation to logical forms, which will be executed on a large-scale knowledge base.

Conversational Question Answering Management +1

Paper
Code

Semantic Mask for Transformer based End-to-End Speech Recognition

1 code implementation • 6 Dec 2019 • Chengyi Wang, Yu Wu, Yujiao Du, Jinyu Li, Shujie Liu, Liang Lu, Shuo Ren, Guoli Ye, Sheng Zhao, Ming Zhou

Attention-based encoder-decoder model has achieved impressive results for both automatic speech recognition (ASR) and text-to-speech (TTS) tasks.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Code

Selective Encoding for Abstractive Sentence Summarization

2 code implementations • ACL 2017 • Qingyu Zhou, Nan Yang, Furu Wei, Ming Zhou

We propose a selective encoding model to extend the sequence-to-sequence framework for abstractive sentence summarization.

Ranked #8 on Text Summarization on DUC 2004 Task 1

Sentence Sentence Summarization

Paper
Code

Reaching Human-level Performance in Automatic Grammatical Error Correction: An Empirical Study

1 code implementation • 3 Jul 2018 • Tao Ge, Furu Wei, Ming Zhou

Neural sequence-to-sequence (seq2seq) approaches have proven to be successful in grammatical error correction (GEC).

Ranked #1 on Grammatical Error Correction on Unrestricted

Grammatical Error Correction Sentence

Paper
Code

Unsupervised Neural Machine Translation with SMT as Posterior Regularization

1 code implementation • 14 Jan 2019 • Shuo Ren, Zhirui Zhang, Shujie Liu, Ming Zhou, Shuai Ma

To address this issue, we introduce phrase based Statistic Machine Translation (SMT) models which are robust to noisy data, as posterior regularizations to guide the training of unsupervised NMT models in the iterative back-translation process.

Ranked #2 on Unsupervised Machine Translation on WMT2014 English-German

NMT Translation +1

Paper
Code

AR-LSAT: Investigating Analytical Reasoning of Text

1 code implementation • 14 Apr 2021 • Wanjun Zhong, Siyuan Wang, Duyu Tang, Zenan Xu, Daya Guo, Jiahai Wang, Jian Yin, Ming Zhou, Nan Duan

Analytical reasoning is an essential and challenging task that requires a system to analyze a scenario involving a set of particular circumstances and perform reasoning over it to make conclusions.

Paper
Code

From LSAT: The Progress and Challenges of Complex Reasoning

1 code implementation • 2 Aug 2021 • Siyuan Wang, Zhongkun Liu, Wanjun Zhong, Ming Zhou, Zhongyu Wei, Zhumin Chen, Nan Duan

Complex reasoning aims to draw a correct inference based on complex rules.

Few-Shot Learning Logical Reasoning +1

Paper
Code

Analytical Reasoning of Text

1 code implementation • Findings (NAACL) 2022 • Wanjun Zhong, Siyuan Wang, Duyu Tang, Zenan Xu, Daya Guo, Yining Chen, Jiahai Wang, Jian Yin, Ming Zhou, Nan Duan

In this paper, we study the challenge of analytical reasoning of text and collect a new dataset consisting of questions from the Law School Admission Test from 1991 to 2016.

Paper
Code

Unsupervised Extractive Summarization by Pre-training Hierarchical Transformers

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Shusheng Xu, Xingxing Zhang, Yi Wu, Furu Wei, Ming Zhou

We also find in experiments that our model is less dependent on sentence positions.

Document Summarization Extractive Document Summarization +4

Paper
Code

BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining

1 code implementation • 31 Dec 2020 • Weizhen Qi, Yeyun Gong, Jian Jiao, Yu Yan, Weizhu Chen, Dayiheng Liu, Kewen Tang, Houqiang Li, Jiusheng Chen, Ruofei Zhang, Ming Zhou, Nan Duan

In this paper, we propose BANG, a new pretraining model to Bridge the gap between Autoregressive (AR) and Non-autoregressive (NAR) Generation.

Dialogue Generation Question Generation +1

Paper
Code

Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space

1 code implementation • EMNLP 2020 • Dayiheng Liu, Yeyun Gong, Jie Fu, Yu Yan, Jiusheng Chen, Jiancheng Lv, Nan Duan, Ming Zhou

In this paper, we propose a novel data augmentation method, referred to as Controllable Rewriting based Question Data Augmentation (CRQDA), for machine reading comprehension (MRC), question generation, and question-answering natural language inference tasks.

Data Augmentation Machine Reading Comprehension +6

Paper
Code

Response Selection with Topic Clues for Retrieval-based Chatbots

1 code implementation • 30 Apr 2016 • Yu Wu, Wei Wu, Zhoujun Li, Ming Zhou

The message vector, the response vector, and the two topic vectors are fed to neural tensors to calculate a matching score.

Retrieval

Paper
Code

Neural Deepfake Detection with Factual Structure of Text

1 code implementation • EMNLP 2020 • Wanjun Zhong, Duyu Tang, Zenan Xu, Ruize Wang, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin

To address this, we propose a graph-based model that utilizes the factual structure of a document for deepfake detection of text.

DeepFake Detection Face Swapping +1

Paper
Code

R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering

1 code implementation • 24 May 2018 • Pan Lu, Lei Ji, Wei zhang, Nan Duan, Ming Zhou, Jianyong Wang

To better utilize semantic knowledge in images, we propose a novel framework to learn visual relation facts for VQA.

Ranked #3 on Visual Question Answering (VQA) on COCO Visual Question Answering (VQA) real images 1.0 multiple choice

Question Answering Relation +3

Paper
Code

LLMaAA: Making Large Language Models as Active Annotators

1 code implementation • 30 Oct 2023 • Ruoyu Zhang, Yanzeng Li, Yongliang Ma, Ming Zhou, Lei Zou

Recently, the superior few-shot performance of large language models (LLMs) has propelled the development of dataset generation, where the training data are solely synthesized from LLMs.

Active Learning named-entity-recognition +2

Paper
Code

Multi-Agent Interactions Modeling with Correlated Policies

1 code implementation • ICLR 2020 • Minghuan Liu, Ming Zhou, Wei-Nan Zhang, Yuzheng Zhuang, Jun Wang, Wulong Liu, Yong Yu

In this paper, we cast the multi-agent interactions modeling problem into a multi-agent imitation learning framework with explicit modeling of correlated policies by approximating opponents' policies, which can recover agents' policies that can regenerate similar interactions.

Imitation Learning

Paper
Code

Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification

1 code implementation • ACL 2014 • Duyu Tang, Furu Wei, Nan Yang, Ming Zhou, Ting Liu, Bing Qin

Classification Feature Engineering +4

Paper
Code

Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts

1 code implementation • 7 May 2021 • Weinan Zhang, Xihuai Wang, Jian Shen, Ming Zhou

We specify the dynamics sample complexity and the opponent sample complexity in MARL, and conduct a theoretic analysis of return discrepancy upper bound.

Multi-agent Reinforcement Learning Reinforcement Learning (RL)

Paper
Code

Multiway Attention Networks for Modeling Sentence Pairs

1 code implementation • IJCAI 2018 • Chuanqi Tan, Furu Wei, Wenhui Wang, Weifeng Lv, Ming Zhou

Modeling sentence pairs plays the vital role for judging the relationship between two sentences, such as paraphrase identification, natural language inference, and answer sentence selection.

Ranked #11 on Paraphrase Identification on Quora Question Pairs (Accuracy metric)

Natural Language Inference Paraphrase Identification +1

Paper
Code

ProQA: Structural Prompt-based Pre-training for Unified Question Answering

1 code implementation • NAACL 2022 • Wanjun Zhong, Yifan Gao, Ning Ding, Yujia Qin, Zhiyuan Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan

Furthermore, ProQA exhibits strong ability in both continual learning and transfer learning by taking the advantages of the structural prompt.

Continual Learning Few-Shot Learning +2

Paper
Code

Learning to Ask Conversational Questions by Optimizing Levenshtein Distance

1 code implementation • ACL 2021 • Zhongkun Liu, Pengjie Ren, Zhumin Chen, Zhaochun Ren, Maarten de Rijke, Ming Zhou

Conversational Question Simplification (CQS) aims to simplify self-contained questions into conversational ones by incorporating some conversational characteristics, e. g., anaphora and ellipsis.

Paper
Code

Pre-Training a Graph Recurrent Network for Language Representation

1 code implementation • 8 Sep 2022 • Yile Wang, Linyi Yang, Zhiyang Teng, Ming Zhou, Yue Zhang

Transformer-based pre-trained models have gained much advance in recent years, becoming one of the most important backbones in natural language processing.

Language Modelling Sentence +2

Paper
Code

BERT-based Lexical Substitution

1 code implementation • ACL 2019 • Wangchunshu Zhou, Tao Ge, Ke Xu, Furu Wei, Ming Zhou

Our approach first applies dropout to the target word{'}s embedding for partially masking the word, allowing BERT to take balanced consideration of the target word{'}s semantics and contexts for proposing substitute candidates, and then validates the candidates based on their substitution{'}s influence on the global contextualized representation of the sentence.

Sentence

Paper
Code

Instance Regularization for Discriminative Language Model Pre-training

1 code implementation • 11 Oct 2022 • Zhuosheng Zhang, Hai Zhao, Ming Zhou

They treat training instances equally throughout the training process, with little attention on the individual contribution of those instances.

Denoising Language Modelling +2

Paper
Code

Improving Grammatical Error Correction with Machine Translation Pairs

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Wangchunshu Zhou, Tao Ge, Chang Mu, Ke Xu, Furu Wei, Ming Zhou

The poor translation model resembles the ESL (English as a second language) learner and tends to generate translations of low quality in terms of fluency and grammatical correctness, while the good translation model generally generates fluent and grammatically correct translations.

Grammatical Error Correction Language Modelling +3

Paper
Code

A Retrieve-and-Rewrite Initialization Method for Unsupervised Machine Translation

1 code implementation • ACL 2020 • Shuo Ren, Yu Wu, Shujie Liu, Ming Zhou, Shuai Ma

The commonly used framework for unsupervised machine translation builds initial translation models of both translation directions, and then performs iterative back-translation to jointly boost their translation performance.

NMT Retrieval +3

Paper
Code

Sequential Copying Networks

1 code implementation • 6 Jul 2018 • Qingyu Zhou, Nan Yang, Furu Wei, Ming Zhou

Copying mechanism shows effectiveness in sequence-to-sequence based neural network models for text generation tasks, such as abstractive sentence summarization and question generation.

Question Generation Question-Generation +3

Paper
Code

Scheduled DropHead: A Regularization Method for Transformer Models

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Wangchunshu Zhou, Tao Ge, Ke Xu, Furu Wei, Ming Zhou

In this paper, we introduce DropHead, a structured dropout method specifically designed for regularizing the multi-head attention mechanism, which is a key component of transformer, a state-of-the-art model for various NLP tasks.

Machine Translation text-classification +2

Paper
Code

Table-to-Text: Describing Table Region with Natural Language

no code implementations • 29 May 2018 • Junwei Bao, Duyu Tang, Nan Duan, Zhao Yan, Yuanhua Lv, Ming Zhou, Tiejun Zhao

The model maps a row from a table to a continuous vector and then generates a natural language sentence by leveraging the semantics of a table.

Language Modelling Sentence

Paper
Add Code

Exponential convergence rates for Batch Normalization: The power of length-direction decoupling in non-convex optimization

no code implementations • 27 May 2018 • Jonas Kohler, Hadi Daneshmand, Aurelien Lucchi, Ming Zhou, Klaus Neymeyr, Thomas Hofmann

Normalization techniques such as Batch Normalization have been applied successfully for training deep neural networks.

Paper
Add Code

Triangular Architecture for Rare Language Translation

no code implementations • ACL 2018 • Shuo Ren, Wenhu Chen, Shujie Liu, Mu Li, Ming Zhou, Shuai Ma

Neural Machine Translation (NMT) performs poor on the low-resource language pair $(X, Z)$, especially when $Z$ is a rare language.

Machine Translation NMT +1

Paper
Add Code

Neural Open Information Extraction

no code implementations • ACL 2018 • Lei Cui, Furu Wei, Ming Zhou

Conventional Open Information Extraction (Open IE) systems are usually built on hand-crafted patterns from other NLP tools such as syntactic parsing, yet they face problems of error propagation.

Computational Efficiency Open Information Extraction

Paper
Add Code

Learning Matching Models with Weak Supervision for Response Selection in Retrieval-based Chatbots

no code implementations • ACL 2018 • Yu Wu, Wei Wu, Zhoujun Li, Ming Zhou

We propose a method that can leverage unlabeled data to learn a matching model for response selection in retrieval-based chatbots.

Retrieval

Paper
Add Code

Semantic Parsing with Syntax- and Table-Aware SQL Generation

no code implementations • ACL 2018 • Yibo Sun, Duyu Tang, Nan Duan, Jianshu ji, Guihong Cao, Xiaocheng Feng, Bing Qin, Ting Liu, Ming Zhou

We present a generative model to map natural language questions into SQL queries.

Ranked #4 on Code Generation on WikiSQL

Semantic Parsing

Paper
Add Code

Generative Bridging Network in Neural Sequence Prediction

no code implementations • 28 Jun 2017 • Wenhu Chen, Guanlin Li, Shuo Ren, Shujie Liu, Zhirui Zhang, Mu Li, Ming Zhou

In order to alleviate data sparsity and overfitting problems in maximum likelihood estimation (MLE) for sequence prediction tasks, we propose the Generative Bridging Network (GBN), in which a novel bridge module is introduced to assist the training of the sequence prediction model (the generator network).

Abstractive Text Summarization Language Modelling +2

Paper
Add Code

Joint Training for Neural Machine Translation Models with Monolingual Data

no code implementations • 1 Mar 2018 • Zhirui Zhang, Shujie Liu, Mu Li, Ming Zhou, Enhong Chen

Monolingual data have been demonstrated to be helpful in improving translation quality of both statistical machine translation (SMT) systems and neural machine translation (NMT) systems, especially in resource-poor or domain adaptation tasks where parallel data are not rich enough.

Domain Adaptation Machine Translation +2

Paper
Add Code

Assertion-based QA with Question-Aware Open Information Extraction

no code implementations • 23 Jan 2018 • Zhao Yan, Duyu Tang, Nan Duan, Shujie Liu, Wendi Wang, Daxin Jiang, Ming Zhou, Zhoujun Li

We present assertion based question answering (ABQA), an open domain question answering task that takes a question and a passage as inputs, and outputs a semi-structured assertion consisting of a subject, a predicate and a list of arguments.

Learning-To-Rank Open-Domain Question Answering +2

Paper
Add Code

S-Net: From Answer Extraction to Answer Generation for Machine Reading Comprehension

no code implementations • 15 Jun 2017 • Chuanqi Tan, Furu Wei, Nan Yang, Bowen Du, Weifeng Lv, Ming Zhou

We build the answer extraction model with state-of-the-art neural networks for single passage reading comprehension, and propose an additional task of passage ranking to help answer extraction in multiple passages.

Answer Generation Machine Reading Comprehension +1

Paper
Add Code

Neural Response Generation with Dynamic Vocabularies

no code implementations • 30 Nov 2017 • Yu Wu, Wei Wu, Dejian Yang, Can Xu, Zhoujun Li, Ming Zhou

We study response generation for open domain conversation in chatbots.

Response Generation

Paper
Add Code

A Sequential Matching Framework for Multi-turn Response Selection in Retrieval-based Chatbots

no code implementations • CL 2019 • Yu Wu, Wei Wu, Chen Xing, Can Xu, Zhoujun Li, Ming Zhou

The task requires matching a response candidate with a conversation context, whose challenges include how to recognize important parts of the context, and how to model the relationships among utterances in the context.

Retrieval

Paper
Add Code

Question Answering and Question Generation as Dual Tasks

no code implementations • 7 Jun 2017 • Duyu Tang, Nan Duan, Tao Qin, Zhao Yan, Ming Zhou

On one side, the QA model judges whether the generated question of a QG model is relevant to the answer.

Question Answering Question Generation +1

Paper
Add Code

Content-Based Table Retrieval for Web Queries

no code implementations • 8 Jun 2017 • Zhao Yan, Duyu Tang, Nan Duan, Junwei Bao, Yuanhua Lv, Ming Zhou, Zhoujun Li

Understanding the connections between unstructured text and semi-structured table is an important yet neglected problem in natural language processing.

Table Retrieval

Paper
Add Code

Entity Linking for Queries by Searching Wikipedia Sentences

no code implementations • EMNLP 2017 • Chuanqi Tan, Furu Wei, Pengjie Ren, Weifeng Lv, Ming Zhou

The key idea is to search sentences similar to a query from Wikipedia articles and directly use the human-annotated entities in the similar sentences as candidate entities for the query.

Entity Linking Word Embeddings

Paper
Add Code

Knowledge Enhanced Hybrid Neural Network for Text Matching

no code implementations • 15 Nov 2016 • Yu Wu, Wei Wu, Zhoujun Li, Ming Zhou

Long text brings a big challenge to semantic matching due to their complicated semantic and syntactic structures.

Question Answering Text Matching

Paper
Add Code

Detecting Context Dependent Messages in a Conversational Environment

no code implementations • COLING 2016 • Chaozhuo Li, Yu Wu, Wei Wu, Chen Xing, Zhoujun Li, Ming Zhou

While automatic response generation for building chatbot systems has drawn a lot of attention recently, there is limited understanding on when we need to consider the linguistic context of an input text in the generation process.

Chatbot Response Generation

Paper
Add Code

Aligning Coordinated Text Streams through Burst Information Network Construction and Decipherment

no code implementations • 27 Sep 2016 • Tao Ge, Qing Dou, Xiaoman Pan, Heng Ji, Lei Cui, Baobao Chang, Zhifang Sui, Ming Zhou

We introduce a novel Burst Information Network (BINet) representation that can display the most important information and illustrate the connections among bursty entities, events and keywords in the corpus.

Decipherment Translation

Paper
Add Code

Unsupervised Word and Dependency Path Embeddings for Aspect Term Extraction

no code implementations • 25 May 2016 • Yichun Yin, Furu Wei, Li Dong, Kaimeng Xu, Ming Zhang, Ming Zhou

In this paper, we develop a novel approach to aspect term extraction based on unsupervised learning of distributed representations of words and dependency paths.

Term Extraction

Paper
Add Code

Implicit Distortion and Fertility Models for Attention-based Encoder-Decoder NMT Model

no code implementations • 13 Jan 2016 • Shi Feng, Shujie Liu, Mu Li, Ming Zhou

Aiming to resolve these problems, we propose new variations of attention-based encoder-decoder and compare them with other models on machine translation.

Attribute Image Captioning +5

Paper
Add Code

Jointly Modeling Topics and Intents with Global Order Structure

no code implementations • 7 Dec 2015 • Bei Chen, Jun Zhu, Nan Yang, Tian Tian, Ming Zhou, Bo Zhang

Modeling document structure is of great importance for discourse analysis and related applications.

Paper
Add Code

TGSum: Build Tweet Guided Multi-Document Summarization Dataset

no code implementations • 26 Nov 2015 • Ziqiang Cao, Chengyao Chen, Wenjie Li, Sujian Li, Furu Wei, Ming Zhou

Both informativeness and readability of the collected summaries are verified by manual judgment.

Document Summarization Informativeness +2

Paper
Add Code

A Dependency-Based Neural Network for Relation Classification

no code implementations • IJCNLP 2015 • Yang Liu, Furu Wei, Sujian Li, Heng Ji, Ming Zhou, Houfeng Wang

Previous research on relation classification has verified the effectiveness of using dependency shortest paths or subtrees.

Ranked #5 on Relation Classification on SemEval 2010 Task 8

Classification General Classification +2

Paper
Add Code

Multi-Document Summarization via Discriminative Summary Reranking

no code implementations • 8 Jul 2015 • Xiaojun Wan, Ziqiang Cao, Furu Wei, Sujian Li, Ming Zhou

However, according to our quantitative analysis, none of the existing summarization models can always produce high-quality summaries for different document sets, and even a summarization model with good overall performance may produce low-quality summaries for some document sets.

Document Summarization Multi-Document Summarization +1

Paper
Add Code

A Statistical Parsing Framework for Sentiment Classification

no code implementations • CL 2015 • Li Dong, Furu Wei, Shujie Liu, Ming Zhou, Ke Xu

Unlike previous works that employ syntactic parsing results for sentiment analysis, we develop a statistical parser to directly analyze the sentiment structure of a sentence.

Classification General Classification +4

Paper
Add Code

Beyond Word-based Language Model in Statistical Machine Translation

no code implementations • 5 Feb 2015 • Jiajun Zhang, Shujie Liu, Mu Li, Ming Zhou, Cheng-qing Zong

Language model is one of the most important modules in statistical machine translation and currently the word-based language model dominants this community.

Language Modelling Machine Translation +1

Paper
Add Code

Dictionary-Guided Editing Networks for Paraphrase Generation

no code implementations • 21 Jun 2018 • Shaohan Huang, Yu Wu, Furu Wei, Ming Zhou

An intuitive way for a human to write paraphrase sentences is to replace words or phrases in the original sentence with their corresponding synonyms and make necessary changes to ensure the new sentences are fluent and grammatically correct.

Paraphrase Generation Sentence

Paper
Add Code

Regularizing Neural Machine Translation by Target-bidirectional Agreement

no code implementations • 13 Aug 2018 • Zhirui Zhang, Shuangzhi Wu, Shujie Liu, Mu Li, Ming Zhou, Tong Xu

Although Neural Machine Translation (NMT) has achieved remarkable progress in the past several years, most NMT systems still suffer from a fundamental shortcoming as in other sequence generation tasks: errors made early in generation process are fed as inputs to the model and can be quickly amplified, harming subsequent sequence generation.

Machine Translation NMT +1

Paper
Add Code

Question Generation from SQL Queries Improves Neural Semantic Parsing

no code implementations • EMNLP 2018 • Daya Guo, Yibo Sun, Duyu Tang, Nan Duan, Jian Yin, Hong Chi, James Cao, Peng Chen, Ming Zhou

We study how to learn a semantic parser of state-of-the-art accuracy with less supervised training data.

Question Generation Question-Generation +1

Paper
Add Code

Neural Latent Extractive Document Summarization

no code implementations • EMNLP 2018 • Xingxing Zhang, Mirella Lapata, Furu Wei, Ming Zhou

Extractive summarization models require sentence-level labels, which are usually created heuristically (e. g., with rule-based methods) given that most summarization datasets only have document-summary pairs.

Ranked #11 on Extractive Text Summarization on CNN / Daily Mail

Document Summarization Extractive Document Summarization +3

Paper
Add Code

Attention-Guided Answer Distillation for Machine Reading Comprehension

no code implementations • EMNLP 2018 • Minghao Hu, Yuxing Peng, Furu Wei, Zhen Huang, Dongsheng Li, Nan Yang, Ming Zhou

Despite that current reading comprehension systems have achieved significant advancements, their promising performances are often obtained at the cost of making an ensemble of numerous models.

Knowledge Distillation Machine Reading Comprehension

Paper
Add Code

Approximate Distribution Matching for Sequence-to-Sequence Learning

no code implementations • 24 Aug 2018 • Wenhu Chen, Guanlin Li, Shujie Liu, Zhirui Zhang, Mu Li, Ming Zhou

Then, we interpret sequence-to-sequence learning as learning a transductive model to transform the source local latent distributions to match their corresponding target distributions.

Image Captioning Machine Translation +1

Paper
Add Code

Style Transfer as Unsupervised Machine Translation

no code implementations • 23 Aug 2018 • Zhirui Zhang, Shuo Ren, Shujie Liu, Jianyong Wang, Peng Chen, Mu Li, Ming Zhou, Enhong Chen

Language style transferring rephrases text with specific stylistic attributes while preserving the original attribute-independent content.

Ranked #3 on Unsupervised Text Style Transfer on GYAFC

Attribute NMT +4

Paper
Add Code

Neural Melody Composition from Lyrics

no code implementations • 12 Sep 2018 • Hangbo Bao, Shaohan Huang, Furu Wei, Lei Cui, Yu Wu, Chuanqi Tan, Songhao Piao, Ming Zhou

In this paper, we study a novel task that learns to compose music from natural language.

Paper
Add Code

Text Morphing

no code implementations • 30 Sep 2018 • Shaohan Huang, Yu Wu, Furu Wei, Ming Zhou

In this paper, we introduce a novel natural language generation task, termed as text morphing, which targets at generating the intermediate sentences that are fluency and smooth with the two input sentences.

Sentence Text Generation

Paper
Add Code

Improving Question Answering by Commonsense-Based Pre-Training

no code implementations • 5 Sep 2018 • Wanjun Zhong, Duyu Tang, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin

Although neural network approaches achieve remarkable success on a variety of NLP tasks, many of them struggle to answer questions that require commonsense knowledge.

Question Answering

Paper
Add Code

Fine-grained Coordinated Cross-lingual Text Stream Alignment for Endless Language Knowledge Acquisition

no code implementations • EMNLP 2018 • Tao Ge, Qing Dou, Heng Ji, Lei Cui, Baobao Chang, Zhifang Sui, Furu Wei, Ming Zhou

This paper proposes to study fine-grained coordinated cross-lingual text stream alignment through a novel information network decipherment paradigm.

Decipherment Information Retrieval

Paper
Add Code

Bidirectional Generative Adversarial Networks for Neural Machine Translation

no code implementations • CONLL 2018 • Zhirui Zhang, Shujie Liu, Mu Li, Ming Zhou, Enhong Chen

To address this issue and stabilize the GAN training, in this paper, we propose a novel Bidirectional Generative Adversarial Network for Neural Machine Translation (BGAN-NMT), which aims to introduce a generator model to act as the discriminator, whereby the discriminator naturally considers the entire translation space so that the inadequate training problem can be alleviated.

Generative Adversarial Network Language Modelling +4

Paper
Add Code

Fluency Boost Learning and Inference for Neural Grammatical Error Correction

no code implementations • ACL 2018 • Tao Ge, Furu Wei, Ming Zhou

Most of the neural sequence-to-sequence (seq2seq) models for grammatical error correction (GEC) have two limitations: (1) a seq2seq model may not be well generalized with only limited error-corrected data; (2) a seq2seq model may fail to completely correct a sentence with multiple errors through normal seq2seq inference.

Grammatical Error Correction Sentence

Paper
Add Code

Gated Self-Matching Networks for Reading Comprehension and Question Answering

no code implementations • ACL 2017 • Wenhui Wang, Nan Yang, Furu Wei, Baobao Chang, Ming Zhou

We first match the question and passage with gated attention-based recurrent networks to obtain the question-aware passage representation.

Ranked #35 on Question Answering on SQuAD1.1 dev

Question Answering Reading Comprehension

Paper
Add Code

Sequence-to-Dependency Neural Machine Translation

no code implementations • ACL 2017 • Shuangzhi Wu, Dong-dong Zhang, Nan Yang, Mu Li, Ming Zhou

Nowadays a typical Neural Machine Translation (NMT) model generates translations from left to right as a linear sequence, during which latent syntactic structures of the target sentences are not explicitly concerned.

Machine Translation NMT +1

Paper
Add Code

Chunk-based Decoder for Neural Machine Translation

no code implementations • ACL 2017 • Shonosuke Ishiwatari, JingTao Yao, Shujie Liu, Mu Li, Ming Zhou, Naoki Yoshinaga, Masaru Kitsuregawa, Weijia Jia

The chunk-level decoder models global dependencies while the word-level decoder decides the local word order in a chunk.

Machine Translation NMT +2

Paper
Add Code

SuperAgent: A Customer Service Chatbot for E-commerce Websites

no code implementations • ACL 2017 • Lei Cui, Shaohan Huang, Furu Wei, Chuanqi Tan, Chaoqun Duan, Ming Zhou

Chatbot Opinion Mining +1

Paper
Add Code

DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents

no code implementations • ACL 2016 • Zhao Yan, Nan Duan, Junwei Bao, Peng Chen, Ming Zhou, Zhoujun Li, Jianshe Zhou

Chatbot Community Question Answering +6

Paper
Add Code

Knowledge-Based Semantic Embedding for Machine Translation

no code implementations • ACL 2016 • Chen Shi, Shujie Liu, Shuo Ren, Shi Feng, Mu Li, Ming Zhou, Xu sun, Houfeng Wang

Machine Translation Translation

Paper
Add Code

Learning to Generate Product Reviews from Attributes

no code implementations • EACL 2017 • Li Dong, Shaohan Huang, Furu Wei, Mirella Lapata, Ming Zhou, Ke Xu

This paper presents an attention-enhanced attribute-to-sequence model to generate product reviews for given attribute information, such as user, product, and rating.

Attribute Review Generation +2

Paper
Add Code

Learning to Collaborate for Question Answering and Asking

no code implementations • NAACL 2018 • Duyu Tang, Nan Duan, Zhao Yan, Zhirui Zhang, Yibo Sun, Shujie Liu, Yuanhua Lv, Ming Zhou

Secondly, directly applying GAN that regards all the generated questions as negative instances could not improve the accuracy of the QA model.

Answer Selection Generative Adversarial Network +2

Paper
Add Code

Generative Bridging Network for Neural Sequence Prediction

no code implementations • NAACL 2018 • Wenhu Chen, Guanlin Li, Shuo Ren, Shujie Liu, Zhirui Zhang, Mu Li, Ming Zhou

Abstractive Text Summarization Image Captioning +5

Paper
Add Code

Beihang-MSRA at SemEval-2017 Task 3: A Ranking System with Neural Matching Features for Community Question Answering

no code implementations • SEMEVAL 2017 • Wenzheng Feng, Yu Wu, Wei Wu, Zhoujun Li, Ming Zhou

This paper presents the system in SemEval-2017 Task 3, Community Question Answering (CQA).

Answer Selection Community Question Answering +4

Paper
Add Code

Question Generation for Question Answering

no code implementations • EMNLP 2017 • Nan Duan, Duyu Tang, Peng Chen, Ming Zhou

This paper presents how to generate questions from given passages using neural networks, where large scale QA pairs are automatically crawled and processed from Community-QA website, and used as training data.

Chatbot Question Answering +5

Paper
Add Code

Stack-based Multi-layer Attention for Transition-based Dependency Parsing

no code implementations • EMNLP 2017 • Zhirui Zhang, Shujie Liu, Mu Li, Ming Zhou, Enhong Chen

Although sequence-to-sequence (seq2seq) network has achieved significant success in many NLP tasks such as machine translation and text summarization, simply applying this approach to transition-based dependency parsing cannot yield a comparable performance gain as in other state-of-the-art methods, such as stack-LSTM and head selection.

Language Modelling Machine Translation +3

Paper
Add Code

News Stream Summarization using Burst Information Networks

no code implementations • EMNLP 2016 • Tao Ge, Lei Cui, Baobao Chang, Sujian Li, Ming Zhou, Zhifang Sui

Document Summarization Multi-Document Summarization

Paper
Add Code

Solving and Generating Chinese Character Riddles

no code implementations • EMNLP 2016 • Chuanqi Tan, Furu Wei, Li Dong, Weifeng Lv, Ming Zhou

Paper
Add Code

A Redundancy-Aware Sentence Regression Framework for Extractive Summarization

no code implementations • COLING 2016 • Pengjie Ren, Furu Wei, Zhumin Chen, Jun Ma, Ming Zhou

Existing sentence regression methods for extractive summarization usually model sentence importance and redundancy in two separate processes.

Document Summarization Extractive Summarization +3

Paper
Add Code

Improving Attention Modeling with Implicit Distortion and Fertility for Machine Translation

no code implementations • COLING 2016 • Shi Feng, Shujie Liu, Nan Yang, Mu Li, Ming Zhou, Kenny Q. Zhu

In neural machine translation, the attention mechanism facilitates the translation process by producing a soft alignment between the source sentence and the target sentence.

Machine Translation Sentence +1

Paper
Add Code

Event Detection with Burst Information Networks

no code implementations • COLING 2016 • Tao Ge, Lei Cui, Baobao Chang, Zhifang Sui, Ming Zhou

Retrospective event detection is an important task for discovering previously unidentified events in a text stream.

Clustering Event Detection

Paper
Add Code

EventWiki: A Knowledge Base of Major Events

no code implementations • LREC 2018 • Tao Ge, Lei Cui, Baobao Chang, Zhifang Sui, Furu Wei, Ming Zhou

Question Answering Semantic Parsing

Paper
Add Code

Cross-lingual Sentiment Lexicon Learning With Bilingual Word Graph Label Propagation

no code implementations • CL 2015 • Dehong Gao, Furu Wei, Wenjie Li, Xiaohua Liu, Ming Zhou

Sentiment Analysis Word Alignment

Paper
Add Code

Question Answering over Freebase with Multi-Column Convolutional Neural Networks

no code implementations • IJCNLP 2015 • Li Dong, Furu Wei, Ming Zhou, Ke Xu

Multi-Task Learning Question Answering +2

Paper
Add Code

Efficient Disfluency Detection with Transition-based Parsing

no code implementations • IJCNLP 2015 • Shuangzhi Wu, Dong-dong Zhang, Ming Zhou, Tiejun Zhao

Machine Translation Speech Recognition

Paper
Add Code

Learning Summary Prior Representation for Extractive Summarization

no code implementations • IJCNLP 2015 • Ziqiang Cao, Furu Wei, Sujian Li, Wenjie Li, Ming Zhou, Houfeng Wang

Extractive Summarization Feature Engineering

Paper
Add Code

Bilingually-constrained Phrase Embeddings for Machine Translation

no code implementations • ACL 2014 • Jiajun Zhang, Shujie Liu, Mu Li, Ming Zhou, Cheng-qing Zong

Language Modelling Machine Translation +4

Paper
Add Code

Learning Topic Representation for SMT with Neural Networks

no code implementations • ACL 2014 • Lei Cui, Dong-dong Zhang, Shujie Liu, Qiming Chen, Mu Li, Ming Zhou, Muyun Yang

Domain Adaptation Machine Translation

Paper
Add Code

Knowledge-Based Question Answering as Machine Translation

no code implementations • ACL 2014 • Junwei Bao, Nan Duan, Ming Zhou, Tiejun Zhao

Machine Translation Question Answering +2

Paper
Add Code

A Recursive Recurrent Neural Network for Statistical Machine Translation

no code implementations • ACL 2014 • Shujie Liu, Nan Yang, Mu Li, Ming Zhou

Chunking Language Modelling +7

Paper
Add Code

Adaptive Recursive Neural Network for Target-dependent Twitter Sentiment Classification

no code implementations • ACL 2014 • Li Dong, Furu Wei, Chuanqi Tan, Duyu Tang, Ming Zhou, Ke Xu

Classification Dependency Parsing +4

Paper
Add Code

Word Alignment Modeling with Context Dependent Deep Neural Network

no code implementations • ACL 2013 • Nan Yang, Shujie Liu, Mu Li, Ming Zhou, Nenghai Yu

Speech Recognition Word Alignment

Paper
Add Code

Entity Linking for Tweets

no code implementations • ACL 2013 • Xiaohua Liu, Yitong Li, Haocheng Wu, Ming Zhou, Furu Wei, Yi Lu

Entity Linking Entity Resolution +2

Paper
Add Code

Machine Translation Detection from Monolingual Web-Text

no code implementations • ACL 2013 • Yuki Arase, Ming Zhou

Machine Translation Translation

Paper
Add Code

Learning Entity Representation for Entity Disambiguation

no code implementations • ACL 2013 • Zhengyan He, Shujie Liu, Mu Li, Ming Zhou, Longkai Zhang, Houfeng Wang

Denoising Entity Disambiguation +1

Paper
Add Code

Paraphrasing Adaptation for Web Search Ranking

no code implementations • ACL 2013 • Chenguang Wang, Nan Duan, Ming Zhou, Ming Zhang

Machine Translation Paraphrase Generation

Paper
Add Code

Bilingual Data Cleaning for SMT using Graph-based Random Walk

no code implementations • ACL 2013 • Lei Cui, Dong-dong Zhang, Shujie Liu, Mu Li, Ming Zhou

Machine Translation Word Alignment

Paper
Add Code

Learning Translation Consensus with Structured Label Propagation

no code implementations • ACL 2012 • Shujie Liu, Chi-Ho Li, Mu Li, Ming Zhou

Machine Translation Translation

Paper
Add Code

Joint Inference of Named Entity Recognition and Normalization for Tweets

no code implementations • ACL 2012 • Xiaohua Liu, Ming Zhou, Xiangyang Zhou, Zhongyang Fu, Furu Wei

named-entity-recognition Named Entity Recognition +1

Paper
Add Code

Cross-Lingual Mixture Model for Sentiment Classification

no code implementations • ACL 2012 • Xinfan Meng, Furu Wei, Xiaohua Liu, Ming Zhou, Ge Xu, Houfeng Wang

Classification General Classification +4

Paper
Add Code

Joint Learning of a Dual SMT System for Paraphrase Generation

no code implementations • ACL 2012 • Hong Sun, Ming Zhou

Machine Translation Paraphrase Generation +1

Paper
Add Code

Translation Model Size Reduction for Hierarchical Phrase-based Statistical Machine Translation

no code implementations • ACL 2012 • Seung-Wook Lee, Dong-dong Zhang, Mu Li, Ming Zhou, Hae-Chang Rim

Machine Translation Translation +1

Paper
Add Code

QuickView: NLP-based Tweet Search

no code implementations • ACL 2012 • Xiaohua Liu, Furu Wei, Ming Zhou, QuickView Team Microsoft

Named Entity Recognition (NER) Semantic Role Labeling +1

Paper
Add Code

Splusplus: A Feature-Rich Two-stage Classifier for Sentiment Analysis of Tweets

no code implementations • SEMEVAL 2015 • Li Dong, Furu Wei, Yichun Yin, Ming Zhou, Ke Xu

Sentiment Analysis

Paper
Add Code

Coooolll: A Deep Learning System for Twitter Sentiment Classification

no code implementations • SEMEVAL 2014 • Duyu Tang, Furu Wei, Bing Qin, Ting Liu, Ming Zhou

Classification General Classification +3

Paper
Add Code

Hierarchical Recurrent Neural Network for Document Modeling

no code implementations • EMNLP 2015 • Rui Lin, Shujie Liu, Muyun Yang, Mu Li, Ming Zhou, Sheng Li

Chunking Language Modelling +3

Paper
Add Code

A Joint Segmentation and Classification Framework for Sentiment Analysis

no code implementations • EMNLP 2014 • Duyu Tang, Furu Wei, Bing Qin, Li Dong, Ting Liu, Ming Zhou

Classification General Classification +3

Paper
Add Code

Joint Relational Embeddings for Knowledge-based Question Answering

no code implementations • EMNLP 2014 • Min-Chul Yang, Nan Duan, Ming Zhou, Hae-Chang Rim

Named Entity Recognition (NER) Question Answering +1

Paper
Add Code

Efficient Collective Entity Linking with Stacking

no code implementations • EMNLP 2013 • Zhengyan He, Shujie Liu, Yang song, Mu Li, Ming Zhou, Houfeng Wang

Entity Linking Learning-To-Rank +1

Paper
Add Code

Multi-Domain Adaptation for SMT Using Multi-Task Learning

no code implementations • EMNLP 2013 • Lei Cui, Xilun Chen, Dong-dong Zhang, Shujie Liu, Mu Li, Ming Zhou

Domain Adaptation Machine Translation +2

Paper
Add Code

Forced Derivation Tree based Model Training to Statistical Machine Translation

no code implementations • EMNLP 2012 • Nan Duan, Mu Li, Ming Zhou

Machine Translation Translation +1

Paper
Add Code

Re-training Monolingual Parser Bilingually for Syntactic SMT

no code implementations • EMNLP 2012 • Shujie Liu, Chi-Ho Li, Mu Li, Ming Zhou

Machine Translation Word Alignment

Paper
Add Code

Building Large-Scale Twitter-Specific Sentiment Lexicon : A Representation Learning Approach

no code implementations • COLING 2014 • Duyu Tang, Furu Wei, Bing Qin, Ming Zhou, Ting Liu

Information Retrieval Representation Learning +1

Paper
Add Code

A Lexicalized Reordering Model for Hierarchical Phrase-based Translation

no code implementations • COLING 2014 • Hailong Cao, Dong-dong Zhang, Mu Li, Ming Zhou, Tiejun Zhao

Machine Translation Translation

Paper
Add Code

Soft Dependency Matching for Hierarchical Phrase-based Machine Translation

no code implementations • COLING 2014 • Hailong Cao, Dong-dong Zhang, Ming Zhou, Tiejun Zhao

Machine Translation Translation

Paper
Add Code

Twitter Topic Summarization by Ranking Tweets using Social Influence and Content Quality

no code implementations • COLING 2012 • Yajuan Duan, Zhumin Chen, Furu Wei, Ming Zhou, Heung-Yeung Shum

Document Summarization Extractive Text Summarization

Paper
Add Code

Graph-Based Multi-Tweet Summarization using Social Signals

no code implementations • COLING 2012 • Xiaohua Liu, Yitong Li, Furu Wei, Ming Zhou

Paper
Add Code

Lost in Translations? Building Sentiment Lexicons using Context Based Machine Translation

no code implementations • COLING 2012 • Xinfan Meng, Furu Wei, Ge Xu, Longkai Zhang, Xiaohua Liu, Ming Zhou, Houfeng Wang

Machine Translation Sentiment Analysis +1

Paper
Add Code

Factorized Q-Learning for Large-Scale Multi-Agent Systems

no code implementations • 11 Sep 2018 • Yong Chen, Ming Zhou, Ying Wen, Yaodong Yang, Yufeng Su, Wei-Nan Zhang, Dell Zhang, Jun Wang, Han Liu

Deep Q-learning has achieved a significant success in single-agent decision making tasks.

Multiagent Systems

Paper
Add Code

HIBERT: Document Level Pre-training of Hierarchical Bidirectional Transformers for Document Summarization

no code implementations • ACL 2019 • Xingxing Zhang, Furu Wei, Ming Zhou

Neural extractive summarization models usually employ a hierarchical encoder for document encoding and they are trained using sentence-level labels, which are created heuristically using rule-based methods.

Ranked #7 on Extractive Text Summarization on CNN / Daily Mail

Document Summarization Extractive Summarization +2

Paper
Add Code

CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms

no code implementations • 27 May 2019 • Jiarui Jin, Ming Zhou, Wei-Nan Zhang, Minne Li, Zilong Guo, Zhiwei Qin, Yan Jiao, Xiaocheng Tang, Chenxi Wang, Jun Wang, Guobin Wu, Jieping Ye

How to optimally dispatch orders to vehicles and how to trade off between immediate and future returns are fundamental questions for a typical ride-hailing platform.

Multiagent Systems

Paper
Add Code

Coupling Retrieval and Meta-Learning for Context-Dependent Semantic Parsing

no code implementations • ACL 2019 • Daya Guo, Duyu Tang, Nan Duan, Ming Zhou, Jian Yin

In this paper, we present an approach to incorporate retrieved datapoints as supporting evidence for context-dependent semantic parsing, such as generating source code conditioned on the class environment.

Meta-Learning Retrieval +1

Paper
Add Code

Pathologist-Level Grading of Prostate Biopsies with Artificial Intelligence

no code implementations • 2 Jul 2019 • Peter Ström, Kimmo Kartasalo, Henrik Olsson, Leslie Solorzano, Brett Delahunt, Daniel M. Berney, David G. Bostwick, Andrew J. Evans, David J. Grignon, Peter A. Humphrey, Kenneth A. Iczkowski, James G. Kench, Glen Kristiansen, Theodorus H. van der Kwast, Katia R. M. Leite, Jesse K. McKenney, Jon Oxley, Chin-Chen Pan, Hemamali Samaratunga, John R. Srigley, Hiroyuki Takahashi, Toyonori Tsuzuki, Murali Varma, Ming Zhou, Johan Lindberg, Cecilia Bergström, Pekka Ruusuvuori, Carolina Wählby, Henrik Grönberg, Mattias Rantalainen, Lars Egevad, Martin Eklund

We additionally evaluated grading performance on 87 biopsies individually graded by 23 experienced urological pathologists from the International Society of Urological Pathology.

Paper
Add Code

Automatic Grammatical Error Correction for Sequence-to-sequence Text Generation: An Empirical Study

no code implementations • ACL 2019 • Tao Ge, Xingxing Zhang, Furu Wei, Ming Zhou

Sequence-to-sequence (seq2seq) models have achieved tremendous success in text generation tasks.

Formality Style Transfer Grammatical Error Correction +6

Paper
Add Code

Dense Procedure Captioning in Narrated Instructional Videos

no code implementations • ACL 2019 • Botian Shi, Lei Ji, Yaobo Liang, Nan Duan, Peng Chen, Zhendong Niu, Ming Zhou

Understanding narrated instructional videos is important for both research and real-world web applications.

Dense Captioning

Paper
Add Code

Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training

no code implementations • 16 Aug 2019 • Gen Li, Nan Duan, Yuejian Fang, Ming Gong, Daxin Jiang, Ming Zhou

We propose Unicoder-VL, a universal encoder that aims to learn joint representations of vision and language in a pre-training manner.

Ranked #5 on Image-to-Text Retrieval on MS COCO (Recall@10 metric)

Image-text matching Image-to-Text Retrieval +5

Paper
Add Code

Explicit Cross-lingual Pre-training for Unsupervised Machine Translation

no code implementations • IJCNLP 2019 • Shuo Ren, Yu Wu, Shujie Liu, Ming Zhou, Shuai Ma

Pre-training has proven to be effective in unsupervised machine translation due to its ability to model deep context information in cross-lingual scenarios.

Language Modelling Translation +1

Paper
Add Code

Unicoder: A Universal Language Encoder by Pre-training with Multiple Cross-lingual Tasks

no code implementations • IJCNLP 2019 • Haoyang Huang, Yaobo Liang, Nan Duan, Ming Gong, Linjun Shou, Daxin Jiang, Ming Zhou

On XNLI, 1. 8% averaged accuracy improvement (on 15 languages) is obtained.

Cross-Lingual Natural Language Inference Cross-Lingual Question Answering +1

Paper
Add Code

Reasoning Over Semantic-Level Graph for Fact Checking

no code implementations • ACL 2020 • Wanjun Zhong, Jingjing Xu, Duyu Tang, Zenan Xu, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin

We evaluate our system on FEVER, a benchmark dataset for fact checking, and find that rich structural information is helpful and both our graph-based mechanisms improve the accuracy.

Ranked #2 on Fact Verification on FEVER

Claim Verification Fact Checking +4

Paper
Add Code

Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning

no code implementations • 10 Sep 2019 • Liheng Chen, Hongyi Guo, Yali Du, Fei Fang, Haifeng Zhang, Yaoming Zhu, Ming Zhou, Wei-Nan Zhang, Qing Wang, Yong Yu

Although existing works formulate this problem into a centralized learning with decentralized execution framework, which avoids the non-stationary problem in training, their decentralized execution paradigm limits the agents' capability to coordinate.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Sequence-to-sequence Pre-training with Data Augmentation for Sentence Rewriting

no code implementations • 13 Sep 2019 • Yi Zhang, Tao Ge, Furu Wei, Ming Zhou, Xu sun

We study sequence-to-sequence (seq2seq) pre-training with data augmentation for sentence rewriting.

Data Augmentation Formality Style Transfer +4

Paper
Add Code

Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End Speech Translation

no code implementations • 17 Sep 2019 • Chengyi Wang, Yu Wu, Shujie Liu, Zhenglu Yang, Ming Zhou

End-to-end speech translation, a hot topic in recent years, aims to translate a segment of audio into a specific language with an end-to-end model.

Multi-Task Learning Translation

Paper
Add Code

Multi-Agent Reinforcement Learning for Order-dispatching via Order-Vehicle Distribution Matching

no code implementations • 7 Oct 2019 • Ming Zhou, Jiarui Jin, Wei-Nan Zhang, Zhiwei Qin, Yan Jiao, Chenxi Wang, Guobin Wu, Yong Yu, Jieping Ye

Improving the efficiency of dispatching orders to vehicles is a research hotspot in online ride-hailing systems.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Inspecting Unification of Encoding and Matching with Transformer: A Case Study of Machine Reading Comprehension

no code implementations • WS 2019 • Hangbo Bao, Li Dong, Furu Wei, Wenhui Wang, Nan Yang, Lei Cui, Songhao Piao, Ming Zhou

Most machine reading comprehension (MRC) models separately handle encoding and matching with different network architectures.

Machine Reading Comprehension

Paper
Add Code

Asking Clarification Questions in Knowledge-Based Question Answering

no code implementations • IJCNLP 2019 • Jingjing Xu, Yuechen Wang, Duyu Tang, Nan Duan, Pengcheng Yang, Qi Zeng, Ming Zhou, Xu sun

We provide representative baselines for these tasks and further introduce a coarse-to-fine model for clarification question generation.

Question Answering Question Generation +1

Paper
Add Code

Self-Adversarial Learning with Comparative Discrimination for Text Generation

no code implementations • ICLR 2020 • Wangchunshu Zhou, Tao Ge, Ke Xu, Furu Wei, Ming Zhou

Conventional Generative Adversarial Networks (GANs) for text generation tend to have issues of reward sparsity and mode collapse that affect the quality and diversity of generated samples.

Sentence Text Generation

Paper
Add Code

XGPT: Cross-modal Generative Pre-Training for Image Captioning

no code implementations • 3 Mar 2020 • Qiaolin Xia, Haoyang Huang, Nan Duan, Dong-dong Zhang, Lei Ji, Zhifang Sui, Edward Cui, Taroon Bharti, Xin Liu, Ming Zhou

While many BERT-based cross-modal pre-trained models produce excellent results on downstream understanding tasks like image-text retrieval and VQA, they cannot be applied to generation tasks directly.

Data Augmentation Denoising +7

Paper
Add Code

At Which Level Should We Extract? An Empirical Analysis on Extractive Document Summarization

no code implementations • COLING 2020 • Qingyu Zhou, Furu Wei, Ming Zhou

In this work, we show that unnecessity and redundancy issues exist when extracting full sentences, and extracting sub-sentential units is a promising alternative.

Constituency Parsing Document Summarization +4

Paper
Add Code

Learning to Summarize Passages: Mining Passage-Summary Pairs from Wikipedia Revision Histories

no code implementations • 6 Apr 2020 • Qingyu Zhou, Furu Wei, Ming Zhou

In this paper, we propose a method for automatically constructing a passage-to-summary dataset by mining the Wikipedia page revision histories.

Paper
Add Code

Pre-training for Abstractive Document Summarization by Reinstating Source Text

no code implementations • EMNLP 2020 • Yanyan Zou, Xingxing Zhang, Wei Lu, Furu Wei, Ming Zhou

The main idea is that, given an input text artificially constructed from a document, a model is pre-trained to reinstate the original document.

Abstractive Text Summarization Document Summarization +1

Paper
Add Code

Inferential Text Generation with Multiple Knowledge Sources and Meta-Learning

no code implementations • 7 Apr 2020 • Daya Guo, Akari Asai, Duyu Tang, Nan Duan, Ming Gong, Linjun Shou, Daxin Jiang, Jian Yin, Ming Zhou

In this work, we use multiple knowledge sources as fuels for the model.

Meta-Learning Multi-Task Learning +2

Paper
Add Code

Pre-training Text Representations as Meta Learning

no code implementations • 12 Apr 2020 • Shangwen Lv, Yuechen Wang, Daya Guo, Duyu Tang, Nan Duan, Fuqing Zhu, Ming Gong, Linjun Shou, Ryan Ma, Daxin Jiang, Guihong Cao, Ming Zhou, Songlin Hu

In this work, we introduce a learning algorithm which directly optimizes model's ability to learn text representations for effective learning of downstream tasks.

Language Modelling Meta-Learning +2

Paper
Add Code

Curriculum Pre-training for End-to-End Speech Translation

no code implementations • ACL 2020 • Chengyi Wang, Yu Wu, Shujie Liu, Ming Zhou, Zhenglu Yang

End-to-end speech translation poses a heavy burden on the encoder, because it has to transcribe, understand, and learn cross-lingual semantics simultaneously.

speech-recognition Speech Recognition +1

Paper
Add Code

A Heterogeneous Graph with Factual, Temporal and Logical Knowledge for Question Answering Over Dynamic Contexts

no code implementations • 25 Apr 2020 • Wanjun Zhong, Duyu Tang, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin

We study question answering over a dynamic textual environment.

Decision Making Question Answering

Paper
Add Code

LogicalFactChecker: Leveraging Logical Operations for Fact Checking with Graph Module Network

no code implementations • ACL 2020 • Wanjun Zhong, Duyu Tang, Zhangyin Feng, Nan Duan, Ming Zhou, Ming Gong, Linjun Shou, Daxin Jiang, Jiahai Wang, Jian Yin

The graph is used to obtain graph-enhanced contextual representations of words in Transformer-based architecture.

Fact Checking Semantic Parsing

Paper
Add Code

Leveraging Declarative Knowledge in Text and First-Order Logic for Fine-Grained Propaganda Detection

no code implementations • EMNLP 2020 • Ruize Wang, Duyu Tang, Nan Duan, Wanjun Zhong, Zhongyu Wei, Xuanjing Huang, Daxin Jiang, Ming Zhou

We study the detection of propagandistic text fragments in news articles.

Propaganda detection

Paper
Add Code

A Graph-based Coarse-to-fine Method for Unsupervised Bilingual Lexicon Induction

no code implementations • ACL 2020 • Shuo Ren, Shujie Liu, Ming Zhou, Shuai Ma

To deal with those issues, in this paper, we propose a novel graph-based paradigm to induce bilingual lexicons in a coarse-to-fine way.

Bilingual Lexicon Induction Cross-Lingual Word Embeddings +2

Paper
Add Code

A Simple and Effective Unified Encoder for Document-Level Machine Translation

no code implementations • ACL 2020 • Shuming Ma, Dong-dong Zhang, Ming Zhou

Most of the existing models for document-level machine translation adopt dual-encoder structures.

Document Level Machine Translation Machine Translation +1

Paper
Add Code

Improving Neural Machine Translation with Soft Template Prediction

no code implementations • ACL 2020 • Jian Yang, Shuming Ma, Dong-dong Zhang, Zhoujun Li, Ming Zhou

Although neural machine translation (NMT) has achieved significant progress in recent years, most previous NMT models only depend on the source text to generate translation.

Machine Translation NMT +1

Paper
Add Code

Regioned Episodic Reinforcement Learning

no code implementations • 1 Jan 2021 • Jiarui Jin, Cong Chen, Ming Zhou, Weinan Zhang, Rasool Fakoor, David Wipf, Yong Yu, Jun Wang, Alex Smola

Goal-oriented reinforcement learning algorithms are often good at exploration, not exploitation, while episodic algorithms excel at exploitation, not exploration.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Improving the Efficiency of Grammatical Error Correction with Erroneous Span Detection and Correction

no code implementations • EMNLP 2020 • Mengyun Chen, Tao Ge, Xingxing Zhang, Furu Wei, Ming Zhou

We propose a novel language-independent approach to improve the efficiency for Grammatical Error Correction (GEC) by dividing the task into two subtasks: Erroneous Span Detection (ESD) and Erroneous Span Correction (ESC).

Grammatical Error Correction Sentence

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.