Search Results for author: Qun Liu

Found 291 papers, 79 papers with code

End-to-End Simultaneous Speech Translation with Pretraining and Distillation: Huawei Noah’s System for AutoSimTranS 2022

no code implementations • NAACL (AutoSimTrans) 2022 • Xingshan Zeng, Pengfei Li, Liangyou Li, Qun Liu

This paper describes the system submitted to AutoSimTrans 2022 from Huawei Noah’s Ark Lab, which won the first place in the audio input track of the Chinese-English translation task.

Knowledge Distillation NMT +1

Paper
Add Code

Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings

1 code implementation • EMNLP 2021 • Weixuan Wang, Wei Peng, Meng Zhang, Qun Liu

Neural Machine Translation (NMT) has shown a strong ability to utilize local context to disambiguate the meaning of words.

Machine Translation NMT +3

Paper
Code

Self-Supervised Quality Estimation for Machine Translation

no code implementations • EMNLP 2021 • Yuanhang Zheng, Zhixing Tan, Meng Zhang, Mieradilijiang Maimaiti, Huanbo Luan, Maosong Sun, Qun Liu, Yang Liu

Quality estimation (QE) of machine translation (MT) aims to evaluate the quality of machine-translated sentences without references and is important in practical applications of MT.

Machine Translation Sentence +1

Paper
Add Code

Chinese WPLC: A Chinese Dataset for Evaluating Pretrained Language Models on Word Prediction Given Long-Range Context

no code implementations • EMNLP 2021 • Huibin Ge, Chenxi Sun, Deyi Xiong, Qun Liu

Experiment results show that the Chinese pretrained language model PanGu-\alpha is 45 points behind human in terms of top-1 word prediction accuracy, indicating that Chinese WPLC is a challenging dataset.

Language Modelling

Paper
Add Code

Huawei’s Submissions to the WMT20 Biomedical Translation Task

no code implementations • WMT (EMNLP) 2020 • Wei Peng, Jianfeng Liu, Minghan Wang, Liangyou Li, Xupeng Meng, Hao Yang, Qun Liu

This paper describes Huawei’s submissions to the WMT20 biomedical translation shared task.

Machine Translation Transfer Learning +1

Paper
Add Code

Huawei AARC’s Submissions to the WMT21 Biomedical Translation Task: Domain Adaption from a Practical Perspective

no code implementations • WMT (EMNLP) 2021 • Weixuan Wang, Wei Peng, Xupeng Meng, Qun Liu

This paper describes Huawei Artificial Intelligence Application Research Center’s neural machine translation systems and submissions to the WMT21 biomedical translation shared task.

Domain Adaptation Machine Translation +1

Paper
Add Code

NoahNMT at WMT 2021: Dual Transfer for Very Low Resource Supervised Machine Translation

no code implementations • WMT (EMNLP) 2021 • Meng Zhang, Minghao Wu, Pengfei Li, Liangyou Li, Qun Liu

This paper describes the NoahNMT system submitted to the WMT 2021 shared task of Very Low Resource Supervised Machine Translation.

Machine Translation Translation

Paper
Add Code

ClusterFormer: Neural Clustering Attention for Efficient and Effective Transformer

no code implementations • ACL 2022 • Ningning Wang, Guobing Gan, Peng Zhang, Shuai Zhang, Junqiu Wei, Qun Liu, Xin Jiang

Other sparse methods use clustering patterns to select words, but the clustering process is separate from the training process of the target task, which causes a decrease in effectiveness.

Clustering Machine Translation +4

Paper
Add Code

Findings of the Third Workshop on Automatic Simultaneous Translation

no code implementations • NAACL (AutoSimTrans) 2022 • Ruiqing Zhang, Chuanqiang Zhang, Zhongjun He, Hua Wu, Haifeng Wang, Liang Huang, Qun Liu, Julia Ive, Wolfgang Macherey

This paper reports the results of the shared task we hosted on the Third Workshop of Automatic Simultaneous Translation (AutoSimTrans).

Translation

Paper
Add Code

MINER: Multi-Interest Matching Network for News Recommendation

1 code implementation • Findings (ACL) 2022 • Jian Li, Jieming Zhu, Qiwei Bi, Guohao Cai, Lifeng Shang, Zhenhua Dong, Xin Jiang, Qun Liu

Accurately matching user’s interests and candidate news is the key to news recommendation.

News Recommendation

Paper
Code

Multilingual Speech Translation with Unified Transformer: Huawei Noah’s Ark Lab at IWSLT 2021

no code implementations • ACL (IWSLT) 2021 • Xingshan Zeng, Liangyou Li, Qun Liu

We use a unified transformer architecture for our MultiST model, so that the data from different modalities (i. e., speech and text) and different tasks (i. e., Speech Recognition, Machine Translation, and Speech Translation) can be exploited to enhance the model’s ability.

Data Augmentation Machine Translation +4

Paper
Add Code

Controlled Text Generation Using Dictionary Prior in Variational Autoencoders

no code implementations • Findings (ACL) 2022 • Xianghong Fang, Jian Li, Lifeng Shang, Xin Jiang, Qun Liu, Dit-yan Yeung

While variational autoencoders (VAEs) have been widely applied in text generation tasks, they are troubled by two challenges: insufficient representation capacity and poor controllability.

Contrastive Learning Language Modelling +2

Paper
Add Code

MTRec: Multi-Task Learning over BERT for News Recommendation

1 code implementation • Findings (ACL) 2022 • Qiwei Bi, Jian Li, Lifeng Shang, Xin Jiang, Qun Liu, Hanfang Yang

With the adoption of large pre-trained models like BERT in news recommendation, the above way to incorporate multi-field information may encounter challenges: the shallow feature encoding to compress the category and entity information is not compatible with the deep BERT encoding.

Multi-Task Learning News Recommendation

Paper
Code

Two Parents, One Child: Dual Transfer for Low-Resource Neural Machine Translation

no code implementations • Findings (ACL) 2021 • Meng Zhang, Liangyou Li, Qun Liu

Low-Resource Neural Machine Translation Translation

Paper
Add Code

Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust Finetuning

1 code implementation • Findings (ACL) 2021 • Chenglei Si, Zhengyan Zhang, Fanchao Qi, Zhiyuan Liu, Yasheng Wang, Qun Liu, Maosong Sun

Data Augmentation

Paper
Code

Calculating the percentage reduction in translator effort when using machine translation

no code implementations • TC 2016 • Andrzej Zydrón, Qun Liu

Machine Translation Translation

Paper
Add Code

Visually Guided Generative Text-Layout Pre-training for Document Intelligence

1 code implementation • 25 Mar 2024 • Zhiming Mao, Haoli Bai, Lu Hou, Jiansheng Wei, Xin Jiang, Qun Liu, Kam-Fai Wong

Prior study shows that pre-training techniques can boost the performance of visual document understanding (VDU), which typically requires models to gain abilities to perceive and reason both document texts and layouts (e. g., locations of texts and table-cells).

Document Classification document understanding +2

Paper
Code

HawkEye: Training Video-Text LLMs for Grounding Text in Videos

1 code implementation • 15 Mar 2024 • Yueqian Wang, Xiaojun Meng, Jianxin Liang, Yuxuan Wang, Qun Liu, Dongyan Zhao

Video-text Large Language Models (video-text LLMs) have shown remarkable performance in answering questions and holding conversations on simple videos.

Ranked #3 on Video Question Answering on MVBench

Video Grounding Video Question Answering

Paper
Code

Retrieval-based Full-length Wikipedia Generation for Emergent Events

no code implementations • 28 Feb 2024 • Jiebin Zhang, Eugene J. Yu, Qinyu Chen, Chenhao Xiong, Dawei Zhu, Han Qian, Mingbo Song, Xiaoguang Li, Qun Liu, Sujian Li

In today's fast-paced world, the growing demand to quickly generate comprehensive and accurate Wikipedia documents for emerging events is both crucial and challenging.

Retrieval

Paper
Add Code

Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions

no code implementations • 25 Feb 2024 • Xuming Hu, Xiaochuan Li, Junzhe Chen, Yinghui Li, Yangning Li, Xiaoguang Li, Yasheng Wang, Qun Liu, Lijie Wen, Philip S. Yu, Zhijiang Guo

To this end, we propose evaluating the robustness of generative search engines in the realistic and high-risk setting, where adversaries have only black-box system access and seek to deceive the model into returning incorrect responses.

Retrieval

Paper
Add Code

Teacher-Student Learning on Complexity in Intelligent Routing

no code implementations • 24 Feb 2024 • Shu-Ting Pi, Michael Yang, Yuying Zhu, Qun Liu

Customer service is often the most time-consuming aspect for e-commerce websites, with each contact typically taking 10-15 minutes.

Paper
Add Code

Universal Model in Online Customer Service

no code implementations • 24 Feb 2024 • Shu-Ting Pi, Cheng-Ping Hsieh, Qun Liu, Yuying Zhu

Our novel approach involves using machine learning techniques to tag customer questions in transcripts and create a repository of questions and corresponding labels.

Information Retrieval Retrieval +1

Paper
Add Code

Contact Complexity in Customer Service

no code implementations • 24 Feb 2024 • Shu-Ting Pi, Michael Yang, Qun Liu

To tackle this, a machine learning model that accurately predicts the complexity of customer issues is highly desirable.

Paper
Add Code

Learning to Edit: Aligning LLMs with Knowledge Editing

1 code implementation • 19 Feb 2024 • Yuxin Jiang, YuFei Wang, Chuhan Wu, Wanjun Zhong, Xingshan Zeng, Jiahui Gao, Liangyou Li, Xin Jiang, Lifeng Shang, Ruiming Tang, Qun Liu, Wei Wang

Knowledge editing techniques, aiming to efficiently modify a minor proportion of knowledge in large language models (LLMs) without negatively impacting performance across other inputs, have garnered widespread attention.

knowledge editing Philosophy

Paper
Code

Findings of the First Workshop on Simulating Conversational Intelligence in Chat

no code implementations • 9 Feb 2024 • Yvette Graham, Mohammed Rameez Qureshi, Haider Khalid, Gerasimos Lampouras, Ignacio Iacobacci, Qun Liu

The aim of this workshop is to bring together experts working on open-domain dialogue research.

Paper
Add Code

MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models

1 code implementation • 30 Jan 2024 • Wai-Chung Kwan, Xingshan Zeng, Yuxin Jiang, YuFei Wang, Liangyou Li, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong

Large language models (LLMs) are increasingly relied upon for complex multi-turn conversations across diverse real-world applications.

Paper
Code

Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios

1 code implementation • 30 Jan 2024 • Shijue Huang, Wanjun Zhong, Jianqiao Lu, Qi Zhu, Jiahui Gao, Weiwen Liu, Yutai Hou, Xingshan Zeng, Yasheng Wang, Lifeng Shang, Xin Jiang, Ruifeng Xu, Qun Liu

The recent trend of using Large Language Models (LLMs) as tool agents in real-world applications underscores the necessity for comprehensive evaluations of their capabilities, particularly in complex scenarios involving planning, creating, and using tools.

Benchmarking

Paper
Code

YODA: Teacher-Student Progressive Learning for Language Models

no code implementations • 28 Jan 2024 • Jianqiao Lu, Wanjun Zhong, YuFei Wang, Zhijiang Guo, Qi Zhu, Wenyong Huang, Yanlin Wang, Fei Mi, Baojun Wang, Yasheng Wang, Lifeng Shang, Xin Jiang, Qun Liu

With the teacher's guidance, the student learns to iteratively refine its answer with feedback, and forms a robust and comprehensive understanding of the posed questions.

GSM8K Math

Paper
Add Code

PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models

1 code implementation • 26 Jan 2024 • Haochen Tan, Zhijiang Guo, Zhan Shi, Lu Xu, Zhili Liu, Yunlong Feng, Xiaoguang Li, Yasheng Wang, Lifeng Shang, Qun Liu, Linqi Song

LLMs are prompted to generate extensive content in response to these meta-questions.

Text Generation

Paper
Code

Preparing Lessons for Progressive Training on Language Models

1 code implementation • 17 Jan 2024 • Yu Pan, Ye Yuan, Yichun Yin, Jiaxin Shi, Zenglin Xu, Ming Zhang, Lifeng Shang, Xin Jiang, Qun Liu

The rapid progress of Transformers in artificial intelligence has come at the cost of increased resource consumption and greenhouse gas emissions due to growing model sizes.

Paper
Code

PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation

no code implementations • 27 Dec 2023 • Yunhe Wang, Hanting Chen, Yehui Tang, Tianyu Guo, Kai Han, Ying Nie, Xutao Wang, Hailin Hu, Zheyuan Bai, Yun Wang, Fangcheng Liu, Zhicheng Liu, Jianyuan Guo, Sinan Zeng, Yinchen Zhang, Qinghua Xu, Qun Liu, Jun Yao, Chao Xu, DaCheng Tao

We then demonstrate that the proposed approach is significantly effective for enhancing the model nonlinearity through carefully designed ablations; thus, we present a new efficient model architecture for establishing modern, namely, PanGu-$\pi$.

Language Modelling

Paper
Add Code

NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation

1 code implementation • 18 Dec 2023 • Nandan Thakur, Luiz Bonifacio, Xinyu Zhang, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Boxing Chen, Mehdi Rezagholizadeh, Jimmy Lin

We measure LLM robustness using two metrics: (i) hallucination rate, measuring model tendency to hallucinate an answer, when the answer is not present in passages in the non-relevant subset, and (ii) error rate, measuring model inaccuracy to recognize relevant passages in the relevant subset.

Hallucination Language Modelling +2

Paper
Code

A Survey of Reasoning with Foundation Models

1 code implementation • 17 Dec 2023 • Jiankai Sun, Chuanyang Zheng, Enze Xie, Zhengying Liu, Ruihang Chu, Jianing Qiu, Jiaqi Xu, Mingyu Ding, Hongyang Li, Mengzhe Geng, Yue Wu, Wenhai Wang, Junsong Chen, Zhangyue Yin, Xiaozhe Ren, Jie Fu, Junxian He, Wu Yuan, Qi Liu, Xihui Liu, Yu Li, Hao Dong, Yu Cheng, Ming Zhang, Pheng Ann Heng, Jifeng Dai, Ping Luo, Jingdong Wang, Ji-Rong Wen, Xipeng Qiu, Yike Guo, Hui Xiong, Qun Liu, Zhenguo Li

Reasoning, a crucial ability for complex problem-solving, plays a pivotal role in various real-world settings such as negotiation, medical diagnosis, and criminal investigation.

Medical Diagnosis

343

Paper
Code

Unsupervised Extractive Summarization with Learnable Length Control Strategies

no code implementations • 12 Dec 2023 • Renlong Jie, Xiaojun Meng, Xin Jiang, Qun Liu

Different from the centrality-based ranking methods, our extractive scorer can be trained in an end-to-end manner, with no other requirement of positional assumption.

Extractive Summarization Sentence +1

Paper
Add Code

Data Management For Large Language Models: A Survey

1 code implementation • 4 Dec 2023 • Zige Wang, Wanjun Zhong, YuFei Wang, Qi Zhu, Fei Mi, Baojun Wang, Lifeng Shang, Xin Jiang, Qun Liu

Data plays a fundamental role in the training of Large Language Models (LLMs).

Management

185

Paper
Code

FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models

1 code implementation • 31 Oct 2023 • Yuxin Jiang, YuFei Wang, Xingshan Zeng, Wanjun Zhong, Liangyou Li, Fei Mi, Lifeng Shang, Xin Jiang, Qun Liu, Wei Wang

To fill this research gap, in this paper, we propose FollowBench, a Multi-level Fine-grained Constraints Following Benchmark for LLMs.

Instruction Following

Paper
Code

M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models

1 code implementation • 30 Oct 2023 • Wai-Chung Kwan, Xingshan Zeng, YuFei Wang, Yusen Sun, Liangyou Li, Lifeng Shang, Qun Liu, Kam-Fai Wong

In this paper, we propose M4LE, a Multi-ability, Multi-range, Multi-task, Multi-domain benchmark for Long-context Evaluation.

8k Semantic Retrieval

Paper
Code

TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models

1 code implementation • 16 Oct 2023 • Jing Xiong, Jianhao Shen, Ye Yuan, Haiming Wang, Yichun Yin, Zhengying Liu, Lin Li, Zhijiang Guo, Qingxing Cao, Yinya Huang, Chuanyang Zheng, Xiaodan Liang, Ming Zhang, Qun Liu

Automated theorem proving (ATP) has become an appealing domain for exploring the reasoning ability of the recent successful generative language models.

Automated Theorem Proving Benchmarking +1

Paper
Code

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis

no code implementations • 16 Oct 2023 • Kai Chen, Chunwei Wang, Kuo Yang, Jianhua Han, Lanqing Hong, Fei Mi, Hang Xu, Zhengying Liu, Wenyong Huang, Zhenguo Li, Dit-yan Yeung, Lifeng Shang, Xin Jiang, Qun Liu

The rapid development of large language models (LLMs) has not only provided numerous opportunities but also presented significant challenges.

Instruction Following

Paper
Add Code

Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment

1 code implementation • 12 Oct 2023 • Boyang Xue, Weichao Wang, Hongru Wang, Fei Mi, Rui Wang, Yasheng Wang, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong

Inspired by previous work which identified that feed-forward networks (FFNs) within Transformers are responsible for factual knowledge expressions, we investigate two methods to efficiently improve the factual expression capability {of FFNs} by knowledge enhancement and alignment respectively.

Paper
Code

SELF: Self-Evolution with Language Feedback

no code implementations • 1 Oct 2023 • Jianqiao Lu, Wanjun Zhong, Wenyong Huang, YuFei Wang, Qi Zhu, Fei Mi, Baojun Wang, Weichao Wang, Xingshan Zeng, Lifeng Shang, Xin Jiang, Qun Liu

SELF initiates with a meta-skill learning process that equips the LLMs with capabilities for self-feedback and self-refinement.

Language Modelling Large Language Model

Paper
Add Code

FIMO: A Challenge Formal Dataset for Automated Theorem Proving

1 code implementation • 8 Sep 2023 • Chengwu Liu, Jianhao Shen, Huajian Xin, Zhengying Liu, Ye Yuan, Haiming Wang, Wei Ju, Chuanyang Zheng, Yichun Yin, Lin Li, Ming Zhang, Qun Liu

We present FIMO, an innovative dataset comprising formal mathematical problem statements sourced from the International Mathematical Olympiad (IMO) Shortlisted Problems.

Automated Theorem Proving

Paper
Code

Prompt-Based Length Controlled Generation with Reinforcement Learning

no code implementations • 23 Aug 2023 • Renlong Jie, Xiaojun Meng, Lifeng Shang, Xin Jiang, Qun Liu

Large language models (LLMs) like ChatGPT and GPT-4 have attracted great attention given their surprising performance on a wide range of NLP tasks.

reinforcement-learning

Paper
Add Code

NewsDialogues: Towards Proactive News Grounded Conversation

1 code implementation • 12 Aug 2023 • Siheng Li, Yichun Yin, Cheng Yang, Wangjie Jiang, Yiwei Li, Zesen Cheng, Lifeng Shang, Xin Jiang, Qun Liu, Yujiu Yang

In this paper, we propose a novel task, Proactive News Grounded Conversation, in which a dialogue system can proactively lead the conversation based on some key topics of the news.

Response Generation

Paper
Code

AutoConv: Automatically Generating Information-seeking Conversations with Large Language Models

no code implementations • 12 Aug 2023 • Siheng Li, Cheng Yang, Yichun Yin, Xinyu Zhu, Zesen Cheng, Lifeng Shang, Xin Jiang, Qun Liu, Yujiu Yang

Information-seeking conversation, which aims to help users gather information through conversation, has achieved great progress in recent years.

Few-Shot Learning Language Modelling

Paper
Add Code

Aligning Large Language Models with Human: A Survey

1 code implementation • 24 Jul 2023 • YuFei Wang, Wanjun Zhong, Liangyou Li, Fei Mi, Xingshan Zeng, Wenyong Huang, Lifeng Shang, Xin Jiang, Qun Liu

(2) Training methodologies: a detailed review of the prevailing training methods employed for LLM alignment.

600

Paper
Code

DecompEval: Evaluating Generated Texts as Unsupervised Decomposed Question Answering

1 code implementation • 13 Jul 2023 • Pei Ke, Fei Huang, Fei Mi, Yasheng Wang, Qun Liu, Xiaoyan Zhu, Minlie Huang

Existing evaluation metrics for natural language generation (NLG) tasks face the challenges on generalization ability and interpretability.

Dialogue Generation nlg evaluation +3

Paper
Code

mCLIP: Multilingual CLIP via Cross-lingual Transfer

1 code implementation • ACL 2023 • Guanhua Chen, Lu Hou, Yun Chen, Wenliang Dai, Lifeng Shang, Xin Jiang, Qun Liu, Jia Pan, Wenping Wang

Furthermore, to enhance the token- and sentence-level multilingual representation of the MTE, we propose to train it with machine translation and contrastive learning jointly before the TriKD to provide a better initialization.

Contrastive Learning Cross-Lingual Transfer +7

Paper
Code

INGB: Informed Nonlinear Granular Ball Oversampling Framework for Noisy Imbalanced Classification

1 code implementation • 3 Jul 2023 • Min Li, Hao Zhou, Qun Liu, Yabin Shao, GuoYing Wang

It uses granular balls to simulate the spatial distribution characteristics of datasets, and informed entropy is utilized to further optimize the granular-ball space.

Anchor link prediction imbalanced classification

Paper
Code

Enhancing Coherence of Extractive Summarization with Multitask Learning

no code implementations • 22 May 2023 • Renlong Jie, Xiaojun Meng, Lifeng Shang, Xin Jiang, Qun Liu

This study proposes a multitask learning architecture for extractive summarization with coherence boosting.

Extractive Summarization Sentence

Paper
Add Code

M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models

1 code implementation • 17 May 2023 • Chuang Liu, Renren Jin, Yuqi Ren, Linhao Yu, Tianyu Dong, Xiaohan Peng, Shuting Zhang, Jianxiang Peng, Peiyi Zhang, Qingqing Lyu, Xiaowen Su, Qun Liu, Deyi Xiong

Comprehensively evaluating the capability of large language models in multiple tasks is of great importance.

Instruction Following Multiple-choice +1

Paper
Code

Learning Summary-Worthy Visual Representation for Abstractive Summarization in Video

no code implementations • 8 May 2023 • Zenan Xu, Xiaojun Meng, Yasheng Wang, Qinliang Su, Zexuan Qiu, Xin Jiang, Qun Liu

Multimodal abstractive summarization for videos (MAS) requires generating a concise textual summary to describe the highlights of a video according to multimodal resources, in our case, the video content and its transcript.

Abstractive Text Summarization Language Modelling

Paper
Add Code

End-to-end Training and Decoding for Pivot-based Cascaded Translation Model

no code implementations • 3 May 2023 • Hao Cheng, Meng Zhang, Liangyou Li, Qun Liu, Zhihua Zhang

Utilizing pivot language effectively can significantly improve low-resource machine translation.

Machine Translation Translation

Paper
Add Code

Evaluating the Efficacy of Length-Controllable Machine Translation

no code implementations • 3 May 2023 • Hao Cheng, Meng Zhang, Weixuan Wang, Liangyou Li, Qun Liu, Zhihua Zhang

We can use automatic summarization or machine translation evaluation metrics for length-controllable machine translation, but this is not necessarily suitable and accurate.

Machine Translation Translation

Paper
Add Code

Learning Homographic Disambiguation Representation for Neural Machine Translation

no code implementations • 12 Apr 2023 • Weixuan Wang, Wei Peng, Qun Liu

Visualization methods like heatmaps, T-SNE and translation examples are also utilized to demonstrate the effects of the proposed method.

Machine Translation Natural Language Inference +3

Paper
Add Code

PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing

no code implementations • 20 Mar 2023 • Xiaozhe Ren, Pingyi Zhou, Xinfan Meng, Xinjing Huang, Yadao Wang, Weichao Wang, Pengfei Li, Xiaoda Zhang, Alexander Podolskiy, Grigory Arshinov, Andrey Bout, Irina Piontkovskaya, Jiansheng Wei, Xin Jiang, Teng Su, Qun Liu, Jun Yao

In this work, we develop a system that trained a trillion-parameter language model on a cluster of Ascend 910 AI processors and MindSpore framework, and present the language model with 1. 085T parameters named PanGu-{\Sigma}.

Code Generation Language Modelling +4

Paper
Add Code

Adapting Pre-trained Language Models for Quantum Natural Language Processing

no code implementations • 24 Feb 2023 • Qiuchi Li, Benyou Wang, Yudong Zhu, Christina Lioma, Qun Liu

The emerging classical-quantum transfer learning paradigm has brought a decent performance to quantum computational models in many tasks, such as computer vision, by enabling a combination of quantum models and classical pre-trained neural networks.

Sentence Sentence Classification +1

Paper
Add Code

WL-Align: Weisfeiler-Lehman Relabeling for Aligning Users across Networks via Regularized Representation Learning

1 code implementation • 29 Dec 2022 • Li Liu, Penggang Chen, Xin Li, William K. Cheung, Youmin Zhang, Qun Liu, Guoyin Wang

Aligning users across networks using graph representation learning has been found effective where the alignment is accomplished in a low-dimensional embedding space.

Graph Representation Learning

Paper
Code

MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions

1 code implementation • 21 Dec 2022 • Hao Sun, Zhexin Zhang, Fei Mi, Yasheng Wang, Wei Liu, Jianwei Cui, Bin Wang, Qun Liu, Minlie Huang

In this paper, we propose a framework, MoralDial to train and evaluate moral dialogue systems.

Paper
Code

Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding

no code implementations • 19 Dec 2022 • Haoli Bai, Zhiguang Liu, Xiaojun Meng, Wentao Li, Shuang Liu, Nian Xie, Rongfu Zheng, Liangwei Wang, Lu Hou, Jiansheng Wei, Xin Jiang, Qun Liu

While various vision-language pre-training objectives are studied in existing solutions, the document textline, as an intrinsic granularity in VDU, has seldom been explored so far.

Contrastive Learning document understanding +2

Paper
Add Code

AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation

no code implementations • 17 Dec 2022 • Xingshan Zeng, Liangyou Li, Qun Liu

To alleviate the data scarcity problem in End-to-end speech translation (ST), pre-training on data for speech recognition and machine translation is considered as an important technique.

Machine Translation speech-recognition +2

Paper
Add Code

Retrieval-based Disentangled Representation Learning with Natural Language Supervision

no code implementations • 15 Dec 2022 • Jiawei Zhou, Xiaoguang Li, Lifeng Shang, Xin Jiang, Qun Liu, Lei Chen

In light of this, we present Vocabulary Disentangled Retrieval (VDR), a retrieval-based framework that harnesses natural language as proxies of the underlying data variation to drive disentangled representation learning.

Cross-Modal Retrieval Disentanglement +2

Paper
Add Code

G-MAP: General Memory-Augmented Pre-trained Language Model for Domain Tasks

1 code implementation • 7 Dec 2022 • Zhongwei Wan, Yichun Yin, Wei zhang, Jiaxin Shi, Lifeng Shang, Guangyong Chen, Xin Jiang, Qun Liu

Recently, domain-specific PLMs have been proposed to boost the task performance of specific domains (e. g., biomedical and computer science) by continuing to pre-train general PLMs with domain-specific corpora.

General Knowledge Language Modelling +3

Paper
Code

KPT: Keyword-guided Pre-training for Grounded Dialog Generation

no code implementations • 4 Dec 2022 • Qi Zhu, Fei Mi, Zheng Zhang, Yasheng Wang, Yitong Li, Xin Jiang, Qun Liu, Xiaoyan Zhu, Minlie Huang

For the former, the grounding knowledge consists of keywords extracted from the response.

Knowledge Graphs Language Modelling +1

Paper
Add Code

SongRewriter: A Chinese Song Rewriting System with Controllable Content and Rhyme Scheme

1 code implementation • 28 Nov 2022 • Yusen Sun, Liangyou Li, Qun Liu, Dit-yan Yeung

Although lyrics generation has achieved significant progress in recent years, it has limited practical applications because the generated lyrics cannot be performed without composing compatible melodies.

835

Paper
Code

Lexicon-injected Semantic Parsing for Task-Oriented Dialog

no code implementations • 26 Nov 2022 • Xiaojun Meng, Wenlin Dai, Yasheng Wang, Baojun Wang, Zhiyong Wu, Xin Jiang, Qun Liu

Then we present a novel lexicon-injected semantic parser, which collects slot labels of tree representation as a lexicon, and injects lexical features to the span representation of parser.

Semantic Parsing

Paper
Add Code

FPT: Improving Prompt Tuning Efficiency via Progressive Training

1 code implementation • 13 Nov 2022 • Yufei Huang, Yujia Qin, Huadong Wang, Yichun Yin, Maosong Sun, Zhiyuan Liu, Qun Liu

Inspired by these observations, we propose Fast Prompt Tuning (FPT), which starts by conducting PT using a small-scale partial PLM, and then progressively expands its depth and width until the full-model size.

Paper
Code

COPEN: Probing Conceptual Knowledge in Pre-trained Language Models

1 code implementation • 8 Nov 2022 • Hao Peng, Xiaozhi Wang, Shengding Hu, Hailong Jin, Lei Hou, Juanzi Li, Zhiyuan Liu, Qun Liu

We believe this is a critical bottleneck for realizing human-like cognition in PLMs.

Knowledge Probing

Paper
Code

LiteVL: Efficient Video-Language Learning with Enhanced Spatial-Temporal Modeling

no code implementations • 21 Oct 2022 • Dongsheng Chen, Chaofan Tao, Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu

Recent large-scale video-language pre-trained models have shown appealing performance on various downstream tasks.

Language Modelling Question Answering +3

Paper
Add Code

Pre-training Language Models with Deterministic Factual Knowledge

no code implementations • 20 Oct 2022 • Shaobo Li, Xiaoguang Li, Lifeng Shang, Chengjie Sun, Bingquan Liu, Zhenzhou Ji, Xin Jiang, Qun Liu

Further experiments on question-answering datasets show that trying to learn a deterministic relationship with the proposed methods can also help other knowledge-intensive tasks.

Knowledge Probing Question Answering

Paper
Add Code

Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages

1 code implementation • 18 Oct 2022 • Xinyu Zhang, Nandan Thakur, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Mehdi Rezagholizadeh, Jimmy Lin

MIRACL (Multilingual Information Retrieval Across a Continuum of Languages) is a multilingual dataset we have built for the WSDM 2023 Cup challenge that focuses on ad hoc retrieval across 18 different languages, which collectively encompass over three billion native speakers around the world.

Information Retrieval Retrieval

126

Paper
Code

ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset

no code implementations • 17 Aug 2022 • Zhihua Jin, Xingbo Wang, Furui Cheng, Chunhui Sun, Qun Liu, Huamin Qu

Since shortcuts vary in coverage, productivity, and semantic meaning, it is challenging for NLU experts to systematically understand and avoid them when creating benchmark datasets.

Natural Language Understanding

Paper
Add Code

PanGu-Coder: Program Synthesis with Function-Level Language Modeling

1 code implementation • 22 Jul 2022 • Fenia Christopoulou, Gerasimos Lampouras, Milan Gritta, Guchun Zhang, Yinpeng Guo, Zhongqi Li, Qi Zhang, Meng Xiao, Bo Shen, Lin Li, Hao Yu, Li Yan, Pingyi Zhou, Xin Wang, Yuchi Ma, Ignacio Iacobacci, Yasheng Wang, Guangtai Liang, Jiansheng Wei, Xin Jiang, Qianxiang Wang, Qun Liu

We present PanGu-Coder, a pretrained decoder-only language model adopting the PanGu-Alpha architecture for text-to-code generation, i. e. the synthesis of programming language solutions given a natural language problem description.

Code Generation Language Modelling +2

Paper
Code

FreeTransfer-X: Safe and Label-Free Cross-Lingual Transfer from Off-the-Shelf Models

no code implementations • Findings (NAACL) 2022 • Yinpeng Guo, Liangyou Li, Xin Jiang, Qun Liu

However, labeled cross-lingual corpus is expensive or even inaccessible, especially in the fields where labels are private, such as diagnostic results of symptoms in medicine and user profiles in business.

Cross-Lingual Transfer Knowledge Distillation +3

Paper
Add Code

PERT: A New Solution to Pinyin to Character Conversion Task

1 code implementation • 24 May 2022 • Jinghui Xiao, Qun Liu, Xin Jiang, Yuanfeng Xiong, Haiteng Wu, Zhe Zhang

Pinyin to Character conversion (P2C) task is the key task of Input Method Engine (IME) in commercial input software for Asian languages, such as Chinese, Japanese, Thai language and so on.

Language Modelling

835

Paper
Code

Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language Understanding

no code implementations • 21 May 2022 • Abbas Ghaddar, Yimeng Wu, Sunyam Bagga, Ahmad Rashid, Khalil Bibi, Mehdi Rezagholizadeh, Chao Xing, Yasheng Wang, Duan Xinyu, Zhefeng Wang, Baoxing Huai, Xin Jiang, Qun Liu, Philippe Langlais

There is a growing body of work in recent years to develop pre-trained language models (PLMs) for the Arabic language.

Natural Language Understanding

Paper
Add Code

Exploring Extreme Parameter Compression for Pre-trained Language Models

1 code implementation • ICLR 2022 • Yuxin Ren, Benyou Wang, Lifeng Shang, Xin Jiang, Qun Liu

A tiny version achieves $96. 7\%$ performance of BERT-base with $ {1}/{48} $ encoder parameters (i. e., less than 2M parameters excluding the embedding layer) and $2. 7 \times$ faster on inference.

Knowledge Distillation Tensor Decomposition

Paper
Code

UTC: A Unified Transformer with Inter-Task Contrastive Learning for Visual Dialog

no code implementations • CVPR 2022 • Cheng Chen, Yudong Zhu, Zhenshan Tan, Qingrong Cheng, Xin Jiang, Qun Liu, Xiaodong Gu

In this paper, we propose a contrastive learning-based framework UTC to unify and facilitate both discriminative and generative tasks in visual dialog with a single model.

Contrastive Learning Representation Learning +1

Paper
Add Code

PanGu-Bot: Efficient Generative Dialogue Pre-training from Pre-trained Language Model

2 code implementations • 31 Mar 2022 • Fei Mi, Yitong Li, Yulong Zeng, Jingyan Zhou, Yasheng Wang, Chuanfei Xu, Lifeng Shang, Xin Jiang, Shiqi Zhao, Qun Liu

We investigate different aspects of responses generated by PanGu-Bot, including response quality, knowledge, and safety.

Dialogue Generation Language Modelling

2,954

Paper
Code

How Pre-trained Language Models Capture Factual Knowledge? A Causal-Inspired Analysis

no code implementations • Findings (ACL) 2022 • Shaobo Li, Xiaoguang Li, Lifeng Shang, Zhenhua Dong, Chengjie Sun, Bingquan Liu, Zhenzhou Ji, Xin Jiang, Qun Liu

We check the words that have three typical associations with the missing words: knowledge-dependent, positionally close, and highly co-occurred.

Paper
Add Code

Compression of Generative Pre-trained Language Models via Quantization

no code implementations • ACL 2022 • Chaofan Tao, Lu Hou, Wei zhang, Lifeng Shang, Xin Jiang, Qun Liu, Ping Luo, Ngai Wong

We find that previous quantization methods fail on generative tasks due to the \textit{homogeneous word embeddings} caused by reduced capacity, and \textit{varied distribution of weights}.

Model Compression Quantization +1

Paper
Add Code

Triangular Transfer: Freezing the Pivot for Triangular Machine Translation

no code implementations • ACL 2022 • Meng Zhang, Liangyou Li, Qun Liu

Triangular machine translation is a special case of low-resource machine translation where the language pair of interest has limited parallel data, but both languages have abundant parallel data with a pivot language.

Language Modelling Machine Translation +2

Paper
Add Code

Universal Conditional Masked Language Pre-training for Neural Machine Translation

1 code implementation • ACL 2022 • Pengfei Li, Liangyou Li, Meng Zhang, Minghao Wu, Qun Liu

To the best of our knowledge, this is the first work to pre-train a unified model for fine-tuning on both NMT tasks.

Language Modelling Machine Translation +2

2,954

Paper
Code

Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question Answering

1 code implementation • ACL 2022 • Jiawei Zhou, Xiaoguang Li, Lifeng Shang, Lan Luo, Ke Zhan, Enrui Hu, Xinyu Zhang, Hao Jiang, Zhao Cao, Fan Yu, Xin Jiang, Qun Liu, Lei Chen

To alleviate the data scarcity problem in training question answering systems, recent works propose additional intermediate pre-training for dense passage retrieval (DPR).

Open-Domain Question Answering Passage Retrieval +1

Paper
Code

Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation

no code implementations • Findings (ACL) 2022 • Wenliang Dai, Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu, Pascale Fung

Furthermore, the original textual language understanding and generation ability of the PLM is maintained after VLKD, which makes our model versatile for both multimodal and unimodal tasks.

Image Captioning Knowledge Distillation +4

Paper
Add Code

Achieving Reliable Human Assessment of Open-Domain Dialogue Systems

1 code implementation • ACL 2022 • Tianbo Ji, Yvette Graham, Gareth J. F. Jones, Chenyang Lyu, Qun Liu

Answering the distress call of competitions that have emphasized the urgent need for better evaluation techniques in dialogue, we present the successful development of human evaluation that is highly reliable while still remaining feasible and low cost.

Dialogue Evaluation

Paper
Code

Compilable Neural Code Generation with Compiler Feedback

no code implementations • Findings (ACL) 2022 • Xin Wang, Yasheng Wang, Yao Wan, Fei Mi, Yitong Li, Pingyi Zhou, Jin Liu, Hao Wu, Xin Jiang, Qun Liu

Automatically generating compilable programs with (or without) natural language descriptions has always been a touchstone problem for computational linguistics and automated software engineering.

Code Completion Code Generation +3

Paper
Add Code

HyperPELT: Unified Parameter-Efficient Language Model Tuning for Both Language and Vision-and-Language Tasks

no code implementations • 8 Mar 2022 • Zhengkun Zhang, Wenya Guo, Xiaojun Meng, Yasheng Wang, Yadao Wang, Xin Jiang, Qun Liu, Zhenglu Yang

In this paper, we design a novel unified parameter-efficient transfer learning framework that works effectively on both pure language and V&L tasks.

Language Modelling Multi-Task Learning

Paper
Add Code

Read before Generate! Faithful Long Form Question Answering with Machine Reading

no code implementations • Findings (ACL) 2022 • Dan Su, Xiaoguang Li, Jindi Zhang, Lifeng Shang, Xin Jiang, Qun Liu, Pascale Fung

Long-form question answering (LFQA) aims to generate a paragraph-length answer for a given question.

Ranked #1 on Question Answering on KILT: ELI5

Answer Generation Long Form Question Answering +1

Paper
Add Code

Towards Identifying Social Bias in Dialog Systems: Frame, Datasets, and Benchmarks

1 code implementation • 16 Feb 2022 • Jingyan Zhou, Jiawen Deng, Fei Mi, Yitong Li, Yasheng Wang, Minlie Huang, Xin Jiang, Qun Liu, Helen Meng

The research of open-domain dialog systems has been greatly prospered by neural models trained on large-scale corpora, however, such corpora often introduce various safety problems (e. g., offensive languages, biases, and toxic behaviors) that significantly hinder the deployment of dialog systems in practice.

Bias Detection Open-Domain Dialog

Paper
Code

Pan More Gold from the Sand: Refining Open-domain Dialogue Training with Noisy Self-Retrieval Generation

no code implementations • COLING 2022 • Yihe Wang, Yitong Li, Yasheng Wang, Fei Mi, Pingyi Zhou, Xin Wang, Jin Liu, Xin Jiang, Qun Liu

Experiments over publicly available datasets demonstrate that our method can help models generate better responses, even such training data are usually impressed as low-quality data.

Dialogue Generation Retrieval

Paper
Add Code

SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training

1 code implementation • ICLR 2022 • Wenyong Huang, Zhenhe Zhang, Yu Ting Yeung, Xin Jiang, Qun Liu

The student network is trained to output representation resembling that of the teacher.

Denoising Representation Learning

522

Paper
Code

LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework

no code implementations • Findings (NAACL) 2022 • Mengjie Zhao, Fei Mi, Yasheng Wang, Minglei Li, Xin Jiang, Qun Liu, Hinrich Schütze

We propose LMTurk, a novel approach that treats few-shot learners as crowdsourcing workers.

Active Learning Language Modelling

Paper
Add Code

JABER and SABER: Junior and Senior Arabic BERt

1 code implementation • 8 Dec 2021 • Abbas Ghaddar, Yimeng Wu, Ahmad Rashid, Khalil Bibi, Mehdi Rezagholizadeh, Chao Xing, Yasheng Wang, Duan Xinyu, Zhefeng Wang, Baoxing Huai, Xin Jiang, Qun Liu, Philippe Langlais

Language-specific pre-trained models have proven to be more accurate than multilingual ones in a monolingual evaluation setting, Arabic is no exception.

Language Modelling NER

2,954

Paper
Code

Towards Improving Embedding Based Models of Social Network Alignment via Pseudo Anchors

1 code implementation • 22 Nov 2021 • Zihan Yan, Li Liu, Xin Li, William K. Cheung, Youmin Zhang, Qun Liu, Guoyin Wang

Social network alignment aims at aligning person identities across social networks.

Meta-Learning

Paper
Code

CoCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation Detection and Diagnosis

no code implementations • 16 Nov 2021 • Nianzu Zheng, Liqun Deng, Wenyong Huang, Yu Ting Yeung, Baohua Xu, Yuanyuan Guo, Yasheng Wang, Xiao Chen, Xin Jiang, Qun Liu

We utilize conv-transformer structure to encode input speech in a streaming manner.

Multi-Task Learning Phone-level pronunciation scoring

Paper
Add Code

UniDS: A Unified Dialogue System for Chit-Chat and Task-oriented Dialogues

no code implementations • dialdoc (ACL) 2022 • Xinyan Zhao, Bin He, Yasheng Wang, Yitong Li, Fei Mi, Yajiao Liu, Xin Jiang, Qun Liu, Huanhuan Chen

With the advances in deep learning, tremendous progress has been made with chit-chat dialogue systems and task-oriented dialogue systems.

Task-Oriented Dialogue Systems

Paper
Add Code

bert2BERT: Towards Reusable Pretrained Language Models

no code implementations • ACL 2022 • Cheng Chen, Yichun Yin, Lifeng Shang, Xin Jiang, Yujia Qin, Fengyu Wang, Zhi Wang, Xiao Chen, Zhiyuan Liu, Qun Liu

However, large language model pre-training costs intensive computational resources and most of the models are trained from scratch without reusing the existing pre-trained models, which is wasteful.

Language Modelling Large Language Model

Paper
Add Code

Speech-MLP: a simple MLP architecture for speech processing

no code implementations • 29 Sep 2021 • Chao Xing, Dong Wang, LiRong Dai, Qun Liu, Anderson Avila

Overparameterized transformer-based architectures have shown remarkable performance in recent years, achieving state-of-the-art results in speech processing tasks such as speech recognition, speech synthesis, keyword spotting, and speech enhancement et al.

Keyword Spotting Speech Enhancement +3

Paper
Add Code

Multi-Semantic Image Recognition Model and Evaluating Index for explaining the deep learning models

no code implementations • 28 Sep 2021 • Qianmengke Zhao, Ye Wang, Qun Liu

Although deep learning models are powerful among various applications, most deep learning models are still a black box, lacking verifiability and interpretability, which means the decision-making process that human beings cannot understand.

Decision Making Image Classification

Paper
Add Code

DyLex: Incorporating Dynamic Lexicons into BERT for Sequence Labeling

1 code implementation • EMNLP 2021 • Baojun Wang, Zhao Zhang, Kun Xu, Guang-Yuan Hao, Yuyang Zhang, Lifeng Shang, Linlin Li, Xiao Chen, Xin Jiang, Qun Liu

Incorporating lexical knowledge into deep learning models has been proved to be very effective for sequence labeling tasks.

Denoising TAG

835

Paper
Code

Improving Unsupervised Question Answering via Summarization-Informed Question Generation

no code implementations • EMNLP 2021 • Chenyang Lyu, Lifeng Shang, Yvette Graham, Jennifer Foster, Xin Jiang, Qun Liu

Template-based QG uses linguistically-informed heuristics to transform declarative sentences into interrogatives, whereas supervised QG uses existing Question Answering (QA) datasets to train a system to generate a question given a passage and an answer.

Dependency Parsing named-entity-recognition +8

Paper
Add Code

UniMS: A Unified Framework for Multimodal Summarization with Knowledge Distillation

no code implementations • 13 Sep 2021 • Zhengkun Zhang, Xiaojun Meng, Yasheng Wang, Xin Jiang, Qun Liu, Zhenglu Yang

Specially, we adopt knowledge distillation from a vision-language pretrained model to improve image selection, which avoids any requirement on the existence and quality of image captions.

Abstractive Text Summarization Image Captioning +2

Paper
Add Code

CINS: Comprehensive Instruction for Few-shot Learning in Task-oriented Dialog Systems

no code implementations • 10 Sep 2021 • Fei Mi, Yitong Li, Yasheng Wang, Xin Jiang, Qun Liu

As labeling cost for different modules in task-oriented dialog (ToD) systems is high, a major challenge in practice is to learn different tasks with the least amount of labeled data.

dialog state tracking Few-Shot Learning +3

Paper
Add Code

KELM: Knowledge Enhanced Pre-Trained Language Representations with Message Passing on Hierarchical Relational Graphs

1 code implementation • 9 Sep 2021 • Yinquan Lu, Haonan Lu, Guirong Fu, Qun Liu

Incorporating factual knowledge into pre-trained language models (PLM) such as BERT is an emerging trend in recent NLP studies.

Ranked #11 on Common Sense Reasoning on ReCoRD

Common Sense Reasoning Language Modelling +3

Paper
Code

Generate & Rank: A Multi-task Framework for Math Word Problems

no code implementations • Findings (EMNLP) 2021 • Jianhao Shen, Yichun Yin, Lin Li, Lifeng Shang, Xin Jiang, Ming Zhang, Qun Liu

Math word problem (MWP) is a challenging and critical task in natural language processing.

Ranked #2 on Math Word Problem Solving on Math23K

Language Modelling Math +1

Paper
Add Code

NumGPT: Improving Numeracy Ability of Generative Pre-trained Models

no code implementations • 7 Sep 2021 • Zhihua Jin, Xin Jiang, Xingbo Wang, Qun Liu, Yong Wang, Xiaozhe Ren, Huamin Qu

However, those models do not consider the numerical properties of numbers and cannot perform robustly on numerical reasoning tasks (e. g., math word problems and measurement estimation).

Math

Paper
Add Code

Integrating Regular Expressions with Neural Networks via DFA

no code implementations • 7 Sep 2021 • Shaobo Li, Qun Liu, Xin Jiang, Yichun Yin, Chengjie Sun, Bingquan Liu, Zhenzhou Ji, Lifeng Shang

Human-designed rules are widely used to build industry applications.

intent-classification Intent Classification +1

Paper
Add Code

Uncertainty-Aware Balancing for Multilingual and Multi-Domain Neural Machine Translation Training

no code implementations • EMNLP 2021 • Minghao Wu, Yitong Li, Meng Zhang, Liangyou Li, Gholamreza Haffari, Qun Liu

In this work, we propose an approach, MultiUAT, that dynamically adjusts the training data usage based on the model's uncertainty on a small set of trusted clean data for multi-corpus machine translation.

Machine Translation Translation

Paper
Add Code

GhostBERT: Generate More Features with Cheap Operations for BERT

no code implementations • ACL 2021 • Zhiqi Huang, Lu Hou, Lifeng Shang, Xin Jiang, Xiao Chen, Qun Liu

Transformer-based pre-trained language models like BERT, though powerful in many tasks, are expensive in both memory and computation, due to their large number of parameters.

Paper
Add Code

TGEA: An Error-Annotated Dataset and Benchmark Tasks for TextGeneration from Pretrained Language Models

no code implementations • ACL 2021 • Jie He, Bo Peng, Yi Liao, Qun Liu, Deyi Xiong

Each error is hence manually labeled with comprehensive annotations, including the span of the error, the associated span, minimal correction to the error, the type of the error, and rationale behind the error.

Common Sense Reasoning Text Generation

Paper
Add Code

AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models

1 code implementation • ACL 2021 • Yichun Yin, Cheng Chen, Lifeng Shang, Xin Jiang, Xiao Chen, Qun Liu

Specifically, we carefully design the techniques of one-shot learning and the search space to provide an adaptive and efficient development way of tiny PLMs for various latency constraints.

Neural Architecture Search One-Shot Learning

2,954

Paper
Code

A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised Question Answering

1 code implementation • ACL 2021 • Zhihong Shao, Lifeng Shang, Qun Liu, Minlie Huang

This setting gives rise to the spurious solution problem: there may exist many spurious solutions that coincidentally derive the correct answer, but training on such solutions can hurt model performance (e. g., producing wrong solutions or answers).

Question Answering

Paper
Code

RealTranS: End-to-End Simultaneous Speech Translation with Convolutional Weighted-Shrinking Transformer

no code implementations • Findings (ACL) 2021 • Xingshan Zeng, Liangyou Li, Qun Liu

To bridge the modality gap between speech and text, RealTranS gradually downsamples the input speech with interleaved convolution and unidirectional Transformer layers for acoustic modeling, and then maps speech features into text space with a weighted-shrinking operation and a semantic encoder.

Translation

Paper
Add Code

Learning Multilingual Representation for Natural Language Understanding with Enhanced Cross-Lingual Supervision

no code implementations • 9 Jun 2021 • Yinpeng Guo, Liangyou Li, Xin Jiang, Qun Liu

Recently, pre-training multilingual language models has shown great potential in learning multilingual representation, a crucial topic of natural language processing.

Natural Language Understanding

Paper
Add Code

Sub-Character Tokenization for Chinese Pretrained Language Models

2 code implementations • 1 Jun 2021 • Chenglei Si, Zhengyan Zhang, Yingfa Chen, Fanchao Qi, Xiaozhi Wang, Zhiyuan Liu, Yasheng Wang, Qun Liu, Maosong Sun

2) Pronunciation-based SubChar tokenizers can encode Chinese homophones into the same transliteration sequences and produce the same tokenization output, hence being robust to homophone typos.

Chinese Word Segmentation Computational Efficiency +2

Paper
Code

Multilingual Speech Translation with Unified Transformer: Huawei Noah's Ark Lab at IWSLT 2021

no code implementations • 1 Jun 2021 • Xingshan Zeng, Liangyou Li, Qun Liu

Data Augmentation Machine Translation +4

Paper
Add Code

Improved OOD Generalization via Adversarial Training and Pre-training

no code implementations • 24 May 2021 • Mingyang Yi, Lu Hou, Jiacheng Sun, Lifeng Shang, Xin Jiang, Qun Liu, Zhi-Ming Ma

In this paper, after defining OOD generalization via Wasserstein distance, we theoretically show that a model robust to input perturbation generalizes well on OOD data.

Image Classification Natural Language Understanding

Paper
Add Code

Dynamic Multi-Branch Layers for On-Device Neural Machine Translation

1 code implementation • 14 May 2021 • Zhixing Tan, Zeyuan Yang, Meng Zhang, Qun Liu, Maosong Sun, Yang Liu

With the rapid development of artificial intelligence (AI), there is a trend in moving AI applications, such as neural machine translation (NMT), from cloud to mobile devices.

Machine Translation NMT +1

Paper
Code

HyKnow: End-to-End Task-Oriented Dialog Modeling with Hybrid Knowledge Management

1 code implementation • Findings (ACL) 2021 • Silin Gao, Ryuichi Takanobu, Wei Peng, Qun Liu, Minlie Huang

To address this task, we propose a TOD system with hybrid knowledge management, HyKnow.

Management Retrieval

Paper
Code

PanGu-$α$: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation

4 code implementations • 26 Apr 2021 • Wei Zeng, Xiaozhe Ren, Teng Su, Hui Wang, Yi Liao, Zhiwei Wang, Xin Jiang, ZhenZhang Yang, Kaisheng Wang, Xiaoda Zhang, Chen Li, Ziyan Gong, Yifan Yao, Xinjing Huang, Jun Wang, Jianfeng Yu, Qi Guo, Yue Yu, Yan Zhang, Jin Wang, Hengtao Tao, Dasen Yan, Zexuan Yi, Fang Peng, Fangqing Jiang, Han Zhang, Lingfeng Deng, Yehong Zhang, Zhe Lin, Chao Zhang, Shaojie Zhang, Mingyue Guo, Shanzhi Gu, Gaojun Fan, YaoWei Wang, Xuefeng Jin, Qun Liu, Yonghong Tian

To enhance the generalization ability of PanGu-$\alpha$, we collect 1. 1TB high-quality Chinese data from a wide range of domains to pretrain the model.

Ranked #1 on Reading Comprehension (One-Shot) on DuReader

Cloze (multi-choices) (Few-Shot) Cloze (multi-choices) (One-Shot) +19

219

Paper
Code

Extract then Distill: Efficient and Effective Task-Agnostic BERT Distillation

no code implementations • 24 Apr 2021 • Cheng Chen, Yichun Yin, Lifeng Shang, Zhi Wang, Xin Jiang, Xiao Chen, Qun Liu

Task-agnostic knowledge distillation, a teacher-student framework, has been proved effective for BERT compression.

Knowledge Distillation

Paper
Add Code

From Fully Trained to Fully Random Embeddings: Improving Neural Machine Translation with Compact Word Embedding Tables

no code implementations • 18 Apr 2021 • Krtin Kumar, Peyman Passban, Mehdi Rezagholizadeh, Yiu Sing Lau, Qun Liu

Embedding matrices are key components in neural natural language processing (NLP) models that are responsible to provide numerical representations of input tokens.\footnote{In this paper words and subwords are referred to as \textit{tokens} and the term \textit{embedding} only refers to embeddings of inputs.}

Machine Translation NMT +2

Paper
Add Code

An Approach to Improve Robustness of NLP Systems against ASR Errors

no code implementations • 25 Mar 2021 • Tong Cui, Jinghui Xiao, Liangyou Li, Xin Jiang, Qun Liu

Speech-enabled systems typically first convert audio to text through an automatic speech recognition (ASR) model and then feed the text to downstream natural language processing (NLP) modules.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

Dependency Graph-to-String Statistical Machine Translation

no code implementations • 20 Mar 2021 • Liangyou Li, Andy Way, Qun Liu

We present graph-based translation models which translate source graphs into target strings.

Machine Translation Translation

Paper
Add Code

Reweighting Augmented Samples by Minimizing the Maximal Expected Loss

1 code implementation • ICLR 2021 • Mingyang Yi, Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu, Zhi-Ming Ma

Inspired by adversarial training, we minimize this maximal expected loss (MMEL) and obtain a simple and interpretable closed-form solution: more attention should be paid to augmented samples with large loss values (i. e., harder examples).

Image Augmentation Image Classification +1

Paper
Code

LightMBERT: A Simple Yet Effective Method for Multilingual BERT Distillation

no code implementations • 11 Mar 2021 • Xiaoqi Jiao, Yichun Yin, Lifeng Shang, Xin Jiang, Xiao Chen, Linlin Li, Fang Wang, Qun Liu

The multilingual pre-trained language models (e. g, mBERT, XLM and XLM-R) have shown impressive performance on cross-lingual natural language understanding tasks.

Natural Language Understanding XLM-R

Paper
Add Code

Training Multilingual Pre-trained Language Model with Byte-level Subwords

1 code implementation • 23 Jan 2021 • Junqiu Wei, Qun Liu, Yinpeng Guo, Xin Jiang

The pre-trained language models have achieved great successes in various natural language understanding (NLU) tasks due to its capacity to capture the deep contextualized information in text by pre-training on large-scale corpora.

Language Modelling Natural Language Understanding

2,954

Paper
Code

On Position Embeddings in BERT

no code implementations • ICLR 2021 • Benyou Wang, Lifeng Shang, Christina Lioma, Xin Jiang, Hao Yang, Qun Liu, Jakob Grue Simonsen

Various Position Embeddings (PEs) have been proposed in Transformer based architectures~(e. g. BERT) to model word order.

General Classification Position +1

Paper
Add Code

HopRetriever: Retrieve Hops over Wikipedia to Answer Complex Questions

no code implementations • 31 Dec 2020 • Shaobo Li, Xiaoguang Li, Lifeng Shang, Xin Jiang, Qun Liu, Chengjie Sun, Zhenzhou Ji, Bingquan Liu

In this paper, we propose a new retrieval target, hop, to collect the hidden reasoning evidence from Wikipedia for complex question answering.

Ranked #6 on Question Answering on HotpotQA

Document Embedding Open-Domain Question Answering +1

Paper
Add Code

BinaryBERT: Pushing the Limit of BERT Quantization

1 code implementation • ACL 2021 • Haoli Bai, Wei zhang, Lu Hou, Lifeng Shang, Jing Jin, Xin Jiang, Qun Liu, Michael Lyu, Irwin King

In this paper, we propose BinaryBERT, which pushes BERT quantization to the limit by weight binarization.

Binarization Model Compression +1

2,954

Paper
Code

Revisiting Robust Neural Machine Translation: A Transformer Case Study

no code implementations • Findings (EMNLP) 2021 • Peyman Passban, Puneeth S. M. Saladi, Qun Liu

There is a large body of work in the NMT literature on analyzing the behavior of conventional models for the problem of noise but Transformers are relatively understudied in this context.

Denoising Machine Translation +2

Paper
Add Code

Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuning

1 code implementation • 31 Dec 2020 • Chenglei Si, Zhengyan Zhang, Fanchao Qi, Zhiyuan Liu, Yasheng Wang, Qun Liu, Maosong Sun

In this work, we propose a simple and effective method to cover a much larger proportion of the attack search space, called Adversarial and Mixup Data Augmentation (AMDA).

Adversarial Robustness Text Augmentation +2

Paper
Code

ALP-KD: Attention-Based Layer Projection for Knowledge Distillation

no code implementations • 27 Dec 2020 • Peyman Passban, Yimeng Wu, Mehdi Rezagholizadeh, Qun Liu

Knowledge distillation is considered as a training and compression strategy in which two neural networks, namely a teacher and a student, are coupled together during training.

Knowledge Distillation

Paper
Add Code

Improving Task-Agnostic BERT Distillation with Layer Mapping Search

no code implementations • 11 Dec 2020 • Xiaoqi Jiao, Huating Chang, Yichun Yin, Lifeng Shang, Xin Jiang, Xiao Chen, Linlin Li, Fang Wang, Qun Liu

Comprehensive experiments on the evaluation benchmarks demonstrate that 1) layer mapping strategy has a significant effect on task-agnostic BERT distillation and different layer mappings can result in quite different performances; 2) the optimal layer mapping strategy from the proposed search process consistently outperforms the other heuristic ones; 3) with the optimal layer mapping, our student model achieves state-of-the-art performance on the GLUE tasks.

Knowledge Distillation

Paper
Add Code

KgPLM: Knowledge-guided Language Model Pre-training via Generative and Discriminative Learning

no code implementations • 7 Dec 2020 • Bin He, Xin Jiang, Jinghui Xiao, Qun Liu

Recent studies on pre-trained language models have demonstrated their ability to capture factual knowledge and applications in knowledge-aware downstream tasks.

Language Modelling Machine Reading Comprehension +2

Paper
Add Code

PPKE: Knowledge Representation Learning by Path-based Pre-training

no code implementations • 7 Dec 2020 • Bin He, Di Zhou, Jing Xie, Jinghui Xiao, Xin Jiang, Qun Liu

Entities may have complex interactions in a knowledge graph (KG), such as multi-step relationships, which can be viewed as graph contextual information of the entities.

Link Prediction Representation Learning

Paper
Add Code

Document Graph for Neural Machine Translation

no code implementations • EMNLP 2021 • Mingzhou Xu, Liangyou Li, Derek. F. Wong, Qun Liu, Lidia S. Chao

Previous works have shown that contextual information can improve the performance of neural machine translation (NMT).

Machine Translation NMT +1

Paper
Add Code

From Unsupervised Machine Translation To Adversarial Text Generation

no code implementations • 10 Nov 2020 • Ahmad Rashid, Alan Do-Omri, Md. Akmal Haidar, Qun Liu, Mehdi Rezagholizadeh

B-GAN is able to generate a distributed latent space representation which can be paired with an attention based decoder to generate fluent sentences.

Adversarial Text Text Generation +2

Paper
Add Code

Know What You Don't Need: Single-Shot Meta-Pruning for Attention Heads

no code implementations • 7 Nov 2020 • Zhengyan Zhang, Fanchao Qi, Zhiyuan Liu, Qun Liu, Maosong Sun

To measure the informativeness of attention heads, we train our Single-Shot Meta-Pruner (SMP) with a meta-learning paradigm aiming to maintain the distribution of text representations after pruning.

Informativeness Meta-Learning +1

Paper
Add Code

The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Jie He, Tao Wang, Deyi Xiong, Qun Liu

Our experiments and analyses demonstrate that neural machine translation performs poorly on commonsense reasoning of the three ambiguity types in terms of both reasoning accuracy ( 6 60. 1{\%}) and reasoning consistency (6 31{\%}).

Common Sense Reasoning Machine Translation +2

Paper
Code

BERT-MK: Integrating Graph Contextualized Knowledge into Pre-trained Language Models

no code implementations • Findings of the Association for Computational Linguistics 2020 • Bin He, Di Zhou, Jinghui Xiao, Xin Jiang, Qun Liu, Nicholas Jing Yuan, Tong Xu

Complex node interactions are common in knowledge graphs (KGs), and these interactions can be considered as contextualized knowledge exists in the topological structure of KGs.

Knowledge Graphs Language Modelling +1

Paper
Add Code

HyperText: Endowing FastText with Hyperbolic Geometry

3 code implementations • Findings of the Association for Computational Linguistics 2020 • Yudong Zhu, Di Zhou, Jinghui Xiao, Xin Jiang, Xiao Chen, Qun Liu

Natural language data exhibit tree-like hierarchical structures such as the hypernym-hyponym relations in WordNet.

General Classification Text Classification

2,954

Paper
Code

LiDAM: Semi-Supervised Learning with Localized Domain Adaptation and Iterative Matching

no code implementations • 13 Oct 2020 • Qun Liu, Matthew Shreve, Raja Bala

Although data is abundant, data labeling is expensive.

Ranked #1 on Semi-Supervised Image Classification on CIFAR-100, 5000Labels

Domain Adaptation Semi-Supervised Image Classification

Paper
Add Code

Why Skip If You Can Combine: A Simple Knowledge Distillation Technique for Intermediate Layers

2 code implementations • EMNLP 2020 • Yimeng Wu, Peyman Passban, Mehdi Rezagholizade, Qun Liu

With the growth of computing power neural machine translation (NMT) models also grow accordingly and become better.

Knowledge Distillation Machine Translation +2

Paper
Code

SparTerm: Learning Term-based Sparse Representation for Fast Text Retrieval

no code implementations • 2 Oct 2020 • Yang Bai, Xiaoguang Li, Gang Wang, Chaoliang Zhang, Lifeng Shang, Jun Xu, Zhaowei Wang, Fangshan Wang, Qun Liu

Term-based sparse representations dominate the first-stage text retrieval in industrial applications, due to its advantage in efficiency, interpretability, and exact term matching.

Language Modelling Retrieval +1

Paper
Add Code

TernaryBERT: Distillation-aware Ultra-low Bit BERT

5 code implementations • EMNLP 2020 • Wei Zhang, Lu Hou, Yichun Yin, Lifeng Shang, Xiao Chen, Xin Jiang, Qun Liu

Transformer-based pre-training models like BERT have achieved remarkable performance in many natural language processing tasks. However, these models are both computation and memory expensive, hindering their deployment to resource-constrained devices.

Knowledge Distillation Quantization

2,954

Paper
Code

TensorCoder: Dimension-Wise Attention via Tensor Representation for Natural Language Modeling

no code implementations • 28 Jul 2020 • Shuai Zhang, Peng Zhang, Xindian Ma, Junqiu Wei, Ningning Wang, Qun Liu

Transformer has been widely-used in many Natural Language Processing (NLP) tasks and the scaled dot-product attention between tokens is a core module of Transformer.

Language Modelling Machine Translation +2

Paper
Add Code

Learning to Detect Unacceptable Machine Translations for Downstream Tasks

no code implementations • 8 May 2020 • Meng Zhang, Xin Jiang, Yang Liu, Qun Liu

In this work, we put machine translation in a cross-lingual pipeline and introduce downstream tasks to define task-specific acceptability of machine translations.

Machine Translation Translation

Paper
Add Code

Accurate Word Alignment Induction from Neural Machine Translation

1 code implementation • EMNLP 2020 • Yun Chen, Yang Liu, Guanhua Chen, Xin Jiang, Qun Liu

Shift-Att is an interpretation method that induces alignments from the attention weights of Transformer and does not require parameter update or architecture change.

Machine Translation Multi-Task Learning +2

Paper
Code

Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT

1 code implementation • ACL 2020 • Zhiyong Wu, Yun Chen, Ben Kao, Qun Liu

However, this approach of evaluating a language model is undermined by the uncertainty of the amount of knowledge that is learned by the probe itself.

Dependency Parsing Language Modelling +2

102

Paper
Code

Probabilistically Masked Language Model Capable of Autoregressive Generation in Arbitrary Word Order

3 code implementations • ACL 2020 • Yi Liao, Xin Jiang, Qun Liu

Masked language model and autoregressive language model are two types of language models.

Language Modelling Natural Language Understanding +1

2,954

Paper
Code

DynaBERT: Dynamic BERT with Adaptive Width and Depth

3 code implementations • NeurIPS 2020 • Lu Hou, Zhiqi Huang, Lifeng Shang, Xin Jiang, Xiao Chen, Qun Liu

The pre-trained language models like BERT, though powerful in many natural language processing tasks, are both computation and memory expensive.

Language Modelling

11,398

Paper
Code

Dictionary-based Data Augmentation for Cross-Domain Neural Machine Translation

no code implementations • 6 Apr 2020 • Wei Peng, Chongxuan Huang, Tian-Hao Li, Yun Chen, Qun Liu

Existing data augmentation approaches for neural machine translation (NMT) have predominantly relied on back-translating in-domain (IND) monolingual corpora.

Data Augmentation Machine Translation +2

Paper
Add Code

A One-Shot Learning Framework for Assessment of Fibrillar Collagen from Second Harmonic Generation Images of an Infarcted Myocardium

no code implementations • 23 Jan 2020 • Qun Liu, Supratik Mukhopadhyay, Maria Ximena Bastidas Rodriguez, Xing Fu, Sushant Sahu, David Burk, Manas Gartia

Myocardial infarction (MI) is a scientific term that refers to heart attack.

One-Shot Learning Specificity

Paper
Add Code

Context-Aware Design of Cyber-Physical Human Systems (CPHS)

no code implementations • 7 Jan 2020 • Supratik Mukhopadhyay, Qun Liu, Edward Collier, Yimin Zhu, Ravindra Gudishala, Chanachok Chokwitthaya, Robert DiBiano, Alimire Nabijiang, Sanaz Saeidi, Subhajit Sidhanta, Arnab Ganguly

The impacts of context factors driving human system interaction are challenging and are difficult to capture and replicate in existing design models.

Decision Making

Paper
Add Code

Multi-channel Reverse Dictionary Model

1 code implementation • 18 Dec 2019 • Lei Zhang, Fanchao Qi, Zhiyuan Liu, Yasheng Wang, Qun Liu, Maosong Sun

A reverse dictionary takes the description of a target word as input and outputs the target word together with other words that match the description.

Reverse Dictionary Sentence

106

Paper
Code

Deep Learning-Based Feature-Aware Data Modeling for Complex Physics Simulations

no code implementations • 8 Dec 2019 • Qun Liu, Subhashis Hazarika, John M. Patchett, James Paul Ahrens, Ayan Biswas

Data modeling and reduction for in situ is important.

Paper
Add Code

Learning to Predict Explainable Plots for Neural Story Generation

no code implementations • 5 Dec 2019 • Gang Chen, Yang Liu, Huanbo Luan, Meng Zhang, Qun Liu, Maosong Sun

While the use of neural networks has proven effective in improving story generation, how to learn to generate an explainable high-level plot still remains a major challenge.

Sentence Story Generation

Paper
Add Code

Integrating Graph Contextualized Knowledge into Pre-trained Language Models

no code implementations • 30 Nov 2019 • Bin He, Di Zhou, Jinghui Xiao, Xin Jiang, Qun Liu, Nicholas Jing Yuan, Tong Xu

Complex node interactions are common in knowledge graphs, and these interactions also contain rich knowledge information.

Knowledge Graphs Representation Learning

Paper
Add Code

Deep-seismic-prior-based reconstruction of seismic data using convolutional neural networks

no code implementations • 20 Nov 2019 • Qun Liu, Lihua Fu, Meng Zhang

Synthetic and field data were tested to assess the performance of the proposed algorithm (DSPRecon algorithm); the advantages of using our method were evaluated by comparing it with the singular spectrum analysis (SSA) method for irregular data reconstruction and de-aliased Cadzow method for regular data reconstruction.

Paper
Add Code

DeepSat V2: Feature Augmented Convolutional Neural Nets for Satellite Image Classification

1 code implementation • 15 Nov 2019 • Qun Liu, Saikat Basu, Sangram Ganguly, Supratik Mukhopadhyay, Robert DiBiano, Manohar Karki, Ramakrishna Nemani

Satellite image classification is a challenging problem that lies at the crossroads of remote sensing, computer vision, and machine learning.

Ranked #1 on Satellite Image Classification on SAT-4

Classification General Classification +1

457

Paper
Code

Zero-Shot Paraphrase Generation with Multilingual Language Models

no code implementations • 9 Nov 2019 • Yinpeng Guo, Yi Liao, Xin Jiang, Qing Zhang, Yibo Zhang, Qun Liu

Leveraging multilingual parallel texts to automatically generate paraphrases has drawn much attention as size of high-quality paraphrase corpus is limited.

Denoising Machine Translation +3

Paper
Add Code

A General Framework for Adaptation of Neural Machine Translation to Simultaneous Translation

no code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Yun Chen, Liangyou Li, Xin Jiang, Xiao Chen, Qun Liu

Despite the success of neural machine translation (NMT), simultaneous neural machine translation (SNMT), the task of translating in real time before a full sentence has been observed, remains challenging due to the syntactic structure difference and simultaneity requirements.

Machine Translation NMT +2

Paper
Add Code

Pretrained Language Models for Document-Level Neural Machine Translation

no code implementations • 8 Nov 2019 • Liangyou Li, Xin Jiang, Qun Liu

Previous work on document-level NMT usually focuses on limited contexts because of degraded performance on larger contexts.

Machine Translation NMT +2

Paper
Add Code

Word-level Textual Adversarial Attacking as Combinatorial Optimization

1 code implementation • ACL 2020 • Yuan Zang, Fanchao Qi, Chenghao Yang, Zhiyuan Liu, Meng Zhang, Qun Liu, Maosong Sun

Also, further experiments show our model has higher transferability and can bring more robustness enhancement to victim models by adversarial training.

Adversarial Attack Combinatorial Optimization +3

Paper
Code

Improving Sequence Modeling Ability of Recurrent Neural Networks via Sememes

1 code implementation • 20 Oct 2019 • Yujia Qin, Fanchao Qi, Sicong Ouyang, Zhiyuan Liu, Cheng Yang, Yasheng Wang, Qun Liu, Maosong Sun

Sememes, the minimum semantic units of human languages, have been successfully utilized in various natural language processing applications.

Adversarial Attack Language Modelling +2

Paper
Code

TinyBERT: Distilling BERT for Natural Language Understanding

7 code implementations • Findings of the Association for Computational Linguistics 2020 • Xiaoqi Jiao, Yichun Yin, Lifeng Shang, Xin Jiang, Xiao Chen, Linlin Li, Fang Wang, Qun Liu

To accelerate inference and reduce model size while maintaining accuracy, we first propose a novel Transformer distillation method that is specially designed for knowledge distillation (KD) of the Transformer-based models.

Ranked #1 on Natural Language Inference on MultiNLI Dev

Knowledge Distillation Language Modelling +6

11,398

Paper
Code

NEZHA: Neural Contextualized Representation for Chinese Language Understanding

10 code implementations • 31 Aug 2019 • Junqiu Wei, Xiaozhe Ren, Xiaoguang Li, Wenyong Huang, Yi Liao, Yasheng Wang, Jiashu Lin, Xin Jiang, Xiao Chen, Qun Liu

named-entity-recognition Named Entity Recognition +6

11,398

Paper
Code

Dialog State Tracking with Reinforced Data Augmentation

no code implementations • 21 Aug 2019 • Yichun Yin, Lifeng Shang, Xin Jiang, Xiao Chen, Qun Liu

Neural dialog state trackers are generally limited due to the lack of quantity and diversity of annotated training data.

Data Augmentation dialog state tracking +1

Paper
Add Code

PCGAN-CHAR: Progressively Trained Classifier Generative Adversarial Networks for Classification of Noisy Handwritten Bangla Characters

no code implementations • 11 Aug 2019 • Qun Liu, Edward Collier, Supratik Mukhopadhyay

We show that by learning the features at each resolution independently a trained model is able to accurately classify characters even in the presence of noise.

Ranked #1 on Image Classification on Noisy MNIST (AWGN)

Classification Denoising +3

Paper
Add Code

Huawei's NMT Systems for the WMT 2019 Biomedical Translation Task

no code implementations • WS 2019 • Wei Peng, Jianfeng Liu, Liangyou Li, Qun Liu

This paper describes Huawei{'}s neural machine translation systems for the WMT 2019 biomedical translation shared task.

Domain Adaptation Machine Translation +3

Paper
Add Code

Modeling Semantic Compositionality with Sememe Knowledge

1 code implementation • ACL 2019 • Fanchao Qi, Jun-Jie Huang, Chenghao Yang, Zhiyuan Liu, Xiao Chen, Qun Liu, Maosong Sun

In this paper, we verify the effectiveness of sememes, the minimum semantic units of human languages, in modeling SC by a confirmatory experiment.

multi-word expression embedding multi-word expression sememe prediction

Paper
Code

GPT-based Generation for Classical Chinese Poetry

2 code implementations • 29 Jun 2019 • Yi Liao, Yasheng Wang, Qun Liu, Xin Jiang

We present a simple yet effective method for generating high quality classical Chinese poetry with Generative Pre-trained Language Model (GPT).

Language Modelling

2,954

Paper
Code

Decomposable Neural Paraphrase Generation

no code implementations • ACL 2019 • Zichao Li, Xin Jiang, Lifeng Shang, Qun Liu

Paraphrasing exists at different granularity levels, such as lexical level, phrasal level and sentential level.

Paraphrase Generation Sentence +1

Paper
Add Code

Bridging the Gap between Training and Inference for Neural Machine Translation

no code implementations • ACL 2019 • Wen Zhang, Yang Feng, Fandong Meng, Di You, Qun Liu

Neural Machine Translation (NMT) generates target words sequentially in the way of predicting the next word conditioned on the context words.

Machine Translation NMT +2

Paper
Add Code

ERNIE: Enhanced Language Representation with Informative Entities

2 code implementations • ACL 2019 • Zhengyan Zhang, Xu Han, Zhiyuan Liu, Xin Jiang, Maosong Sun, Qun Liu

Neural language representation models such as BERT pre-trained on large-scale corpora can well capture rich semantic patterns from plain text, and be fine-tuned to consistently improve the performance of various NLP tasks.

Ranked #1 on Entity Linking on FIGER

Entity Linking Entity Typing +6

1,401

Paper
Code

Bilingual-GAN: A Step Towards Parallel Text Generation

no code implementations • WS 2019 • Ahmad Rashid, Alan Do-Omri, Md. Akmal Haidar, Qun Liu, Mehdi Rezagholizadeh

Latent space based GAN methods and attention based sequence to sequence models have achieved impressive results in text generation and unsupervised machine translation respectively.

Denoising Text Generation +2

Paper
Add Code

Improving Domain Adaptation Translation with Domain Invariant and Specific Information

no code implementations • NAACL 2019 • Shuhao Gu, Yang Feng, Qun Liu

Besides, we add a discriminator to the shared encoder and employ adversarial training for the whole model to reinforce the performance of information separation and machine translation simultaneously.

Domain Adaptation Machine Translation +1

Paper
Add Code

Improving Route Choice Models by Incorporating Contextual Factors via Knowledge Distillation

no code implementations • 27 Mar 2019 • Qun Liu, Supratik Mukhopadhyay, Yimin Zhu, Ravindra Gudishala, Sanaz Saeidi, Alimire Nabijiang

High fidelity route choice models are required to predict traffic levels with higher accuracy.

Knowledge Distillation Management

Paper
Add Code

Improving the Robustness of Speech Translation

no code implementations • 2 Nov 2018 • Xiang Li, Haiyang Xue, Wei Chen, Yang Liu, Yang Feng, Qun Liu

Although neural machine translation (NMT) has achieved impressive progress recently, it is usually trained on the clean parallel data set and hence cannot work well when the input sentence is the production of the automatic speech recognition (ASR) system due to the enormous errors in the source.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

E2E NLG Challenge Submission: Towards Controllable Generation of Diverse Natural Language

no code implementations • WS 2018 • Henry Elder, Sebastian Gehrmann, Alex O{'}Connor, er, Qun Liu

In natural language generation (NLG), the task is to generate utterances from a more abstract input, such as structured data.

Machine Translation Task-Oriented Dialogue Systems +1

Paper
Add Code

Learning to Jointly Translate and Predict Dropped Pronouns with a Shared Reconstruction Mechanism

no code implementations • EMNLP 2018 • Long-Yue Wang, Zhaopeng Tu, Andy Way, Qun Liu

Pronouns are frequently omitted in pro-drop languages, such as Chinese, generally leading to significant challenges with respect to the production of complete translations.

Machine Translation Translation

Paper
Add Code

Speeding Up Neural Machine Translation Decoding by Cube Pruning

no code implementations • EMNLP 2018 • Wen Zhang, Liang Huang, Yang Feng, Lei Shen, Qun Liu

Although neural machine translation has achieved promising results, it suffers from slow translation speed.

Machine Translation Translation

Paper
Add Code

Tailoring Neural Architectures for Translating from Morphologically Rich Languages

no code implementations • COLING 2018 • Peyman Passban, Andy Way, Qun Liu

A morphologically complex word (MCW) is a hierarchical constituent with meaning-preserving subunits, so word-based models which rely on surface forms might not be powerful enough to translate such structures.

Machine Translation NMT +2

Paper
Add Code

Knowledge Diffusion for Neural Dialogue Generation

1 code implementation • ACL 2018 • Shuman Liu, Hongshen Chen, Zhaochun Ren, Yang Feng, Qun Liu, Dawei Yin

Our empirical study on a real-world dataset prove that our model is capable of generating meaningful, diverse and natural responses for both factoid-questions and knowledge grounded chi-chats.

Dialogue Generation Question Answering +1

Paper
Code

Multimodal Neural Machine Translation for Low-resource Language Pairs using Synthetic Data

no code implementations • WS 2018 • Koel Dutta Chowdhury, Mohammed Hasanuzzaman, Qun Liu

In this paper, we investigate the effectiveness of training a multimodal neural machine translation (MNMT) system with image features for a low-resource language pair, Hindi and English, using synthetic data.

Machine Translation Question Answering +3

Paper
Add Code

Pixel-level Reconstruction and Classification for Noisy Handwritten Bangla Characters

no code implementations • 21 Jun 2018 • Manohar Karki, Qun Liu, Robert DiBiano, Saikat Basu, Supratik Mukhopadhyay

Classification techniques for images of handwritten characters are susceptible to noise.

Ranked #1 on Document Image Classification on n-MNIST

Classification Document Image Classification +1

Paper
Add Code

Understanding Meanings in Multilingual Customer Feedback

no code implementations • 5 Jun 2018 • Chao-Hong Liu, Declan Groves, Akira Hayakawa, Alberto Poncelas, Qun Liu

Understanding and being able to react to customer feedback is the most fundamental task in providing good customer service.

General Classification

Paper
Add Code

Refining Source Representations with Relation Networks for Neural Machine Translation

no code implementations • COLING 2018 • Wen Zhang, Jiawei Hu, Yang Feng, Qun Liu

Although neural machine translation with the encoder-decoder framework has achieved great success recently, it still suffers drawbacks of forgetting distant information, which is an inherent disadvantage of recurrent neural network structure, and disregarding relationship between source words during encoding step.

Machine Translation Memorization +2

Paper
Add Code

SafeRNet: Safe Transportation Routing in the era of Internet of Vehicles and Mobile Crowd Sensing

no code implementations • 3 May 2018 • Qun Liu, Suman Kumar, Vijay Mago

This paper proposes SafeRNet, a safe route computation framework which takes advantage of these technologies to analyze streaming traffic data and historical data to effectively infer safe routes and deliver them back to users in real time.

Cloud Computing

Paper
Add Code

Unsupervised Learning using Pretrained CNN and Associative Memory Bank

no code implementations • 2 May 2018 • Qun Liu, Supratik Mukhopadhyay

In this paper, we present a new architecture and an approach for unsupervised object recognition that addresses the above mentioned problem with fine tuning associated with pretrained CNN-based supervised deep learning approaches while allowing automated feature extraction.

Ranked #1 on Fine-Grained Image Classification on Caltech-101 (Accuracy metric)

Few-Shot Image Classification Fine-Grained Image Classification +2

Paper
Add Code

Improving Character-based Decoding Using Target-Side Morphological Information for Neural Machine Translation

no code implementations • NAACL 2018 • Peyman Passban, Qun Liu, Andy Way

Recently, neural machine translation (NMT) has emerged as a powerful alternative to conventional statistical approaches.

Machine Translation NMT +1

Paper
Add Code

Translating Pro-Drop Languages with Reconstruction Models

1 code implementation • 10 Jan 2018 • Long-Yue Wang, Zhaopeng Tu, Shuming Shi, Tong Zhang, Yvette Graham, Qun Liu

Next, the annotated source sentence is reconstructed from hidden representations in the NMT model.

Machine Translation NMT +2

Paper
Code

ADAPT Centre Cone Team at IJCNLP-2017 Task 5: A Similarity-Based Logistic Regression Approach to Multi-choice Question Answering in an Examinations Shared Task

no code implementations • IJCNLP 2017 • Daria Dzendzik, Alberto Poncelas, Carl Vogel, Qun Liu

We describe the work of a team from the ADAPT Centre in Ireland in addressing automatic answer selection for the Multi-choice Question Answering in Examinations shared task.

Answer Selection regression

Paper
Add Code

Semantics-Enhanced Task-Oriented Dialogue Translation: A Case Study on Hotel Booking

no code implementations • IJCNLP 2017 • Long-Yue Wang, Jinhua Du, Liangyou Li, Zhaopeng Tu, Andy Way, Qun Liu

We showcase TODAY, a semantics-enhanced task-oriented dialogue translation system, whose novelties are: (i) task-oriented named entity (NE) definition and a hybrid strategy for NE recognition and translation; and (ii) a novel grounded semantic method for dialogue understanding and task-order management.

Dialogue Understanding Machine Translation +3

Paper
Add Code

CASICT Tibetan Word Segmentation System for MLWS2017

1 code implementation • 17 Oct 2017 • Jiawei Hu, Qun Liu

We participated in the MLWS 2017 on Tibetan word segmentation task, our system is trained in a unrestricted way, by introducing a baseline system and 76w tibetan segmented sentences of ours.

Segmentation

2,153

Paper
Code

Refining Source Representations with Relation Networks for Neural Machine Translation

no code implementations • 12 Sep 2017 • Wen Zhang, Jiawei Hu, Yang Feng, Qun Liu

Although neural machine translation (NMT) with the encoder-decoder framework has achieved great success in recent times, it still suffers from some drawbacks: RNNs tend to forget old information which is often useful and the encoder only operates through words without considering word relationship.

Machine Translation NMT +2

Paper
Add Code

Information-Propogation-Enhanced Neural Machine Translation by Relation Model

no code implementations • 6 Sep 2017 • Wen Zhang, Jiawei Hu, Yang Feng, Qun Liu

Even though sequence-to-sequence neural machine translation (NMT) model have achieved state-of-art performance in the recent fewer years, but it is widely concerned that the recurrent neural network (RNN) units are very hard to capture the long-distance state information, which means RNN can hardly find the feature with long term dependency as the sequence becomes longer.

Machine Translation NMT +4

Paper
Add Code

Blend: a Novel Combined MT Metric Based on Direct Assessment --- CASICT-DCU submission to WMT17 Metrics Task

no code implementations • WS 2017 • Qingsong Ma, Yvette Graham, Shugen Wang, Qun Liu

Machine Translation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.