Search Results for author: Xuanjing Huang

Found 261 papers, 120 papers with code

The Rise and Potential of Large Language Model Based Agents: A Survey

1 code implementation • 14 Sep 2023 • Zhiheng Xi, Wenxiang Chen, Xin Guo, wei he, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wensen Cheng, Qi Zhang, Wenjuan Qin, Yongyan Zheng, Xipeng Qiu, Xuanjing Huang, Tao Gui

Many efforts have been made to develop intelligent agents, but they mainly focus on advancement in algorithms or training strategies to enhance specific capabilities or performance on particular tasks.

Language Modelling Large Language Model

5,243

Paper
Code

Pre-trained Models for Natural Language Processing: A Survey

3 code implementations • 18 Mar 2020 • Xipeng Qiu, Tianxiang Sun, Yige Xu, Yunfan Shao, Ning Dai, Xuanjing Huang

Recently, the emergence of pre-trained models (PTMs) has brought natural language processing (NLP) to a new era.

Representation Learning

2,026

Paper
Code

Secrets of RLHF in Large Language Models Part I: PPO

1 code implementation • 11 Jul 2023 • Rui Zheng, Shihan Dou, Songyang Gao, Yuan Hua, Wei Shen, Binghai Wang, Yan Liu, Senjie Jin, Qin Liu, Yuhao Zhou, Limao Xiong, Lu Chen, Zhiheng Xi, Nuo Xu, Wenbin Lai, Minghao Zhu, Cheng Chang, Zhangyue Yin, Rongxiang Weng, Wensen Cheng, Haoran Huang, Tianxiang Sun, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang

Therefore, we explore the PPO-max, an advanced version of PPO algorithm, to efficiently improve the training stability of the policy model.

1,161

Paper
Code

Secrets of RLHF in Large Language Models Part II: Reward Modeling

1 code implementation • 11 Jan 2024 • Binghai Wang, Rui Zheng, Lu Chen, Yan Liu, Shihan Dou, Caishuang Huang, Wei Shen, Senjie Jin, Enyu Zhou, Chenyu Shi, Songyang Gao, Nuo Xu, Yuhao Zhou, Xiaoran Fan, Zhiheng Xi, Jun Zhao, Xiao Wang, Tao Ji, Hang Yan, Lixing Shen, Zhan Chen, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang

We introduce a series of novel methods to mitigate the influence of incorrect and ambiguous preferences in the dataset and fully leverage high-quality preference data.

Contrastive Learning Meta-Learning +1

1,161

Paper
Code

FLAT: Chinese NER Using Flat-Lattice Transformer

1 code implementation • ACL 2020 • Xiaonan Li, Hang Yan, Xipeng Qiu, Xuanjing Huang

Recently, the character-word lattice structure has been proved to be effective for Chinese named entity recognition (NER) by incorporating the word information.

Ranked #5 on Chinese Named Entity Recognition on MSRA

Chinese Named Entity Recognition named-entity-recognition +3

987

Paper
Code

CDEvalSumm: An Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems

2 code implementations • Findings of the Association for Computational Linguistics 2020 • Yiran Chen, PengFei Liu, Ming Zhong, Zi-Yi Dou, Danqing Wang, Xipeng Qiu, Xuanjing Huang

In this paper, we perform an in-depth analysis of characteristics of different datasets and investigate the performance of different summarization models under a cross-dataset setting, in which a summarizer trained on one corpus will be evaluated on a range of out-of-domain corpora.

Text Summarization

896

Paper
Code

fastHan: A BERT-based Multi-Task Toolkit for Chinese NLP

1 code implementation • ACL 2021 • Zhichao Geng, Hang Yan, Xipeng Qiu, Xuanjing Huang

The joint-model is trained and evaluated on 13 corpora of four tasks, yielding near state-of-the-art (SOTA) performance in dependency parsing and NER, achieving SOTA performance in CWS and POS.

Chinese Word Segmentation Dependency Parsing +6

742

Paper
Code

TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing

1 code implementation • ACL 2021 • Tao Gui, Xiao Wang, Qi Zhang, Qin Liu, Yicheng Zou, Xin Zhou, Rui Zheng, Chong Zhang, Qinzhuo Wu, Jiacheng Ye, Zexiong Pang, Yongxin Zhang, Zhengyan Li, Ruotian Ma, Zichu Fei, Ruijian Cai, Jun Zhao, Xingwu Hu, Zhiheng Yan, Yiding Tan, Yuan Hu, Qiyuan Bian, Zhihua Liu, Bolin Zhu, Shan Qin, Xiaoyu Xing, Jinlan Fu, Yue Zhang, Minlong Peng, Xiaoqing Zheng, Yaqian Zhou, Zhongyu Wei, Xipeng Qiu, Xuanjing Huang

To guarantee user acceptability, all the text transformations are linguistically based, and we provide a human evaluation for each one.

Adversarial Attack named-entity-recognition +5

627

Paper
Code

How to Fine-Tune BERT for Text Classification?

16 code implementations • 14 May 2019 • Chi Sun, Xipeng Qiu, Yige Xu, Xuanjing Huang

Language model pre-training has proven to be useful in learning universal language representations.

Ranked #1 on Text Classification on Yahoo! Answers

General Classification Language Modelling +2

585

Paper
Code

Extractive Summarization as Text Matching

2 code implementations • ACL 2020 • Ming Zhong, PengFei Liu, Yiran Chen, Danqing Wang, Xipeng Qiu, Xuanjing Huang

This paper creates a paradigm shift with regard to the way we build neural extractive summarization systems.

Ranked #1 on Text Summarization on BBC XSum

Document Summarization Extractive Summarization +4

518

Paper
Code

Simplify the Usage of Lexicon in Chinese NER

2 code implementations • ACL 2020 • Ruotian Ma, Minlong Peng, Qi Zhang, Xuanjing Huang

This method avoids designing a complicated sequence modeling architecture, and for any neural NER model, it requires only subtle adjustment of the character representation layer to introduce the lexicon information.

Ranked #8 on Chinese Named Entity Recognition on Resume NER

Chinese Named Entity Recognition named-entity-recognition +2

430

Paper
Code

DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services

2 code implementations • 20 Sep 2023 • Shengbin Yue, Wei Chen, Siyuan Wang, Bingxuan Li, Chenchen Shen, Shujun Liu, Yuxuan Zhou, Yao Xiao, Song Yun, Xuanjing Huang, Zhongyu Wei

We propose DISC-LawLLM, an intelligent legal system utilizing large language models (LLMs) to provide a wide range of legal services.

Legal Reasoning Retrieval

427

Paper
Code

DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning

1 code implementation • 23 Oct 2023 • Wei Chen, Qiushi Wang, Zefei Long, Xianyin Zhang, Zhongtian Lu, Bingxuan Li, Siyuan Wang, Jiarong Xu, Xiang Bai, Xuanjing Huang, Zhongyu Wei

We propose Multiple Experts Fine-tuning Framework to build a financial large language model (LLM), DISC-FinLLM.

Language Modelling Large Language Model +2

427

Paper
Code

CoNT: Contrastive Neural Text Generation

2 code implementations • 29 May 2022 • Chenxin An, Jiangtao Feng, Kai Lv, Lingpeng Kong, Xipeng Qiu, Xuanjing Huang

We validate CoNT on five generation tasks with ten benchmarks, including machine translation, summarization, code comment generation, data-to-text generation and commonsense generation.

Code Comment Generation Comment Generation +4

420

Paper
Code

DISC-MedLLM: Bridging General Large Language Models and Real-World Medical Consultation

1 code implementation • 28 Aug 2023 • Zhijie Bao, Wei Chen, Shengze Xiao, Kuang Ren, Jiaao Wu, Cheng Zhong, Jiajie Peng, Xuanjing Huang, Zhongyu Wei

We propose DISC-MedLLM, a comprehensive solution that leverages Large Language Models (LLMs) to provide accurate and truthful medical response in end-to-end conversational healthcare services.

Knowledge Graphs

415

Paper
Code

Rethinking Generalization of Neural Models: A Named Entity Recognition Case Study

1 code implementation • 12 Jan 2020 • Jinlan Fu, PengFei Liu, Qi Zhang, Xuanjing Huang

While neural network-based models have achieved impressive performance on a large body of NLP tasks, the generalization behavior of different models remains poorly understood: Does this excellent performance imply a perfect generalization model, or are there still some limitations?

named-entity-recognition Named Entity Recognition +1

394

Paper
Code

DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models

1 code implementation • 28 Nov 2022 • Zhengfu He, Tianxiang Sun, Kuanning Wang, Xuanjing Huang, Xipeng Qiu

We present DiffusionBERT, a new generative masked language model based on discrete diffusion models.

Denoising Language Modelling +1

270

Paper
Code

Black-Box Tuning for Language-Model-as-a-Service

2 code implementations • 10 Jan 2022 • Tianxiang Sun, Yunfan Shao, Hong Qian, Xuanjing Huang, Xipeng Qiu

In such a scenario, which we call Language-Model-as-a-Service (LMaaS), the gradients of PTMs are usually unavailable.

In-Context Learning Language Modelling

253

Paper
Code

BBTv2: Towards a Gradient-Free Future with Large Language Models

1 code implementation • 23 May 2022 • Tianxiang Sun, Zhengfu He, Hong Qian, Yunhua Zhou, Xuanjing Huang, Xipeng Qiu

By contrast, gradient-free methods only require the forward computation of the PTM to tune the prompt, retaining the benefits of efficient tuning and deployment.

Few-Shot Learning Language Modelling

253

Paper
Code

Heterogeneous Graph Neural Networks for Extractive Document Summarization

1 code implementation • ACL 2020 • Danqing Wang, PengFei Liu, Yining Zheng, Xipeng Qiu, Xuanjing Huang

An intuitive way is to put them in the graph-based neural network, which has a more complex structure for capturing inter-sentence relationships.

Document Summarization Extractive Document Summarization +3

238

Paper
Code

EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models

1 code implementation • 18 Mar 2024 • Weikang Zhou, Xiao Wang, Limao Xiong, Han Xia, Yingshuang Gu, Mingxu Chai, Fukang Zhu, Caishuang Huang, Shihan Dou, Zhiheng Xi, Rui Zheng, Songyang Gao, Yicheng Zou, Hang Yan, Yifan Le, Ruohui Wang, Lijun Li, Jing Shao, Tao Gui, Qi Zhang, Xuanjing Huang

This paper introduces EasyJailbreak, a unified framework simplifying the construction and evaluation of jailbreak attacks against LLMs.

229

Paper
Code

RethinkCWS: Is Chinese Word Segmentation a Solved Task?

1 code implementation • EMNLP 2020 • Jinlan Fu, PengFei Liu, Qi Zhang, Xuanjing Huang

The performance of the Chinese Word Segmentation (CWS) systems has gradually reached a plateau with the rapid development of deep neural networks, especially the successful use of large pre-trained models.

Chinese Word Segmentation

189

Paper
Code

Style Transformer: Unpaired Text Style Transfer without Disentangled Latent Representation

4 code implementations • ACL 2019 • Ning Dai, Jianze Liang, Xipeng Qiu, Xuanjing Huang

Disentangling the content and style in the latent space is prevalent in unpaired text style transfer.

Sentence Style Transfer +1

170

Paper
Code

Distantly Supervised Named Entity Recognition using Positive-Unlabeled Learning

1 code implementation • ACL 2019 • Minlong Peng, Xiaoyu Xing, Qi Zhang, Jinlan Fu, Xuanjing Huang

In this work, we explore the way to perform named entity recognition (NER) using only unlabeled data and named entity dictionaries.

named-entity-recognition Named Entity Recognition +2

157

Paper
Code

K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters

2 code implementations • Findings (ACL) 2021 • Ruize Wang, Duyu Tang, Nan Duan, Zhongyu Wei, Xuanjing Huang, Jianshu ji, Guihong Cao, Daxin Jiang, Ming Zhou

We study the problem of injecting knowledge into large pre-trained models like BERT and RoBERTa.

Ranked #1 on Entity Typing on Open Entity

Dependency Parsing Entity Typing +2

151

Paper
Code

SpanNER: Named Entity Re-/Recognition as Span Prediction

1 code implementation • ACL 2021 • Jinlan Fu, Xuanjing Huang, PengFei Liu

Recent years have seen the paradigm shift of Named Entity Recognition (NER) systems from sequence labeling to span prediction.

named-entity-recognition Named Entity Recognition +1

119

Paper
Code

Overview of the NLPCC 2017 Shared Task: Chinese News Headline Categorization

1 code implementation • 9 Jun 2017 • Xipeng Qiu, Jingjing Gong, Xuanjing Huang

In this paper, we give an overview for the shared task at the CCF Conference on Natural Language Processing \& Chinese Computing (NLPCC 2017): Chinese News Headline Categorization.

116

Paper
Code

CoLAKE: Contextualized Language and Knowledge Embedding

1 code implementation • COLING 2020 • Tianxiang Sun, Yunfan Shao, Xipeng Qiu, Qipeng Guo, Yaru Hu, Xuanjing Huang, Zheng Zhang

With the emerging branch of incorporating factual knowledge into pre-trained language models such as BERT, most existing models consider shallow, static, and separately pre-trained entity embeddings, which limits the performance gains of these models.

Entity Embeddings Knowledge Graph Completion +1

114

Paper
Code

Template-free Prompt Tuning for Few-shot NER

1 code implementation • NAACL 2022 • Ruotian Ma, Xin Zhou, Tao Gui, Yiding Tan, Linyang Li, Qi Zhang, Xuanjing Huang

Prompt-based methods have been successfully applied in sentence-level few-shot learning tasks, mostly owing to the sophisticated design of templates and label words.

Few-Shot Learning Few-shot NER +1

111

Paper
Code

Orthogonal Subspace Learning for Language Model Continual Learning

1 code implementation • 22 Oct 2023 • Xiao Wang, Tianze Chen, Qiming Ge, Han Xia, Rong Bao, Rui Zheng, Qi Zhang, Tao Gui, Xuanjing Huang

In this paper, we propose orthogonal low-rank adaptation (O-LoRA), a simple and efficient approach for continual learning in language models, effectively mitigating catastrophic forgetting while learning new tasks.

Continual Learning Language Modelling

106

Paper
Code

Searching for Effective Neural Extractive Summarization: What Works and What's Next

2 code implementations • ACL 2019 • Ming Zhong, PengFei Liu, Danqing Wang, Xipeng Qiu, Xuanjing Huang

The recent years have seen remarkable success in the use of deep neural networks on text summarization.

Ranked #6 on Extractive Text Summarization on CNN / Daily Mail

Extractive Summarization Extractive Text Summarization

Paper
Code

GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge

3 code implementations • IJCNLP 2019 • Luyao Huang, Chi Sun, Xipeng Qiu, Xuanjing Huang

Word Sense Disambiguation (WSD) aims to find the exact sense of an ambiguous word in a particular context.

Ranked #3 on Word Sense Disambiguation on WiC-TSV

Word Sense Disambiguation

Paper
Code

Learning Sparse Sharing Architectures for Multiple Tasks

1 code implementation • 12 Nov 2019 • Tianxiang Sun, Yunfan Shao, Xiaonan Li, PengFei Liu, Hang Yan, Xipeng Qiu, Xuanjing Huang

Most existing deep multi-task learning models are based on parameter sharing, such as hard sharing, hierarchical sharing, and soft sharing.

Multi-Task Learning

Paper
Code

LoRAMoE: Alleviate World Knowledge Forgetting in Large Language Models via MoE-Style Plugin

1 code implementation • 15 Dec 2023 • Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Jun Zhao, Wei Shen, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Xiaoran Fan, ShiLiang Pu, Jiang Zhu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang

Supervised fine-tuning (SFT) is a crucial step for large language models (LLMs), enabling them to align with human instructions and enhance their capabilities in downstream tasks.

Language Modelling Multi-Task Learning +1

Paper
Code

Toward Diverse Text Generation with Inverse Reinforcement Learning

3 code implementations • 30 Apr 2018 • Zhan Shi, Xinchi Chen, Xipeng Qiu, Xuanjing Huang

Similar to the adversarial models, the reward and policy function in IRL are optimized alternately.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

Topic-Oriented Spoken Dialogue Summarization for Customer Service with Saliency-Aware Topic Modeling

1 code implementation • 14 Dec 2020 • Yicheng Zou, Lujun Zhao, Yangyang Kang, Jun Lin, Minlong Peng, Zhuoren Jiang, Changlong Sun, Qi Zhang, Xuanjing Huang, Xiaozhong Liu

In a customer service system, dialogue summarization can boost service efficiency by automatically creating summaries for long spoken dialogues in which customers and agents try to address issues about specific topics.

Paper
Code

Do Large Language Models Know What They Don't Know?

1 code implementation • 29 May 2023 • Zhangyue Yin, Qiushi Sun, Qipeng Guo, Jiawen Wu, Xipeng Qiu, Xuanjing Huang

Large language models (LLMs) have a wealth of knowledge that allows them to excel in various Natural Language Processing (NLP) tasks.

In-Context Learning

Paper
Code

Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models

1 code implementation • ACL 2021 • Chong Li, Cenyuan Zhang, Xiaoqing Zheng, Xuanjing Huang

A sequence-to-sequence learning with neural networks has empirically proven to be an effective framework for Chinese Spelling Correction (CSC), which takes a sentence with some spelling errors as input and outputs the corrected one.

Sentence Spelling Correction +1

Paper
Code

MouSi: Poly-Visual-Expert Vision-Language Models

1 code implementation • 30 Jan 2024 • Xiaoran Fan, Tao Ji, Changhao Jiang, Shuo Li, Senjie Jin, Sirui Song, Junke Wang, Boyang Hong, Lu Chen, Guodong Zheng, Ming Zhang, Caishuang Huang, Rui Zheng, Zhiheng Xi, Yuhao Zhou, Shihan Dou, Junjie Ye, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang

This technique introduces a fusion network to unify the processing of outputs from different visual experts, while bridging the gap between image encoders and pre-trained LLMs.

Ranked #40 on Visual Question Answering on MM-Vet

Image Segmentation Image-text matching +4

Paper
Code

Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation

1 code implementation • 24 Feb 2020 • Yige Xu, Xipeng Qiu, Ligao Zhou, Xuanjing Huang

Fine-tuning pre-trained language models like BERT has become an effective way in NLP and yields state-of-the-art results on many downstream tasks.

Natural Language Inference text-classification +1

Paper
Code

Towards Efficient NLP: A Standard Evaluation and A Strong Baseline

1 code implementation • NAACL 2022 • Xiangyang Liu, Tianxiang Sun, Junliang He, Jiawen Wu, Lingling Wu, Xinyu Zhang, Hao Jiang, Zhao Cao, Xuanjing Huang, Xipeng Qiu

ELUE is dedicated to depict the Pareto Frontier for various language understanding tasks, such that it can tell whether and how much a method achieves Pareto improvement.

Paper
Code

ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios

1 code implementation • 1 Jan 2024 • Junjie Ye, Guanyu Li, Songyang Gao, Caishuang Huang, Yilong Wu, Sixian Li, Xiaoran Fan, Shihan Dou, Qi Zhang, Tao Gui, Xuanjing Huang

Furthermore, a sole emphasis on outcomes disregards the intricate capabilities essential for LLMs to effectively utilize tools.

Paper
Code

Hierarchical Reinforcement Learning for Automatic Disease Diagnosis

4 code implementations • 29 Apr 2020 • Cheng Zhong, Kangenbei Liao, Wei Chen, Qianlong Liu, Baolin Peng, Xuanjing Huang, Jiajie Peng, Zhongyu Wei

Existing approaches usually employ a flat policy structure that treat all symptoms and diseases equally for action making.

Hierarchical Reinforcement Learning reinforcement-learning +1

Paper
Code

Paradigm Shift in Natural Language Processing

1 code implementation • 26 Sep 2021 • Tianxiang Sun, Xiangyang Liu, Xipeng Qiu, Xuanjing Huang

In this paper, we review such phenomenon of paradigm shifts in recent years, highlighting several paradigms that have the potential to solve different NLP tasks.

Chunking NER +3

Paper
Code

A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets

1 code implementation • 19 Apr 2022 • Wei Chen, Zhiwei Li, Hongyi Fang, Qianyuan Yao, Cheng Zhong, Jianye Hao, Qi Zhang, Xuanjing Huang, Jiajie Peng, Zhongyu Wei

In recent years, interest has arisen in using machine learning to improve the efficiency of automatic medical consultation and enhance patient experience.

Dialogue Act Classification Dialogue Understanding +4

Paper
Code

Information Aggregation via Dynamic Routing for Sequence Encoding

2 code implementations • COLING 2018 • Jingjing Gong, Xipeng Qiu, Shaojing Wang, Xuanjing Huang

The dynamic routing policy is dynamically deciding that what and how much information need be transferred from each word to the final encoding of the text sequence.

Ranked #43 on Sentiment Analysis on IMDb

Sentiment Analysis text-classification +1

Paper
Code

Tasty Burgers, Soggy Fries: Probing Aspect Robustness in Aspect-Based Sentiment Analysis

1 code implementation • EMNLP 2020 • Xiaoyu Xing, Zhijing Jin, Di Jin, Bingning Wang, Qi Zhang, Xuanjing Huang

Based on the SemEval 2014 dataset, we construct the Aspect Robustness Test Set (ARTS) as a comprehensive probe of the aspect robustness of ABSA models.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA)

Paper
Code

TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models

1 code implementation • 10 Oct 2023 • Xiao Wang, Yuansen Zhang, Tianze Chen, Songyang Gao, Senjie Jin, Xianjun Yang, Zhiheng Xi, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xuanjing Huang

In this paper, we introduce TRACE, a novel benchmark designed to evaluate continual learning in LLMs.

Code Generation Continual Learning +3

Paper
Code

Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning

1 code implementation • 14 Oct 2022 • Tianxiang Sun, Zhengfu He, Qin Zhu, Xipeng Qiu, Xuanjing Huang

MP2 is a set of combinable prompts pre-trained on 38 Chinese tasks.

Few-Shot Learning Machine Reading Comprehension

Paper
Code

StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback

1 code implementation • 2 Feb 2024 • Shihan Dou, Yan Liu, Haoxiang Jia, Limao Xiong, Enyu Zhou, Wei Shen, Junjie Shan, Caishuang Huang, Xiao Wang, Xiaoran Fan, Zhiheng Xi, Yuhao Zhou, Tao Ji, Rui Zheng, Qi Zhang, Xuanjing Huang, Tao Gui

The advancement of large language models (LLMs) has significantly propelled the field of code generation.

Code Completion Code Generation +2

Paper
Code

A Graph-based Model for Joint Chinese Word Segmentation and Dependency Parsing

1 code implementation • TACL 2020 • Hang Yan, Xipeng Qiu, Xuanjing Huang

Our graph-based joint model achieves better performance than previous joint models and state-of-the-art results in both Chinese word segmentation and dependency parsing.

Chinese Word Segmentation Dependency Parsing +3

Paper
Code

BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation

1 code implementation • 14 Oct 2022 • Tianxiang Sun, Junliang He, Xipeng Qiu, Xuanjing Huang

Automatic evaluation metrics are crucial to the development of generative systems.

Fairness Language Modelling +1

Paper
Code

Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and Context-Aware Auto-Encoders

1 code implementation • 14 Dec 2020 • Yicheng Zou, Jun Lin, Lujun Zhao, Yangyang Kang, Zhuoren Jiang, Changlong Sun, Qi Zhang, Xuanjing Huang, Xiaozhong Liu

Automatic chat summarization can help people quickly grasp important information from numerous chat messages.

Denoising Topic coverage

Paper
Code

MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective

2 code implementations • ACL 2022 • Xiao Wang, Shihan Dou, Limao Xiong, Yicheng Zou, Qi Zhang, Tao Gui, Liang Qiao, Zhanzhan Cheng, Xuanjing Huang

NER model has achieved promising performance on standard NER benchmarks.

Ranked #8 on Named Entity Recognition (NER) on WNUT 2017

named-entity-recognition Named Entity Recognition +1

Paper
Code

MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage Learning

1 code implementation • 29 Jan 2022 • Zejun Li, Zhihao Fan, Huaixiao Tou, Jingjing Chen, Zhongyu Wei, Xuanjing Huang

In MVPTR, we follow the nested structure of both modalities to introduce concepts as high-level semantics.

Image-text matching Language Modelling +3

Paper
Code

SENT: Sentence-level Distant Relation Extraction via Negative Training

1 code implementation • ACL 2021 • Ruotian Ma, Tao Gui, Linyang Li, Qi Zhang, Yaqian Zhou, Xuanjing Huang

In this work, we propose the use of negative training (NT), in which a model is trained using complementary labels regarding that ``the instance does not belong to these complementary labels".

Relation Relation Extraction +1

Paper
Code

ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks

1 code implementation • 4 Oct 2023 • Zejun Li, Ye Wang, Mengfei Du, Qingwen Liu, Binhao Wu, Jiwen Zhang, Chengxing Zhou, Zhihao Fan, Jie Fu, Jingjing Chen, Xuanjing Huang, Zhongyu Wei

Recent years have witnessed remarkable progress in the development of large vision-language models (LVLMs).

Paper
Code

Long Short-Term Memory with Dynamic Skip Connections

1 code implementation • 9 Nov 2018 • Tao Gui, Qi Zhang, Lujun Zhao, Yaosong Lin, Minlong Peng, Jingjing Gong, Xuanjing Huang

In recent years, long short-term memory (LSTM) has been successfully used to model sequential data of variable length.

Ranked #33 on Sentiment Analysis on IMDb

Named Entity Recognition (NER) Sentiment Analysis

Paper
Code

Accelerating BERT Inference for Sequence Labeling via Early-Exit

1 code implementation • ACL 2021 • Xiaonan Li, Yunfan Shao, Tianxiang Sun, Hang Yan, Xipeng Qiu, Xuanjing Huang

To alleviate this problem, we extend the recent successful early-exit mechanism to accelerate the inference of PTMs for sequence labeling tasks.

Sentence

Paper
Code

CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors

1 code implementation • 9 May 2023 • Peng Li, Tianxiang Sun, Qiong Tang, Hang Yan, Yuanbin Wu, Xuanjing Huang, Xipeng Qiu

A common practice is to recast the task into a text-to-text format such that generative LLMs of natural language (NL-LLMs) like GPT-3 can be prompted to solve it.

Code Generation Few-Shot Learning +4

Paper
Code

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

1 code implementation • 8 Feb 2024 • Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, wei he, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang

In this paper, we propose R$^3$: Learning Reasoning through Reverse Curriculum Reinforcement Learning (RL), a novel method that employs only outcome supervision to achieve the benefits of process supervision for large language models.

GSM8K reinforcement-learning +1

Paper
Code

Uncertainty-Aware Label Refinement for Sequence Labeling

1 code implementation • EMNLP 2020 • Tao Gui, Jiacheng Ye, Qi Zhang, Zhengyan Li, Zichu Fei, Yeyun Gong, Xuanjing Huang

Conditional random fields (CRF) for label decoding has become ubiquitous in sequence labeling tasks.

Paper
Code

Enhancing Scientific Papers Summarization with Citation Graph

1 code implementation • 7 Apr 2021 • Chenxin An, Ming Zhong, Yiran Chen, Danqing Wang, Xipeng Qiu, Xuanjing Huang

Previous work for text summarization in scientific domain mainly focused on the content of the input document, but seldom considering its citation network.

Text Summarization

Paper
Code

COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization

1 code implementation • COLING 2022 • Chenxin An, Ming Zhong, Zhiyong Wu, Qin Zhu, Xuanjing Huang, Xipeng Qiu

Traditional training paradigms for extractive and abstractive summarization systems always only use token-level or sentence-level training objectives.

Abstractive Text Summarization Contrastive Learning +2

Paper
Code

PathQG: Neural Question Generation from Facts

1 code implementation • EMNLP 2020 • Siyuan Wang, Zhongyu Wei, Zhihao Fan, Zengfeng Huang, Weijian Sun, Qi Zhang, Xuanjing Huang

Human evaluation also proves that our model is able to generate relevant and informative questions.

Question Generation Question-Generation +1

Paper
Code

KNN-BERT: Fine-Tuning Pre-Trained Models with KNN Classifier

1 code implementation • 6 Oct 2021 • Linyang Li, Demin Song, Ruotian Ma, Xipeng Qiu, Xuanjing Huang

Pre-trained models are widely used in fine-tuning downstream tasks with linear classifiers optimized by the cross-entropy loss, which might face robustness and stability problems.

Contrastive Learning text-classification +1

Paper
Code

CQG: A Simple and Effective Controlled Generation Framework for Multi-hop Question Generation

1 code implementation • ACL 2022 • Zichu Fei, Qi Zhang, Tao Gui, Di Liang, Sirui Wang, Wei Wu, Xuanjing Huang

CQG employs a simple method to generate the multi-hop questions that contain key entities in multi-hop reasoning chains, which ensure the complexity and quality of the questions.

Question Generation Question-Generation

Paper
Code

LongHeads: Multi-Head Attention is Secretly a Long Context Processor

1 code implementation • 16 Feb 2024 • Yi Lu, Xin Zhou, wei he, Jun Zhao, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang

Instead of allowing each head to attend to the full sentence, which struggles with generalizing to longer sequences due to out-of-distribution (OOD) issues, we allow each head to process in-distribution length by selecting and attending to important context chunks.

Sentence

Paper
Code

Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering

1 code implementation • COLING 2022 • Siyuan Wang, Zhongyu Wei, Zhihao Fan, Qi Zhang, Xuanjing Huang

In this paper, we propose an interpretable stepwise reasoning framework to incorporate both single-hop supporting sentence identification and single-hop question generation at each intermediate step, and utilize the inference of the current hop for the next until reasoning out the final result.

Multi-hop Question Answering Question Answering +3

Paper
Code

Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement

1 code implementation • 23 May 2023 • Zhiheng Xi, Senjie Jin, Yuhao Zhou, Rui Zheng, Songyang Gao, Tao Gui, Qi Zhang, Xuanjing Huang

To enhance the multi-step reasoning capabilities of large language models, researchers have extensively explored prompting methods, notably the Chain-of-Thought (CoT) method which explicitly elicits human-like rationales.

GSM8K

Paper
Code

From Hypergraph Energy Functions to Hypergraph Neural Networks

1 code implementation • 16 Jun 2023 • Yuxin Wang, Quan Gan, Xipeng Qiu, Xuanjing Huang, David Wipf

Hypergraphs are a powerful abstraction for representing higher-order interactions between entities of interest.

Bilevel Optimization Node Classification

Paper
Code

A Concise Model for Multi-Criteria Chinese Word Segmentation with Transformer Encoder

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Xipeng Qiu, Hengzhi Pei, Hang Yan, Xuanjing Huang

Multi-criteria Chinese word segmentation (MCCWS) aims to exploit the relations among the multiple heterogeneous segmentation criteria and further improve the performance of each single criterion.

Chinese Word Segmentation Multi-Task Learning +1

Paper
Code

Bridging by Word: Image Grounded Vocabulary Construction for Visual Captioning

1 code implementation • ACL 2019 • Zhihao Fan, Zhongyu Wei, Siyuan Wang, Xuanjing Huang

Existing research usually employs the architecture of CNN-RNN that views the generation as a sequential decision-making process and the entire dataset vocabulary is used as decoding space.

Decision Making Image Captioning

Paper
Code

VCWE: Visual Character-Enhanced Word Embeddings

1 code implementation • NAACL 2019 • Chi Sun, Xipeng Qiu, Xuanjing Huang

Chinese is a logographic writing system, and the shape of Chinese characters contain rich syntactic and semantic information.

named-entity-recognition Named Entity Recognition +5

Paper
Code

Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble

1 code implementation • 20 Jun 2020 • Yi Zhou, Xiaoqing Zheng, Cho-Jui Hsieh, Kai-Wei Chang, Xuanjing Huang

Despite neural networks have achieved prominent performance on many natural language processing (NLP) tasks, they are vulnerable to adversarial examples.

Sentence

Paper
Code

Certified Robustness to Text Adversarial Attacks by Randomized [MASK]

1 code implementation • 8 May 2021 • Jiehang Zeng, Xiaoqing Zheng, Jianhan Xu, Linyang Li, Liping Yuan, Xuanjing Huang

Recently, few certified defense methods have been developed to provably guarantee the robustness of a text classifier to adversarial synonym substitutions.

Paper
Code

Defense against Synonym Substitution-based Adversarial Attacks via Dirichlet Neighborhood Ensemble

1 code implementation • ACL 2021 • Yi Zhou, Xiaoqing Zheng, Cho-Jui Hsieh, Kai-Wei Chang, Xuanjing Huang

Although deep neural networks have achieved prominent performance on many NLP tasks, they are vulnerable to adversarial examples.

Sentence

Paper
Code

Robust Lottery Tickets for Pre-trained Language Models

2 code implementations • ACL 2022 • Rui Zheng, Rong Bao, Yuhao Zhou, Di Liang, Sirui Wang, Wei Wu, Tao Gui, Qi Zhang, Xuanjing Huang

Recent works on Lottery Ticket Hypothesis have shown that pre-trained language models (PLMs) contain smaller matching subnetworks(winning tickets) which are capable of reaching accuracy comparable to the original models.

Adversarial Robustness

Paper
Code

Mask Attention Networks: Rethinking and Strengthen Transformer

1 code implementation • NAACL 2021 • Zhihao Fan, Yeyun Gong, Dayiheng Liu, Zhongyu Wei, Siyuan Wang, Jian Jiao, Nan Duan, Ruofei Zhang, Xuanjing Huang

We therefore introduce a new layer named dynamic mask attention network (DMAN) with a learnable mask matrix which is able to model localness adaptively.

Ranked #11 on Machine Translation on WMT2014 English-German

Abstractive Text Summarization Machine Translation +2

Paper
Code

Flooding-X: Improving BERT’s Resistance to Adversarial Attacks via Loss-Restricted Fine-Tuning

2 code implementations • ACL 2022 • Qin Liu, Rui Zheng, Bao Rong, Jingyi Liu, Zhihua Liu, Zhanzhan Cheng, Liang Qiao, Tao Gui, Qi Zhang, Xuanjing Huang

Adversarial robustness has attracted much attention recently, and the mainstream solution is adversarial training.

Adversarial Robustness text-classification +1

Paper
Code

SpikeBERT: A Language Spikformer Learned from BERT with Knowledge Distillation

1 code implementation • 29 Aug 2023 • Changze Lv, Tianlong Li, Jianhan Xu, Chenxi Gu, Zixuan Ling, Cenyuan Zhang, Xiaoqing Zheng, Xuanjing Huang

Spiking neural networks (SNNs) offer a promising avenue to implement deep neural networks in a more energy-efficient way.

Knowledge Distillation text-classification +1

Paper
Code

Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication

1 code implementation • 4 Dec 2023 • Zhangyue Yin, Qiushi Sun, Cheng Chang, Qipeng Guo, Junqi Dai, Xuanjing Huang, Xipeng Qiu

Large Language Models (LLMs) have recently made significant strides in complex reasoning tasks through the Chain-of-Thought technique.

Language Modelling Large Language Model

Paper
Code

Math Word Problem Solving with Explicit Numerical Values

1 code implementation • ACL 2021 • Qinzhuo Wu, Qi Zhang, Zhongyu Wei, Xuanjing Huang

In recent years, math word problem solving has received considerable attention and achieved promising results, but previous methods rarely take numerical values into consideration.

Math Math Word Problem Solving

Paper
Code

RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning

1 code implementation • 16 Jan 2024 • Junjie Ye, Yilong Wu, Songyang Gao, Caishuang Huang, Sixian Li, Guanyu Li, Xiaoran Fan, Qi Zhang, Tao Gui, Xuanjing Huang

To bridge this gap, we introduce RoTBench, a multi-level benchmark for evaluating the robustness of LLMs in tool learning.

Paper
Code

What Dense Graph Do You Need for Self-Attention?

1 code implementation • 27 May 2022 • Yuxin Wang, Chu-Tak Lee, Qipeng Guo, Zhangyue Yin, Yunhua Zhou, Xuanjing Huang, Xipeng Qiu

Transformers have made progress in miscellaneous tasks, but suffer from quadratic computational and memory complexities.

Miscellaneous

Paper
Code

Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts

1 code implementation • 20 Oct 2022 • Xiangyang Liu, Tianxiang Sun, Xuanjing Huang, Xipeng Qiu

Through extensive experimental results across various tasks and PTMs, we show that LPT can achieve competitive performance to full model tuning and other PETuning methods under both full-data and few-shot scenarios while possessing faster training speed and lower memory cost.

Paper
Code

Constructing Multiple Tasks for Augmentation: Improving Neural Image Classification With K-means Features

1 code implementation • 18 Nov 2019 • Tao Gui, Lizhi Qing, Qi Zhang, Jiacheng Ye, HangYan, Zichu Fei, Xuanjing Huang

In order to effectively reduce the impact of non-ideal auxiliary tasks on the main task, we further proposed a novel meta-learning-based multi-task learning approach, which trained the shared hidden layers on auxiliary tasks, while the meta-optimization objective was to minimize the loss on the main task, ensuring that the optimizing direction led to an improvement on the main task.

Clustering Data Augmentation +4

Paper
Code

Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature Perspective

2 code implementations • COLING 2022 • Shihan Dou, Rui Zheng, Ting Wu, Songyang Gao, Junjie Shan, Qi Zhang, Yueming Wu, Xuanjing Huang

Most of the existing debiasing methods often identify and weaken these samples with biased features (i. e., superficial surface features that cause such spurious correlations).

Fact Verification Natural Language Inference +1

Paper
Code

A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation

1 code implementation • Findings (ACL) 2022 • Tianxiang Sun, Xiangyang Liu, Wei Zhu, Zhichao Geng, Lingling Wu, Yilong He, Yuan Ni, Guotong Xie, Xuanjing Huang, Xipeng Qiu

Previous works usually adopt heuristic metrics such as the entropy of internal outputs to measure instance difficulty, which suffers from generalization and threshold-tuning.

Paper
Code

Query Structure Modeling for Inductive Logical Reasoning Over Knowledge Graphs

1 code implementation • 23 May 2023 • Siyuan Wang, Zhongyu Wei, Meng Han, Zhihao Fan, Haijun Shan, Qi Zhang, Xuanjing Huang

The results demonstrate the effectiveness of our method on logical reasoning over KGs in both inductive and transductive settings.

Knowledge Graphs Logical Reasoning

Paper
Code

LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration

1 code implementation • 18 Feb 2024 • Jun Zhao, Can Zu, Hao Xu, Yi Lu, wei he, Yiwen Ding, Tao Gui, Qi Zhang, Xuanjing Huang

Large language models (LLMs) have demonstrated impressive performance in understanding language and executing complex reasoning tasks.

Multi-hop Question Answering Question Answering +1

Paper
Code

CodeChameleon: Personalized Encryption Framework for Jailbreaking Large Language Models

1 code implementation • 26 Feb 2024 • Huijie Lv, Xiao Wang, Yuansen Zhang, Caishuang Huang, Shihan Dou, Junjie Ye, Tao Gui, Qi Zhang, Xuanjing Huang

Adversarial misuse, particularly through `jailbreaking' that circumvents a model's safety and ethical protocols, poses a significant challenge for Large Language Models (LLMs).

Code Completion Response Generation

Paper
Code

Attention-Based Convolutional Neural Network for Semantic Relation Extraction

1 code implementation • COLING 2016 • Yatian Shen, Xuanjing Huang

Nowadays, neural networks play an important role in the task of relation classification.

Ranked #26 on Relation Extraction on SemEval-2010 Task-8

Feature Engineering General Classification +5

Paper
Code

Discrete Argument Representation Learning for Interactive Argument Pair Identification

1 code implementation • NAACL 2021 • Lu Ji, Zhongyu Wei, Jing Li, Qi Zhang, Xuanjing Huang

In this paper, we focus on extracting interactive argument pairs from two posts with opposite stances to a certain topic.

Representation Learning

Paper
Code

Chinese Named Entity Recognition Augmented with Lexicon Memory

1 code implementation • 17 Dec 2019 • Yi Zhou, Xiaoqing Zheng, Xuanjing Huang

Inspired by a concept of content-addressable retrieval from cognitive science, we propose a novel fragment-based model augmented with a lexicon-based memory for Chinese NER, in which both the character-level and word-level features are combined to generate better feature representations for possible name candidates.

Chinese Named Entity Recognition named-entity-recognition +4

Paper
Code

Incorporating Argument-Level Interactions for Persuasion Comments Evaluation using Co-attention Model

1 code implementation • COLING 2018 • Lu Ji, Zhongyu Wei, Xiangkun Hu, Yang Liu, Qi Zhang, Xuanjing Huang

In this paper, we investigate the issue of persuasiveness evaluation for argumentative comments.

Persuasiveness

Paper
Code

Rethinking Label Smoothing on Multi-hop Question Answering

2 code implementations • 19 Dec 2022 • Zhangyue Yin, Yuxin Wang, Xiannian Hu, Yiguang Wu, Hang Yan, Xinyu Zhang, Zhao Cao, Xuanjing Huang, Xipeng Qiu

Multi-Hop Question Answering (MHQA) is a significant area in question answering, requiring multiple reasoning components, including document retrieval, supporting sentence prediction, and answer span extraction.

Image Classification Machine Reading Comprehension +6

Paper
Code

RethinkingTMSC: An Empirical Study for Target-Oriented Multimodal Sentiment Classification

1 code implementation • 14 Oct 2023 • Junjie Ye, Jie zhou, Junfeng Tian, Rui Wang, Qi Zhang, Tao Gui, Xuanjing Huang

Recently, Target-oriented Multimodal Sentiment Classification (TMSC) has gained significant attention among scholars.

Sentiment Analysis Sentiment Classification

Paper
Code

Benchmark Self-Evolving: A Multi-Agent Framework for Dynamic LLM Evaluation

1 code implementation • 18 Feb 2024 • Siyuan Wang, Zhuohan Long, Zhihao Fan, Zhongyu Wei, Xuanjing Huang

Towards a more scalable, robust and fine-grained evaluation, we implement six reframing operations to construct evolving instances testing LLMs against diverse queries, data noise and probing their problem-solving sub-abilities.

Model Selection

Paper
Code

Kernel-Whitening: Overcome Dataset Bias with Isotropic Sentence Embedding

1 code implementation • 14 Oct 2022 • Songyang Gao, Shihan Dou, Qi Zhang, Xuanjing Huang

Dataset bias has attracted increasing attention recently for its detrimental effect on the generalization ability of fine-tuned models.

Sentence Sentence Embedding +2

Paper
Code

F-Eval: Asssessing Fundamental Abilities with Refined Evaluation Methods

1 code implementation • 26 Jan 2024 • Yu Sun, Keyu Chen, Shujie Wang, Qipeng Guo, Hang Yan, Xipeng Qiu, Xuanjing Huang, Dahua Lin

However, these evaluation benchmarks are limited to assessing the instruction-following capabilities, overlooking the fundamental abilities that emerge during the pre-training stage.

Instruction Following

Paper
Code

Debatrix: Multi-dimensional Debate Judge with Iterative Chronological Analysis Based on LLM

1 code implementation • 12 Mar 2024 • Jingcong Liang, Rong Ye, Meng Han, Ruofei Lai, Xinyu Zhang, Xuanjing Huang, Zhongyu Wei

How can we construct an automated debate judge to evaluate an extensive, vibrant, multi-turn debate?

Paper
Code

Length Generalization of Causal Transformers without Position Encoding

1 code implementation • 18 Apr 2024 • Jie Wang, Tao Ji, Yuanbin Wu, Hang Yan, Tao Gui, Qi Zhang, Xuanjing Huang, Xiaoling Wang

Generalizing to longer sentences is important for recent Transformer-based language models.

Language Modelling Position +1

Paper
Code

Learning Task-specific Representation for Novel Words in Sequence Labeling

1 code implementation • 29 May 2019 • Minlong Peng, Qi Zhang, Xiaoyu Xing, Tao Gui, Jinlan Fu, Xuanjing Huang

However, representations of unseen or rare words trained on the end task are usually poor for appreciable performance.

named-entity-recognition Named Entity Recognition +3

Paper
Code

On the Tip of the Tongue: Analyzing Conceptual Representation in Large Language Models with Reverse-Dictionary Probe

1 code implementation • 22 Feb 2024 • Ningyu Xu, Qi Zhang, Menghan Zhang, Peng Qian, Xuanjing Huang

Here we re-purpose the reverse dictionary task as a case study to probe LLMs' capacity for conceptual inference.

In-Context Learning Reverse Dictionary

Paper
Code

A Confidence-based Partial Label Learning Model for Crowd-Annotated Named Entity Recognition

1 code implementation • 21 May 2023 • Limao Xiong, Jie zhou, Qunxi Zhu, Xiao Wang, Yuanbin Wu, Qi Zhang, Tao Gui, Xuanjing Huang, Jin Ma, Ying Shan

Particularly, we propose a Confidence-based Partial Label Learning (CPLL) method to integrate the prior confidence (given by annotators) and posterior confidences (learned by models) for crowd-annotated NER.

named-entity-recognition Named Entity Recognition +2

Paper
Code

Open Set Relation Extraction via Unknown-Aware Training

1 code implementation • 8 Jun 2023 • Jun Zhao, Xin Zhao, WenYu Zhan, Qi Zhang, Tao Gui, Zhongyu Wei, Yunwen Chen, Xiang Gao, Xuanjing Huang

Inspired by text adversarial attacks, we adaptively apply small but critical perturbations to original training instances and thus synthesizing negative instances that are more likely to be mistaken by the model as known relations.

Relation Relation Extraction

Paper
Code

Argue with Me Tersely: Towards Sentence-Level Counter-Argument Generation

1 code implementation • 21 Dec 2023 • Jiayu Lin, Rong Ye, Meng Han, Qi Zhang, Ruofei Lai, Xinyu Zhang, Zhao Cao, Xuanjing Huang, Zhongyu Wei

The results show the competitiveness of our proposed framework and evaluator in counter-argument generation tasks.

Sentence

Paper
Code

A Progressive Framework for Role-Aware Rumor Resolution

1 code implementation • COLING 2022 • Lei Chen, Guanying Li, Zhongyu Wei, Yang Yang, Baohua Zhou, Qi Zhang, Xuanjing Huang

Existing works on rumor resolution have shown great potential in recognizing word appearance and user participation.

Paper
Code

Making Parameter-efficient Tuning More Efficient: A Unified Framework for Classification Tasks

1 code implementation • COLING 2022 • Xin Zhou, Ruotian Ma, Yicheng Zou, Xuanting Chen, Tao Gui, Qi Zhang, Xuanjing Huang, Rui Xie, Wei Wu

Specifically, we re-formulate both token and sentence classification tasks into a unified language modeling task, and map label spaces of different tasks into the same vocabulary space.

Language Modelling Sentence +2

Paper
Code

A Structure-Aware Argument Encoder for Literature Discourse Analysis

1 code implementation • COLING 2022 • Yinzi Li, Wei Chen, Zhongyu Wei, Yujun Huang, Chujun Wang, Siyuan Wang, Qi Zhang, Xuanjing Huang, Libo Wu

Existing research for argument representation learning mainly treats tokens in the sentence equally and ignores the implied structure information of argumentative context.

Position Representation Learning +1

Paper
Code

Efficient Adversarial Training with Robust Early-Bird Tickets

1 code implementation • 14 Nov 2022 • Zhiheng Xi, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang

Adversarial training is one of the most powerful methods to improve the robustness of pre-trained language models (PLMs).

Paper
Code

Cross-Linguistic Syntactic Difference in Multilingual BERT: How Good is It and How Does It Affect Transfer?

1 code implementation • 21 Dec 2022 • Ningyu Xu, Tao Gui, Ruotian Ma, Qi Zhang, Jingting Ye, Menghan Zhang, Xuanjing Huang

We demonstrate that the distance between the distributions of different languages is highly consistent with the syntactic difference in terms of linguistic formalisms.

Zero-Shot Cross-Lingual Transfer

Paper
Code

Modeling the Q-Diversity in a Min-max Play Game for Robust Optimization

1 code implementation • 20 May 2023 • Ting Wu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang

Models trained with empirical risk minimization (ERM) are revealed to easily rely on spurious correlations, resulting in poor generalization.

Out-of-Distribution Generalization text-classification +1

Paper
Code

ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages

1 code implementation • 16 Feb 2024 • Junjie Ye, Sixian Li, Guanyu Li, Caishuang Huang, Songyang Gao, Yilong Wu, Qi Zhang, Tao Gui, Xuanjing Huang

Tool learning is widely acknowledged as a foundational approach or deploying large language models (LLMs) in real-world scenarios.

Paper
Code

Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models

1 code implementation • 1 Apr 2024 • wei he, Shichun Liu, Jun Zhao, Yiwen Ding, Yi Lu, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang

The generated demos strategically interpolate between existing demos and the given query, transforming the query from OOD to ID.

In-Context Learning Math

Paper
Code

Are Structural Concepts Universal in Transformer Language Models? Towards Interpretable Cross-Lingual Generalization

1 code implementation • 19 Oct 2023 • Ningyu Xu, Qi Zhang, Jingting Ye, Menghan Zhang, Xuanjing Huang

We then propose a meta-learning-based method to learn to align conceptual spaces of different languages, which facilitates zero-shot and few-shot generalization in concept classification and also offers insights into the cross-lingual in-context learning phenomenon.

In-Context Learning Meta-Learning +1

Paper
Code

Hi-ArG: Exploring the Integration of Hierarchical Argumentation Graphs in Language Pretraining

1 code implementation • 1 Dec 2023 • Jingcong Liang, Rong Ye, Meng Han, Qi Zhang, Ruofei Lai, Xinyu Zhang, Zhao Cao, Xuanjing Huang, Zhongyu Wei

In this paper, we propose the Hierarchical Argumentation Graph (Hi-ArG), a new structure to organize arguments.

Knowledge Graphs

Paper
Code

DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning

1 code implementation • 2 Apr 2024 • Mengfei Du, Binhao Wu, Jiwen Zhang, Zhihao Fan, Zejun Li, Ruipu Luo, Xuanjing Huang, Zhongyu Wei

For task completion, the agent needs to align and integrate various navigation modalities, including instruction, observation and navigation history.

Contrastive Learning Decision Making +2

Paper
Code

Incorporating Discriminator in Sentence Generation: a Gibbs Sampling Method

no code implementations • 25 Feb 2018 • Jinyue Su, Jiacheng Xu, Xipeng Qiu, Xuanjing Huang

Generating plausible and fluent sentence with desired properties has long been a challenge.

Sentence

Paper
Add Code

Meta Multi-Task Learning for Sequence Modeling

no code implementations • 25 Feb 2018 • Junkun Chen, Xipeng Qiu, Pengfei Liu, Xuanjing Huang

Specifically, we use a shared meta-network to capture the meta-knowledge of semantic composition and generate the parameters of the task-specific semantic composition models.

Multi-Task Learning Representation Learning +3

Paper
Add Code

A Feature-Enriched Neural Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging

no code implementations • 16 Nov 2016 • Xinchi Chen, Xipeng Qiu, Xuanjing Huang

Recently, neural network models for natural language processing tasks have been increasingly focused on for their ability of alleviating the burden of manual feature engineering.

Chinese Word Segmentation Feature Engineering +1

Paper
Add Code

DAG-based Long Short-Term Memory for Neural Word Segmentation

no code implementations • 2 Jul 2017 • Xinchi Chen, Zhan Shi, Xipeng Qiu, Xuanjing Huang

In this paper, we propose a new neural model to incorporate the word-level information for Chinese word segmentation.

Chinese Word Segmentation Feature Engineering +2

Paper
Add Code

Dynamic Compositional Neural Networks over Tree Structure

no code implementations • 11 May 2017 • Pengfei Liu, Xipeng Qiu, Xuanjing Huang

Tree-structured neural networks have proven to be effective in learning semantic representations by exploiting syntactic information.

Learning Semantic Representations

Paper
Add Code

Adversarial Multi-Criteria Learning for Chinese Word Segmentation

no code implementations • ACL 2017 • Xinchi Chen, Zhan Shi, Xipeng Qiu, Xuanjing Huang

Different linguistic perspectives causes many diverse segmentation criteria for Chinese word segmentation (CWS).

Chinese Word Segmentation Segmentation

Paper
Add Code

Adversarial Multi-task Learning for Text Classification

no code implementations • ACL 2017 • Pengfei Liu, Xipeng Qiu, Xuanjing Huang

Neural network models have shown their promising opportunities for multi-task learning, which focus on learning the shared layers to extract the common and task-invariant features.

General Classification Multi-Task Learning +2

Paper
Add Code

Knowledge Graph Representation with Jointly Structural and Textual Encoding

no code implementations • 26 Nov 2016 • Jiacheng Xu, Kan Chen, Xipeng Qiu, Xuanjing Huang

In this paper, we propose a novel deep architecture to utilize both structural and textual information of entities.

General Classification Knowledge Graph Embedding +2

Paper
Add Code

End-to-End Neural Sentence Ordering Using Pointer Network

no code implementations • 15 Nov 2016 • Jingjing Gong, Xinchi Chen, Xipeng Qiu, Xuanjing Huang

However, it is nontrivial for pair-wise models to incorporate the contextual sentence information.

Sentence Sentence Ordering

Paper
Add Code

Deep Multi-Task Learning with Shared Memory

no code implementations • 23 Sep 2016 • Pengfei Liu, Xipeng Qiu, Xuanjing Huang

Neural network based models have achieved impressive results on various specific tasks.

General Classification Multi-Task Learning +2

Paper
Add Code

Learning Word Embeddings from Intrinsic and Extrinsic Views

no code implementations • 20 Aug 2016 • Jifan Chen, Kan Chen, Xipeng Qiu, Qi Zhang, Xuanjing Huang, Zheng Zhang

To prove the effectiveness of our model, we evaluate it on four tasks, including word similarity, reverse dictionaries, Wiki link prediction, and document classification.

Descriptive Document Classification +4

Paper
Add Code

Neural Sentence Ordering

no code implementations • 23 Jul 2016 • Xinchi Chen, Xipeng Qiu, Xuanjing Huang

Sentence ordering is a general and critical task for natural language generation applications.

Document Summarization Multi-Document Summarization +2

Paper
Add Code

Syntax-based Attention Model for Natural Language Inference

no code implementations • 22 Jul 2016 • PengFei Liu, Xipeng Qiu, Xuanjing Huang

Introducing attentional mechanism in neural network is a powerful concept, and has achieved impressive results in many natural language processing tasks.

Natural Language Inference Sentence

Paper
Add Code

Modelling Interaction of Sentence Pair with coupled-LSTMs

no code implementations • EMNLP 2016 • Pengfei Liu, Xipeng Qiu, Xuanjing Huang

Recently, there is rising interest in modelling the interactions of two sentences with deep neural networks.

Ranked #73 on Natural Language Inference on SNLI

Sentence

Paper
Add Code

Recurrent Neural Network for Text Classification with Multi-Task Learning

no code implementations • 17 May 2016 • Pengfei Liu, Xipeng Qiu, Xuanjing Huang

Neural network based methods have obtained great progress on a variety of natural language processing tasks.

Ranked #10 on Emotion Recognition in Conversation on CPED

Emotion Recognition in Conversation General Classification +3

Paper
Add Code

Bridging LSTM Architecture and the Neural Dynamics during Reading

no code implementations • 22 Apr 2016 • Peng Qian, Xipeng Qiu, Xuanjing Huang

Recently, the long short-term memory neural network (LSTM) has attracted wide interest due to its success in many tasks.

Paper
Add Code

Gaussian Mixture Embeddings for Multiple Word Prototypes

no code implementations • 19 Nov 2015 • Xinchi Chen, Xipeng Qiu, Jingxiang Jiang, Xuanjing Huang

In this paper, we propose the Gaussian mixture skip-gram (GMSG) model to learn the Gaussian mixture embeddings for words based on skip-gram framework.

Paper
Add Code

Overview of the NLPCC 2015 Shared Task: Chinese Word Segmentation and POS Tagging for Micro-blog Texts

no code implementations • 28 May 2015 • Xipeng Qiu, Peng Qian, Liusong Yin, Shiyu Wu, Xuanjing Huang

In this paper, we give an overview for the shared task at the 4th CCF Conference on Natural Language Processing \& Chinese Computing (NLPCC 2015): Chinese word segmentation and part-of-speech (POS) tagging for micro-blog texts.

Chinese Word Segmentation Part-Of-Speech Tagging +3

Paper
Add Code

A Re-ranking Model for Dependency Parser with Recursive Convolutional Neural Network

no code implementations • IJCNLP 2015 • Chenxi Zhu, Xipeng Qiu, Xinchi Chen, Xuanjing Huang

In this work, we address the problem to model all the nodes (words or phrases) in a dependency tree with the dense representations.

Dependency Parsing Re-Ranking

Paper
Add Code

Gaussian Word Embedding with a Wasserstein Distance Loss

no code implementations • 21 Aug 2018 • Chi Sun, Hang Yan, Xipeng Qiu, Xuanjing Huang

Therefore, with the aim of representing words in a highly efficient way, we propose to operate a Gaussian word embedding model with a loss function based on the Wasserstein distance.

Document Classification General Classification +1

Paper
Add Code

Exploring Shared Structures and Hierarchies for Multiple NLP Tasks

no code implementations • 23 Aug 2018 • Junkun Chen, Kaiyu Chen, Xinchi Chen, Xipeng Qiu, Xuanjing Huang

Designing shared neural architecture plays an important role in multi-task learning.

General Classification Multi-Task Learning +5

Paper
Add Code

Deformable Stacked Structure for Named Entity Recognition

no code implementations • 24 Sep 2018 • Shuyang Cao, Xipeng Qiu, Xuanjing Huang

Neural architecture for named entity recognition has achieved great success in the field of natural language processing.

named-entity-recognition Named Entity Recognition +1

Paper
Add Code

Meta-Learning Multi-task Communication

no code implementations • 23 Oct 2018 • Pengfei Liu, Xuanjing Huang

In this paper, we describe a general framework: Parameters Read-Write Networks (PRaWNs) to systematically analyze current neural models for multi-task learning, in which we find that existing models expect to disentangle features into different spaces while features learned in practice are still entangled in shared space, leaving potential hazards for other training or unseen tasks.

Inductive Bias Meta-Learning +1

Paper
Add Code

Contextualized Non-local Neural Networks for Sequence Learning

no code implementations • 21 Nov 2018 • Pengfei Liu, Shuaichen Chang, Xuanjing Huang, Jian Tang, Jackie Chi Kit Cheung

Recently, a large number of neural mechanisms and models have been proposed for sequence learning, of which self-attention, as exemplified by the Transformer model, and graph neural networks (GNNs) have attracted much attention.

General Classification Sentence +2

Paper
Add Code

Automatic Essay Scoring Incorporating Rating Schema via Reinforcement Learning

no code implementations • EMNLP 2018 • Yucheng Wang, Zhongyu Wei, Yaqian Zhou, Xuanjing Huang

Automatic essay scoring (AES) is the task of assigning grades to essays without human interference.

Machine Translation reinforcement-learning +3

Paper
Add Code

Convolutional Interaction Network for Natural Language Inference

no code implementations • EMNLP 2018 • Jingjing Gong, Xipeng Qiu, Xinchi Chen, Dong Liang, Xuanjing Huang

Attention-based neural models have achieved great success in natural language inference (NLI).

Information Retrieval Natural Language Inference +2

Paper
Add Code

Transferring from Formal Newswire Domain with Hypernet for Twitter POS Tagging

no code implementations • EMNLP 2018 • Tao Gui, Qi Zhang, Jingjing Gong, Minlong Peng, Di Liang, Keyu Ding, Xuanjing Huang

However, from a linguistic perspective, Twitter users not only tend to mimic the formal expressions of traditional media, like news, but they also appear to be developing linguistically informal styles.

Ranked #2 on Part-Of-Speech Tagging on Ritter

Domain Adaptation Multi-Task Learning +4

Paper
Add Code

Cross-Domain Sentiment Classification with Target Domain Specific Information

no code implementations • ACL 2018 • Minlong Peng, Qi Zhang, Yu-Gang Jiang, Xuanjing Huang

And we introduce a few target domain labeled data for learning domain-specific information.

Classification General Classification +2

Paper
Add Code

Task-oriented Dialogue System for Automatic Diagnosis

no code implementations • ACL 2018 • Zhongyu Wei, Qianlong Liu, Baolin Peng, Huaixiao Tou, Ting Chen, Xuanjing Huang, Kam-Fai Wong, Xiangying Dai

In this paper, we make a move to build a dialogue system for automatic diagnosis.

Paper
Add Code

Deep Fusion LSTMs for Text Semantic Matching

no code implementations • ACL 2016 • Pengfei Liu, Xipeng Qiu, Jifan Chen, Xuanjing Huang

Ranked #76 on Natural Language Inference on SNLI

Machine Translation Question Answering +2

Paper
Add Code

Investigating Language Universal and Specific Properties in Word Embeddings

no code implementations • ACL 2016 • Peng Qian, Xipeng Qiu, Xuanjing Huang

Word Embeddings

Paper
Add Code

Implicit Discourse Relation Detection via a Deep Architecture with Gated Relevance Network

no code implementations • ACL 2016 • Jifan Chen, Qi Zhang, PengFei Liu, Xipeng Qiu, Xuanjing Huang

Opinion Mining Relation +1

Paper
Add Code

A New Psychometric-inspired Evaluation Metric for Chinese Word Segmentation

no code implementations • ACL 2016 • Peng Qian, Xipeng Qiu, Xuanjing Huang

Chinese Word Segmentation Feature Engineering

Paper
Add Code

Idiom-Aware Compositional Distributed Semantics

no code implementations • EMNLP 2017 • Pengfei Liu, Kaiyu Qian, Xipeng Qiu, Xuanjing Huang

Idioms are peculiar linguistic constructions that impose great challenges for representing the semantics of language, especially in current prevailing end-to-end neural models, which assume that the semantics of a phrase or sentence can be literally composed from its constitutive words.

General Classification Machine Translation +4

Paper
Add Code

Part-of-Speech Tagging for Twitter with Adversarial Neural Networks

no code implementations • EMNLP 2017 • Tao Gui, Qi Zhang, Haoran Huang, Minlong Peng, Xuanjing Huang

In this work, we study the problem of part-of-speech tagging for Tweets.

Ranked #3 on Part-Of-Speech Tagging on Ritter

Part-Of-Speech Tagging Stock Prediction

Paper
Add Code

Deep Multi-Task Learning with Shared Memory for Text Classification

no code implementations • EMNLP 2016 • Pengfei Liu, Xipeng Qiu, Xuanjing Huang

General Classification Machine Translation +3

Paper
Add Code

Generating Abbreviations for Chinese Named Entities Using Recurrent Neural Network with Dynamic Dictionary

no code implementations • EMNLP 2016 • Qi Zhang, Jin Qian, Ya Guo, Yaqian Zhou, Xuanjing Huang

Named Entity Recognition (NER) Opinion Mining

Paper
Add Code

Analyzing Linguistic Knowledge in Sequential Model of Sentence

no code implementations • EMNLP 2016 • Peng Qian, Xipeng Qiu, Xuanjing Huang

Language Modelling Sentence +1

Paper
Add Code

Keyphrase Extraction Using Deep Recurrent Neural Networks on Twitter

no code implementations • EMNLP 2016 • Qi Zhang, Yang Wang, Yeyun Gong, Xuanjing Huang

Clustering Keyphrase Extraction

Paper
Add Code

Incorporating Topic Aspects for Online Comment Convincingness Evaluation

no code implementations • WS 2018 • Yunfan Gu, Zhongyu Wei, Maoran Xu, Hao Fu, Yang Liu, Xuanjing Huang

In this paper, we propose to incorporate topic aspects information for online comments convincingness evaluation.

Argument Mining

Paper
Add Code

A Lexicon-Based Supervised Attention Model for Neural Sentiment Analysis

no code implementations • COLING 2018 • Yicheng Zou, Tao Gui, Qi Zhang, Xuanjing Huang

Attention mechanisms have been leveraged for sentiment classification tasks because not all words have the same importance.

Classification General Classification +2

Paper
Add Code

A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators

no code implementations • COLING 2018 • Zhihao Fan, Zhongyu Wei, Siyuan Wang, Yang Liu, Xuanjing Huang

Visual Question Generation (VQG) aims to ask natural questions about an image automatically.

Attribute Natural Questions +8

Paper
Add Code

Hashtag Recommendation Using End-To-End Memory Networks with Hierarchical Attention

no code implementations • COLING 2016 • Haoran Huang, Qi Zhang, Yeyun Gong, Xuanjing Huang

By incorporating the hierarchical attention mechanism, the relative improvement in the proposed method over the state-of-the-art method is around 67. 9{\%} in the F1-score.

Collaborative Filtering General Classification +3

Paper
Add Code

A Learning Error Analysis for Structured Prediction with Approximate Inference

no code implementations • NeurIPS 2017 • Yuanbin Wu, Man Lan, Shiliang Sun, Qi Zhang, Xuanjing Huang

In this work, we try to understand the differences between exact and approximate inference algorithms in structured prediction.

Dependency Parsing General Classification +3

Paper
Add Code

Multi-task Learning with Gradient Communication

no code implementations • ICLR 2019 • Pengfei Liu, Xuanjing Huang

In this paper, we describe a general framework to systematically analyze current neural models for multi-task learning, in which we find that existing models expect to disentangle features into different spaces while features learned in practice are still entangled in shared space, leaving potential hazards for other training or unseen tasks.

Inductive Bias Multi-Task Learning

Paper
Add Code

Gated Recursive Neural Network for Chinese Word Segmentation

no code implementations • IJCNLP 2015 • Xinchi Chen, Xipeng Qiu, Chenxi Zhu, Xuanjing Huang

Chinese Word Segmentation Feature Engineering

Paper
Add Code

Latent Semantic Tensor Indexing for Community-based Question Answering

no code implementations • ACL 2013 • Xipeng Qiu, Le Tian, Xuanjing Huang

Question Answering

Paper
Add Code

FudanNLP: A Toolkit for Chinese Natural Language Processing

no code implementations • ACL 2013 • Xipeng Qiu, Qi Zhang, Xuanjing Huang

Dependency Parsing Named Entity Recognition (NER) +1

Paper
Add Code

Hashtag Recommendation Using Dirichlet Process Mixture Models Incorporating Types of Hashtags

no code implementations • EMNLP 2015 • Yeyun Gong, Qi Zhang, Xuanjing Huang

Sentiment Analysis

Paper
Add Code

Sentence Modeling with Gated Recursive Neural Network

no code implementations • EMNLP 2015 • Xinchi Chen, Xipeng Qiu, Chenxi Zhu, Shiyu Wu, Xuanjing Huang

Chinese Word Segmentation Dependency Parsing +5

Paper
Add Code

Long Short-Term Memory Neural Networks for Chinese Word Segmentation

no code implementations • EMNLP 2015 • Xinchi Chen, Xipeng Qiu, Chenxi Zhu, PengFei Liu, Xuanjing Huang

Ranked #3 on Chinese Word Segmentation on MSRA

Chinese Word Segmentation Feature Engineering

Paper
Add Code

Transition-based Dependency Parsing Using Two Heterogeneous Gated Recursive Neural Networks

no code implementations • EMNLP 2015 • Xinchi Chen, Yaqian Zhou, Chenxi Zhu, Xipeng Qiu, Xuanjing Huang

Feature Engineering Transition-Based Dependency Parsing +1

Paper
Add Code

Multi-Timescale Long Short-Term Memory Neural Network for Modelling Sentences and Documents

no code implementations • EMNLP 2015 • Pengfei Liu, Xipeng Qiu, Xinchi Chen, Shiyu Wu, Xuanjing Huang

Machine Translation Spoken Language Understanding +1

Paper
Add Code

Joint Chinese Word Segmentation and POS Tagging on Heterogeneous Annotated Corpora with Multiple Task Learning

no code implementations • EMNLP 2013 • Xipeng Qiu, Jiayi Zhao, Xuanjing Huang

Chinese Word Segmentation Part-Of-Speech Tagging +2

Paper
Add Code

Discourse Level Explanatory Relation Extraction from Product Reviews Using First-Order Logic

no code implementations • EMNLP 2013 • Qi Zhang, Jin Qian, Huan Chen, Jihua Kang, Xuanjing Huang

Opinion Mining Relation +2

Paper
Add Code

Part-of-Speech Tagging for Chinese-English Mixed Texts with Dynamic Features

no code implementations • EMNLP 2012 • Jiayi Zhao, Xipeng Qiu, Shu Zhang, Feng Ji, Xuanjing Huang

Part-Of-Speech Tagging

Paper
Add Code

Time-aware Personalized Hashtag Recommendation on Social Media

no code implementations • COLING 2014 • Qi Zhang, Yeyun Gong, Xuyang Sun, Xuanjing Huang

Sentiment Analysis

Paper
Add Code

A Generative Model for Identifying Target Companies of Microblogs

no code implementations • COLING 2014 • Yeyun Gong, Yaqian Zhou, Ya Guo, Qi Zhang, Xuanjing Huang

Domain Adaptation Opinion Mining +1

Paper
Add Code

Automatic Corpus Expansion for Chinese Word Segmentation by Exploiting the Redundancy of Web Information

no code implementations • COLING 2014 • Xipeng Qiu, ChaoChao Huang, Xuanjing Huang

Chinese Word Segmentation Domain Adaptation

Paper
Add Code

Automatic Hashtag Recommendation for Microblogs using Topic-Specific Translation Model

no code implementations • COLING 2012 • Zhuoye Ding, Qi Zhang, Xuanjing Huang

Opinion Mining Translation

Paper
Add Code

Joint Segmentation and Tagging with Coupled Sequences Labeling

no code implementations • COLING 2012 • Xipeng Qiu, Feng Ji, Jiayi Zhao, Xuanjing Huang

Chinese Word Segmentation

Paper
Add Code

Detecting Spammers in Community Question Answering

no code implementations • IJCNLP 2013 • Zhuoye Ding, Yeyun Gong, Yaqian Zhou, Qi Zhang, Xuanjing Huang

Community Question Answering

Paper
Add Code

Chinese Named Entity Abbreviation Generation Using First-Order Logic

no code implementations • IJCNLP 2013 • Huan Chen, Qi Zhang, Jin Qian, Xuanjing Huang

Question Answering Sentiment Analysis

Paper
Add Code

Understanding the Semantic Intent of Natural Language Query

no code implementations • IJCNLP 2013 • Juan Xu, Qi Zhang, Xuanjing Huang

Paper
Add Code

Bilingual Product Name Dictionary Construction Using a Two Stage Method

no code implementations • WS 2014 • Yatian Shen, Xuanjing Huang

Vocal Bursts Valence Prediction

Paper
Add Code

DropAttention: A Regularization Method for Fully-Connected Self-Attention Networks

no code implementations • 25 Jul 2019 • Lin Zehui, PengFei Liu, Luyao Huang, Junkun Chen, Xipeng Qiu, Xuanjing Huang

Variants dropout methods have been designed for the fully-connected layer, convolutional layer and recurrent layer in neural networks, and shown to be effective to avoid overfitting.

Paper
Add Code

Generating Responses with a Specific Emotion in Dialog

no code implementations • ACL 2019 • Zhenqiao Song, Xiaoqing Zheng, Lu Liu, Mu Xu, Xuanjing Huang

It is desirable for dialog systems to have capability to express specific emotions during a conversation, which has a direct, quantifiable impact on improvement of their usability and user satisfaction.

Paper
Add Code

Exploring Domain Shift in Extractive Text Summarization

no code implementations • 30 Aug 2019 • Danqing Wang, PengFei Liu, Ming Zhong, Jie Fu, Xipeng Qiu, Xuanjing Huang

Although domain shift has been well explored in many NLP applications, it still has received little attention in the domain of extractive text summarization.

Extractive Text Summarization Meta-Learning

Paper
Add Code

Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis

no code implementations • COLING 2020 • Minlong Peng, Qi Zhang, Xuanjing Huang

To address this problem, we propose a modification to DIRL, obtaining a novel weighted domain-invariant representation learning (WDIRL) framework.

Domain Adaptation Representation Learning +1

Paper
Add Code

A Closer Look at Data Bias in Neural Extractive Summarization Models

no code implementations • WS 2019 • Ming Zhong, Danqing Wang, PengFei Liu, Xipeng Qiu, Xuanjing Huang

In this paper, we take stock of the current state of summarization datasets and explore how different factors of datasets influence the generalization behaviour of neural extractive summarization models.

Extractive Summarization

Paper
Add Code

A Lexicon-Based Graph Neural Network for Chinese NER

no code implementations • IJCNLP 2019 • Tao Gui, Yicheng Zou, Qi Zhang, Minlong Peng, Jinlan Fu, Zhongyu Wei, Xuanjing Huang

Recurrent neural networks (RNN) used for Chinese named entity recognition (NER) that sequentially track character and word information have achieved great success.

Ranked #13 on Chinese Named Entity Recognition on OntoNotes 4

Chinese Named Entity Recognition named-entity-recognition +3

Paper
Add Code

Asynchronous Deep Interaction Network for Natural Language Inference

no code implementations • IJCNLP 2019 • Di Liang, Fubao Zhang, Qi Zhang, Xuanjing Huang

However, in the process of reasoning, the role of the two sentences is obviously different, and the sentence pairs for NLI are asymmetrical corpora.

Natural Language Inference Sentence

Paper
Add Code

Keep it Consistent: Topic-Aware Storytelling from an Image Stream via Iterative Multi-agent Communication

no code implementations • COLING 2020 • Ruize Wang, Zhongyu Wei, Ying Cheng, Piji Li, Haijun Shan, Ji Zhang, Qi Zhang, Xuanjing Huang

Visual storytelling aims to generate a narrative paragraph from a sequence of images automatically.

Ranked #9 on Visual Storytelling on VIST

Image Captioning Question Generation +1

Paper
Add Code

Unified Multi-Criteria Chinese Word Segmentation with BERT

no code implementations • 13 Apr 2020 • Zhen Ke, Liang Shi, Erli Meng, Bin Wang, Xipeng Qiu, Xuanjing Huang

Besides, the pre-trained BERT language model has been also introduced into the MCCWS task in a multi-task learning framework.

Chinese Word Segmentation Language Modelling +3

Paper
Add Code

Leveraging Declarative Knowledge in Text and First-Order Logic for Fine-Grained Propaganda Detection

no code implementations • EMNLP 2020 • Ruize Wang, Duyu Tang, Nan Duan, Wanjun Zhong, Zhongyu Wei, Xuanjing Huang, Daxin Jiang, Ming Zhou

We study the detection of propagandistic text fragments in news articles.

Propaganda detection

Paper
Add Code

Evaluating and Enhancing the Robustness of Neural Network-based Dependency Parsing Models with Adversarial Examples

no code implementations • ACL 2020 • Xiaoqing Zheng, Jiehang Zeng, Yi Zhou, Cho-Jui Hsieh, Minhao Cheng, Xuanjing Huang

Despite achieving prominent performance on many important tasks, it has been reported that neural networks are vulnerable to adversarial examples.

Dependency Parsing Question Answering +3

Paper
Add Code

Text Information Aggregation with Centrality Attention

no code implementations • 16 Nov 2020 • Jingjing Gong, Hang Yan, Yining Zheng, Xipeng Qiu, Xuanjing Huang

A lot of natural language processing problems need to encode the text sequence as a fix-length vector, which usually involves aggregation process of combining the representations of all the words, such as pooling or self-attention.

Sentence text-classification +1

Paper
Add Code

A Knowledge-Aware Sequence-to-Tree Network for Math Word Problem Solving

no code implementations • EMNLP 2020 • Qinzhuo Wu, Qi Zhang, Jinlan Fu, Xuanjing Huang

With the advancements in natural language processing tasks, math word problem solving has received increasing attention.

Common Sense Reasoning Graph Attention +2

Paper
Add Code

An Enhanced Knowledge Injection Model for Commonsense Generation

no code implementations • COLING 2020 • Zhihao Fan, Yeyun Gong, Zhongyu Wei, Siyuan Wang, Yameng Huang, Jian Jiao, Xuanjing Huang, Nan Duan, Ruofei Zhang

Commonsense generation aims at generating plausible everyday scenario description based on a set of provided concepts.

Position

Paper
Add Code

SenSeNet: Neural Keyphrase Generation with Document Structure

no code implementations • 12 Dec 2020 • Yichao Luo, Zhengyan Li, Bingning Wang, Xiaoyu Xing, Qi Zhang, Xuanjing Huang

Keyphrase Generation (KG) is the task of generating central topics from a given document or literary work, which captures the crucial information necessary to understand the content.

Inductive Bias Keyphrase Generation +1

Paper
Add Code

Generating Adversarial Examples in Chinese Texts Using Sentence-Pieces

no code implementations • 29 Dec 2020 • Linyang Li, Yunfan Shao, Demin Song, Xipeng Qiu, Xuanjing Huang

The substitutions in the generated adversarial examples are not characters or words but \textit{'pieces'}, which are more natural to Chinese readers.

Language Modelling Sentence

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.