Search Results for author: Xu sun

Found 196 papers, 96 papers with code

Translation as Cross-Domain Knowledge: Attention Augmentation for Unsupervised Cross-Domain Segmenting and Labeling Tasks

1 code implementation • Findings (EMNLP) 2021 • Ruixuan Luo, Yi Zhang, Sishuo Chen, Xu sun

Chinese text has no word delimiters or inflections to indicate segment boundaries or word semantics, which increases the difficulty of Chinese text understanding and intensifies the demand for word-level semantic knowledge to accomplish the tagging goal in Chinese segmenting and labeling tasks.

Translation

Paper
Code

Rethinking Denoised Auto-Encoding in Language Pre-Training

no code implementations • EMNLP 2021 • Fuli Luo, Pengcheng Yang, Shicheng Li, Xuancheng Ren, Xu sun, Songfang Huang, Fei Huang

Pre-trained self-supervised models such as BERT have achieved striking success in learning sequence representations, especially for natural language processing.

Natural Language Understanding Sentence

Paper
Add Code

Leveraging Word-Formation Knowledge for Chinese Word Sense Disambiguation

1 code implementation • Findings (EMNLP) 2021 • Hua Zheng, Lei LI, Damai Dai, Deli Chen, Tianyu Liu, Xu sun, Yang Liu

In this paper, we propose to leverage word-formation knowledge to enhance Chinese WSD.

Word Sense Disambiguation

Paper
Code

Position Offset Label Prediction for Grammatical Error Correction

no code implementations • COLING 2022 • Xiuyu Wu, Jingsong Yu, Xu sun, Yunfang Wu

We introduce a novel position offset label prediction subtask to the encoder-decoder architecture for the grammatical error correction (GEC) task.

Data Augmentation Decoder +3

Paper
Add Code

LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?

1 code implementation • 16 Apr 2024 • Yuchi Wang, Shuhuai Ren, Rundong Gao, Linli Yao, Qingyan Guo, Kaikai An, Jianhong Bai, Xu sun

Diffusion models have exhibited remarkable capabilities in text-to-image generation.

Image Captioning Text Generation +1

Paper
Code

Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality

1 code implementation • 28 Mar 2024 • Sishuo Chen, Lei LI, Shuhuai Ren, Rundong Gao, Yuanxin Liu, Xiaohan Bi, Xu sun, Lu Hou

Video paragraph captioning (VPC) involves generating detailed narratives for long videos, utilizing supportive modalities such as speech and event boundaries.

Data Augmentation Video Understanding

Paper
Code

TempCompass: Do Video LLMs Really Understand Videos?

1 code implementation • 1 Mar 2024 • Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei LI, Sishuo Chen, Xu sun, Lu Hou

Motivated by these two problems, we propose the TempCompass benchmark, which introduces a diversity of temporal aspects and task formats.

Paper
Code

Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents

1 code implementation • 17 Feb 2024 • Wenkai Yang, Xiaohan Bi, Yankai Lin, Sishuo Chen, Jie zhou, Xu sun

We first formulate a general framework of agent backdoor attacks, then we present a thorough analysis on the different forms of agent backdoor attacks.

Backdoor Attack Data Poisoning

Paper
Code

TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

1 code implementation • 4 Dec 2023 • Shuhuai Ren, Linli Yao, Shicheng Li, Xu sun, Lu Hou

This work proposes TimeChat, a time-sensitive multimodal large language model specifically designed for long video understanding.

Dense Captioning Highlight Detection +5

Paper
Code

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models

1 code implementation • 29 Nov 2023 • Shicheng Li, Lei LI, Shuhuai Ren, Yuanxin Liu, Yi Liu, Rundong Gao, Xu sun, Lu Hou

The ability to perceive how objects change over time is a crucial ingredient in human intelligence.

counterfactual

Paper
Code

RECALL: A Benchmark for LLMs Robustness against External Counterfactual Knowledge

no code implementations • 14 Nov 2023 • Yi Liu, Lianzhe Huang, Shicheng Li, Sishuo Chen, Hao Zhou, Fandong Meng, Jie zhou, Xu sun

Therefore, to evaluate the ability of LLMs to discern the reliability of external knowledge, we create a benchmark from existing knowledge bases.

counterfactual Knowledge Graphs +2

Paper
Add Code

FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation

1 code implementation • NeurIPS 2023 • Yuanxin Liu, Lei LI, Shuhuai Ren, Rundong Gao, Shicheng Li, Sishuo Chen, Xu sun, Lu Hou

The multi-aspect categorization of FETV enables fine-grained analysis of the metrics' reliability in different scenarios.

Text-to-Video Generation Video Generation

Paper
Code

TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding

1 code implementation • 29 Oct 2023 • Shuhuai Ren, Sishuo Chen, Shicheng Li, Xu sun, Lu Hou

TESTA can reduce the number of visual tokens by 75% and thus accelerate video encoding.

Ranked #1 on Video Retrieval on Condensed Movies (using extra training data)

Language Modelling Retrieval +2

Paper
Code

Incorporating Pre-trained Model Prompting in Multimodal Stock Volume Movement Prediction

1 code implementation • 11 Sep 2023 • Ruibo Chen, Zhiyuan Zhang, Yi Liu, Ruihan Bao, Keiko Harimoto, Xu sun

Existing multimodal works that train models from scratch face the problem of lacking universal knowledge when modeling financial news.

Time Series

Paper
Code

MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning

1 code implementation • 25 Aug 2023 • Bang Yang, Fenglin Liu, Xian Wu, YaoWei Wang, Xu sun, Yuexian Zou

To deal with the label shortage problem, we present a simple yet effective zero-shot approach MultiCapCLIP that can generate visual captions for different scenarios and languages without any labeled vision-caption pairs of downstream datasets.

Image Captioning Video Captioning

Paper
Code

Towards Codable Watermarking for Injecting Multi-bits Information to LLMs

1 code implementation • 29 Jul 2023 • Lean Wang, Wenkai Yang, Deli Chen, Hao Zhou, Yankai Lin, Fandong Meng, Jie zhou, Xu sun

As large language models (LLMs) generate texts with increasing fluency and realism, there is a growing need to identify the source of texts to prevent the abuse of LLMs.

Language Modelling

Paper
Code

M$^3$IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning

no code implementations • 7 Jun 2023 • Lei LI, Yuwei Yin, Shicheng Li, Liang Chen, Peiyi Wang, Shuhuai Ren, Mukai Li, Yazheng Yang, Jingjing Xu, Xu sun, Lingpeng Kong, Qi Liu

To tackle this challenge and promote research in the vision-language field, we introduce the Multi-Modal, Multilingual Instruction Tuning (M$^3$IT) dataset, designed to optimize VLM alignment with human instructions.

World Knowledge

Paper
Add Code

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

1 code implementation • 23 May 2023 • Lean Wang, Lei LI, Damai Dai, Deli Chen, Hao Zhou, Fandong Meng, Jie zhou, Xu sun

In-context learning (ICL) emerges as a promising capability of large language models (LLMs) by providing them with demonstration examples to perform diverse tasks.

In-Context Learning

Paper
Code

Can Language Models Understand Physical Concepts?

1 code implementation • 23 May 2023 • Lei LI, Jingjing Xu, Qingxiu Dong, Ce Zheng, Qi Liu, Lingpeng Kong, Xu sun

Language models (LMs) gradually become general-purpose interfaces in the interactive and embodied world, where the understanding of physical concepts is an essential prerequisite.

Paper
Code

Communication Efficient Federated Learning for Multilingual Neural Machine Translation with Adapter

1 code implementation • 21 May 2023 • Yi Liu, Xiaohan Bi, Lei LI, Sishuo Chen, Wenkai Yang, Xu sun

However, as pre-trained language models (PLMs) continue to increase in size, the communication cost for transmitting parameters during synchronization has become a training speed bottleneck.

Clustering Federated Learning +2

Paper
Code

PALM: Open Fundus Photograph Dataset with Pathologic Myopia Recognition and Anatomical Structure Annotation

1 code implementation • 13 May 2023 • Huihui Fang, Fei Li, Junde Wu, Huazhu Fu, Xu sun, José Ignacio Orlando, Hrvoje Bogunović, Xiulan Zhang, Yanwu Xu

Our database comprises 1200 images with associated labels for the pathologic myopia category and manual annotations of the optic disc, the position of the fovea and delineations of lesions such as patchy retinal atrophy (including peripapillary atrophy) and retinal detachment.

Paper
Code

Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models Caused by Backdoor or Bias

no code implementations • 8 May 2023 • Zhiyuan Zhang, Deli Chen, Hao Zhou, Fandong Meng, Jie zhou, Xu sun

To settle this issue, we propose the Fine-purifying approach, which utilizes the diffusion theory to study the dynamic process of fine-tuning for finding potentially poisonous dimensions.

Paper
Add Code

Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition

1 code implementation • NeurIPS 2023 • Shuhuai Ren, Aston Zhang, Yi Zhu, Shuai Zhang, Shuai Zheng, Mu Li, Alex Smola, Xu sun

This work proposes POMP, a prompt pre-training method for vision-language models.

Ranked #1 on Open Vocabulary Semantic Segmentation on COCO-Stuff-171

Image Classification object-detection +3

Paper
Code

Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features

2 code implementations • 30 Jan 2023 • Sishuo Chen, Wenkai Yang, Xiaohan Bi, Xu sun

We find that: (1) no existing method behaves well in both settings; (2) fine-tuning PLMs on in-distribution data benefits detecting semantic shifts but severely deteriorates detecting non-semantic shifts, which can be attributed to the distortion of task-agnostic features.

Out-of-Distribution Detection Out of Distribution (OOD) Detection

Paper
Code

Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning

no code implementations • 25 Jan 2023 • Wenkai Yang, Deli Chen, Hao Zhou, Fandong Meng, Jie zhou, Xu sun

Federated Learning (FL) has become a popular distributed learning paradigm that involves multiple clients training a global model collaboratively in a data privacy-preserving manner.

Federated Learning Privacy Preserving

Paper
Add Code

When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning

no code implementations • 25 Jan 2023 • Wenkai Yang, Yankai Lin, Guangxiang Zhao, Peng Li, Jie zhou, Xu sun

Federated Learning has become a widely-used framework which allows learning a global model on decentralized local datasets under the condition of protecting local data privacy.

Federated Learning text-classification +1

Paper
Add Code

A Survey on In-context Learning

1 code implementation • 31 Dec 2022 • Qingxiu Dong, Damai Dai, Ce Zheng, Zhiyong Wu, Baobao Chang, Xu sun, Jingjing Xu, Lei LI, Zhifang Sui

With the increasing ability of large language models (LLMs), in-context learning (ICL) has become a new paradigm for natural language processing (NLP), where LLMs make predictions only based on contexts augmented with a few examples.

In-Context Learning

Paper
Code

Aligning Source Visual and Target Language Domains for Unpaired Video Captioning

no code implementations • 22 Nov 2022 • Fenglin Liu, Xian Wu, Chenyu You, Shen Ge, Yuexian Zou, Xu sun

To this end, we introduce the unpaired video captioning task aiming to train models without coupled video-caption pairs in the target language.

Translation Video Captioning

Paper
Add Code

Gradient Knowledge Distillation for Pre-trained Language Models

1 code implementation • 2 Nov 2022 • Lean Wang, Lei LI, Xu sun

Knowledge distillation (KD) is an effective framework to transfer knowledge from a large-scale teacher to a compact yet well-performing student.

Knowledge Distillation

Paper
Code

DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention

no code implementations • 28 Oct 2022 • Fenglin Liu, Xian Wu, Shen Ge, Xuancheng Ren, Wei Fan, Xu sun, Yuexian Zou

To enhance the correlation between vision and language in disentangled spaces, we introduce the visual concepts to DiMBERT which represent visual information in textual format.

Image Captioning Language Modelling +3

Paper
Add Code

Generating Accurate and Faithful Discharge Instructions: Task, Dataset, and Model

2 code implementations • 23 Oct 2022 • Fenglin Liu, Bang Yang, Chenyu You, Xian Wu, Shen Ge, Zhangdaihong Liu, Xu sun, Yang Yang, David A. Clifton

We build a benchmark clinical dataset and propose the Re3Writer, which imitates the working patterns of physicians to first retrieve related working experience from historical PIs written by physicians, then reason related medical knowledge.

Paper
Code

Prophet Attention: Predicting Attention with Future Attention for Image Captioning

no code implementations • 19 Oct 2022 • Fenglin Liu, Xuancheng Ren, Xian Wu, Wei Fan, Yuexian Zou, Xu sun

Especially for image captioning, the attention based models are expected to ground correct image regions with proper generated words.

Image Captioning

Paper
Add Code

Fine-mixing: Mitigating Backdoors in Fine-tuned Language Models

1 code implementation • 18 Oct 2022 • Zhiyuan Zhang, Lingjuan Lyu, Xingjun Ma, Chenguang Wang, Xu sun

In this work, we take the first step to exploit the pre-trained (unfine-tuned) weights to mitigate backdoors in fine-tuned language models.

Language Modelling Sentence +4

Paper
Code

Holistic Sentence Embeddings for Better Out-of-Distribution Detection

1 code implementation • 14 Oct 2022 • Sishuo Chen, Xiaohan Bi, Rundong Gao, Xu sun

On the basis of the observations that token averaging and layer combination contribute to improving OOD detection, we propose a simple embedding approach named Avg-Avg, which averages all token representations from each intermediate layer as the sentence embedding and significantly surpasses the state-of-the-art on a comprehensive suite of benchmarks by a 9.33% FAR95 margin.

Avg Out-of-Distribution Detection +4

Paper
Code

Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks

1 code implementation • 14 Oct 2022 • Sishuo Chen, Wenkai Yang, Zhiyuan Zhang, Xiaohan Bi, Xu sun

In this work, we take the first step to investigate the unconcealment of textual poisoned samples at the intermediate-feature level and propose a feature-based efficient online defense method.

backdoor defense Sentiment Analysis

Paper
Code

Dim-Krum: Backdoor-Resistant Federated Learning for NLP with Dimension-wise Krum-Based Aggregation

no code implementations • 13 Oct 2022 • Zhiyuan Zhang, Qi Su, Xu sun

NLP attacks tend to have small relative backdoor strengths, which may result in the failure of robust federated aggregation methods for NLP attacks.

Federated Learning

Paper
Add Code

GA-SAM: Gradient-Strength based Adaptive Sharpness-Aware Minimization for Improved Generalization

no code implementations • 13 Oct 2022 • Zhiyuan Zhang, Ruixuan Luo, Qi Su, Xu sun

It demonstrates that flat minima tend to imply better generalization abilities.

Paper
Add Code

From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models

1 code implementation • 11 Oct 2022 • Lei LI, Yankai Lin, Xuancheng Ren, Guangxiang Zhao, Peng Li, Jie zhou, Xu sun

We then design a Model Uncertainty-aware Knowledge Integration (MUKI) framework to recover the golden supervision for the student.

Paper
Code

Stock Trading Volume Prediction with Dual-Process Meta-Learning

1 code implementation • 11 Oct 2022 • Ruibo Chen, Wei Li, Zhiyuan Zhang, Ruihan Bao, Keiko Harimoto, Xu sun

Our method can model the common pattern behind different stocks with a meta-learner, while modeling the specific pattern for each stock across time spans with stock-dependent parameters.

Algorithmic Trading Meta-Learning

Paper
Code

Distributional Correlation-Aware Knowledge Distillation for Stock Trading Volume Prediction

1 code implementation • 4 Aug 2022 • Lei LI, Zhiyuan Zhang, Ruihan Bao, Keiko Harimoto, Xu sun

Traditional knowledge distillation in classification problems transfers the knowledge via class correlations in the soft label produced by teacher models, which are not available in regression problems like stock trading volume prediction.

Knowledge Distillation regression

Paper
Code

Delving into the Openness of CLIP

1 code implementation • 4 Jun 2022 • Shuhuai Ren, Lei LI, Xuancheng Ren, Guangxiang Zhao, Xu sun

However, evaluating the openness of CLIP-like models is challenging, as the models are open to arbitrary vocabulary in theory, but their accuracy varies in practice.

Image Classification Text Matching

Paper
Code

Hierarchical Inductive Transfer for Continual Dialogue Learning

no code implementations • Findings (ACL) 2022 • Shaoxiong Feng, Xuancheng Ren, Kan Li, Xu sun

However, for the continual increase of online chit-chat scenarios, directly fine-tuning these models for each of the new tasks not only explodes the capacity of the dialogue system on the embedded devices but also causes knowledge forgetting on pre-trained models and knowledge interference among diverse dialogue tasks.

General Knowledge

Paper
Add Code

DFTR: Depth-supervised Fusion Transformer for Salient Object Detection

no code implementations • 12 Mar 2022 • Heqin Zhu, Xu sun, Yuexiang Li, Kai Ma, S. Kevin Zhou, Yefeng Zheng

This paper, for the first time, seeks to expand the applicability of depth supervision to the Transformer architecture.

Benchmarking Object +3

Paper
Add Code

REFUGE2 Challenge: A Treasure Trove for Multi-Dimension Analysis and Evaluation in Glaucoma Screening

no code implementations • 18 Feb 2022 • Huihui Fang, Fei Li, Junde Wu, Huazhu Fu, Xu sun, Jaemin Son, Shuang Yu, Menglu Zhang, Chenglang Yuan, Cheng Bian, Baiying Lei, Benjian Zhao, Xinxing Xu, Shaohua Li, Francisco Fumero, José Sigut, Haidar Almubarak, Yakoub Bazi, Yuanhao Guo, Yating Zhou, Ujjwal Baid, Shubham Innani, Tianjiao Guo, Jie Yang, José Ignacio Orlando, Hrvoje Bogunović, Xiulan Zhang, Yanwu Xu

Here we release a multi-annotation, multi-quality, and multi-device color fundus image dataset for glaucoma analysis on an original challenge, the Retinal Fundus Glaucoma Challenge 2nd Edition (REFUGE2).

Domain Adaptation Optic Disc Segmentation

Paper
Add Code

ADAM Challenge: Detecting Age-related Macular Degeneration from Fundus Images

no code implementations • 16 Feb 2022 • Huihui Fang, Fei Li, Huazhu Fu, Xu sun, Xingxing Cao, Fengbin Lin, Jaemin Son, Sunho Kim, Gwenole Quellec, Sarah Matta, Sharath M Shankaranarayana, Yi-Ting Chen, Chuen-heng Wang, Nisarg A. Shah, Chia-Yen Lee, Chih-Chung Hsu, Hai Xie, Baiying Lei, Ujjwal Baid, Shubham Innani, Kang Dang, Wenxiu Shi, Ravi Kamble, Nitin Singhal, Ching-Wei Wang, Shih-Chang Lo, José Ignacio Orlando, Hrvoje Bogunović, Xiulan Zhang, Yanwu Xu, iChallenge-AMD study group

The ADAM challenge consisted of four tasks which cover the main aspects of detecting and characterizing AMD from fundus images, including detection of AMD, detection and segmentation of optic disc, localization of fovea, and detection and segmentation of lesions.

Paper
Add Code

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark

no code implementations • 27 Dec 2021 • Yuan YAO, Qingxiu Dong, Jian Guan, Boxi Cao, Zhengyan Zhang, Chaojun Xiao, Xiaozhi Wang, Fanchao Qi, Junwei Bao, Jinran Nie, Zheni Zeng, Yuxian Gu, Kun Zhou, Xuancheng Huang, Wenhao Li, Shuhuai Ren, Jinliang Lu, Chengqiang Xu, Huadong Wang, Guoyang Zeng, Zile Zhou, Jiajun Zhang, Juanzi Li, Minlie Huang, Rui Yan, Xiaodong He, Xiaojun Wan, Xin Zhao, Xu sun, Yang Liu, Zhiyuan Liu, Xianpei Han, Erhong Yang, Zhifang Sui, Maosong Sun

We argue that for general-purpose language intelligence evaluation, the benchmark itself needs to be comprehensive and systematic.

Paper
Add Code

Model Uncertainty-Aware Knowledge Amalgamation for Pre-Trained Language Models

no code implementations • 14 Dec 2021 • Lei LI, Yankai Lin, Xuancheng Ren, Guangxiang Zhao, Peng Li, Jie zhou, Xu sun

As many fine-tuned pre-trained language models (PLMs) with promising performance are generously released, investigating better ways to reuse these models is vital as it can greatly reduce the retraining computational cost and the potential environmental side-effects.

Paper
Add Code

KNAS: Green Neural Architecture Search

1 code implementation • 26 Nov 2021 • Jingjing Xu, Liang Zhao, Junyang Lin, Rundong Gao, Xu sun, Hongxia Yang

Many existing neural architecture search (NAS) solutions rely on downstream training for architecture evaluation, which takes enormous computations.

Ranked #8 on Neural Architecture Search on NATS-Bench Topology, CIFAR-100

Image Classification Neural Architecture Search +2

Paper
Code

Auto-Encoding Knowledge Graph for Unsupervised Medical Report Generation

no code implementations • NeurIPS 2021 • Fenglin Liu, Chenyu You, Xian Wu, Shen Ge, Sheng Wang, Xu sun

KGAE consists of a pre-constructed knowledge graph, a knowledge-driven encoder and a knowledge-driven decoder.

Decoder Medical Report Generation

Paper
Add Code

RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models

1 code implementation • EMNLP 2021 • Wenkai Yang, Yankai Lin, Peng Li, Jie zhou, Xu sun

Motivated by this observation, we construct a word-based robustness-aware perturbation to distinguish poisoned samples from clean samples to defend against the backdoor attacks on natural language processing (NLP) models.

Sentiment Analysis

Paper
Code

Well-classified Examples are Underestimated in Classification with Deep Neural Networks

1 code implementation • 13 Oct 2021 • Guangxiang Zhao, Wenkai Yang, Xuancheng Ren, Lei LI, Yunfang Wu, Xu sun

The conventional wisdom behind learning deep classification models is to focus on bad-classified examples and ignore well-classified examples that are far from the decision boundary.

Graph Classification imbalanced classification +4

Paper
Code

Topology-Imbalance Learning for Semi-Supervised Node Classification

1 code implementation • NeurIPS 2021 • Deli Chen, Yankai Lin, Guangxiang Zhao, Xuancheng Ren, Peng Li, Jie zhou, Xu sun

The class imbalance problem, as an important issue in learning node representations, has drawn increasing attention from the community.

Classification Node Classification

Paper
Code

Dynamic Knowledge Distillation for Pre-trained Language Models

1 code implementation • EMNLP 2021 • Lei LI, Yankai Lin, Shuhuai Ren, Peng Li, Jie zhou, Xu sun

Knowledge distillation (KD) has been proved effective for compressing large-scale pre-trained language models.

Knowledge Distillation

Paper
Code

Adversarial Parameter Defense by Multi-Step Risk Minimization

no code implementations • 7 Sep 2021 • Zhiyuan Zhang, Ruixuan Luo, Xuancheng Ren, Qi Su, Liangyou Li, Xu sun

To enhance neural networks, we propose the adversarial parameter defense algorithm that minimizes the average risk of multiple adversarial parameter corruptions.

Paper
Add Code

How to Inject Backdoors with Better Consistency: Logit Anchoring on Clean Data

no code implementations • ICLR 2022 • Zhiyuan Zhang, Lingjuan Lyu, Weiqiang Wang, Lichao Sun, Xu sun

In this work, we observe an interesting phenomenon that the variations of parameters are always AWPs when tuning the trained clean model to inject backdoors.

Paper
Add Code

Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification

1 code implementation • EMNLP 2021 • Shuhuai Ren, Jinchao Zhang, Lei LI, Xu sun, Jie zhou

Data augmentation aims to enrich training samples for alleviating the overfitting issue in low-resource or class-imbalanced situations.

Bayesian Optimization Data Augmentation +2

Paper
Code

Long-term, Short-term and Sudden Event: Trading Volume Movement Prediction with Graph-based Multi-view Modeling

1 code implementation • 23 Aug 2021 • Liang Zhao, Wei Li, Ruihan Bao, Keiko Harimoto, Yunfang Wu, Xu sun

Trading volume movement prediction is the key in a variety of financial applications.

Paper
Code

ASAT: Adaptively Scaled Adversarial Training in Time Series

no code implementations • 20 Aug 2021 • Zhiyuan Zhang, Wei Li, Ruihan Bao, Keiko Harimoto, Yunfang Wu, Xu sun

Besides the security concerns of potential adversarial examples, adversarial training can also improve the generalization ability of neural networks, train robust neural networks, and provide interpretability for neural networks.

Adversarial Robustness Time Series +1

Paper
Add Code

O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning

no code implementations • Findings (ACL) 2021 • Fenglin Liu, Xuancheng Ren, Xian Wu, Bang Yang, Shen Ge, Yuexian Zou, Xu sun

Video captioning combines video understanding and language generation.

Attribute Caption Generation +4

Paper
Add Code

Rethinking Stealthiness of Backdoor Attack against NLP Models

1 code implementation • ACL 2021 • Wenkai Yang, Yankai Lin, Peng Li, Jie zhou, Xu sun

In this work, we point out a potential problem of current backdoor attacking research: its evaluation ignores the stealthiness of backdoor attacks, and most existing backdoor attacking methods are not stealthy either to system deployers or to system users.

Backdoor Attack Data Augmentation +2

Paper
Code

Contrastive Attention for Automatic Chest X-ray Report Generation

no code implementations • Findings (ACL) 2021 • Fenglin Liu, Changchang Yin, Xian Wu, Shen Ge, Ping Zhang, Yuexian Zou, Xu sun

In addition, according to the analysis, the CA model can help existing models better attend to the abnormal regions and provide more accurate descriptions which are crucial for an interpretable diagnosis.

Paper
Add Code

Neural Network Surgery: Injecting Data Patterns into Pre-trained Models with Minimal Instance-wise Side Effects

no code implementations • NAACL 2021 • Zhiyuan Zhang, Xuancheng Ren, Qi Su, Xu sun, Bin He

Motivated by neuroscientific evidence and theoretical results, we demonstrate that side effects can be controlled by the number of changed parameters and thus, we propose to conduct neural network surgery by only modifying a limited number of parameters.

Paper
Add Code

A Global Past-Future Early Exit Method for Accelerating Inference of Pre-trained Language Models

1 code implementation • NAACL 2021 • Kaiyuan Liao, Yi Zhang, Xuancheng Ren, Qi Su, Xu sun, Bin He

We first take into consideration all the linguistic information embedded in the past layers and then take a further step to engage the future information which is originally inaccessible for predictions.

Paper
Code

Alleviating the Knowledge-Language Inconsistency: A Study for Deep Commonsense Knowledge

no code implementations • 28 May 2021 • Yi Zhang, Lei LI, Yunfang Wu, Qi Su, Xu sun

Knowledge facts are typically represented by relational triples, while we observe that some commonsense facts are represented by the triples whose forms are inconsistent with the expression of language.

Paper
Add Code

Learning Relation Alignment for Calibrated Cross-modal Retrieval

1 code implementation • ACL 2021 • Shuhuai Ren, Junyang Lin, Guangxiang Zhao, Rui Men, An Yang, Jingren Zhou, Xu sun, Hongxia Yang

To bridge the semantic gap between the two modalities, previous studies mainly focus on word-region alignment at the object level, lacking the matching between the linguistic relation among the words and the visual relation among the regions.

Ranked #4 on Image-to-Text Retrieval on MS COCO

Cross-Modal Retrieval Image-to-Text Retrieval +4

Paper
Code

Rethinking Skip Connection with Layer Normalization in Transformers and ResNets

no code implementations • 15 May 2021 • Fenglin Liu, Xuancheng Ren, Zhiyuan Zhang, Xu sun, Yuexian Zou

In this work, we investigate how the scale factors in the effectiveness of the skip connection and reveal that a trivial adjustment of the scale will lead to spurious gradient exploding or vanishing in line with the deepness of the models, which could be addressed by normalization, in particular, layer normalization, which induces consistent improvements over the plain skip connection.

Image Classification Machine Translation +1

Paper
Add Code

Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models

1 code implementation • NAACL 2021 • Wenkai Yang, Lei LI, Zhiyuan Zhang, Xuancheng Ren, Xu sun, Bin He

However, in this paper, we find that it is possible to hack the model in a data-free way by modifying one single word embedding vector, with almost no accuracy sacrificed on clean samples.

Backdoor Attack Data Poisoning +4

Paper
Code

Multi-View Feature Representation for Dialogue Generation with Bidirectional Distillation

no code implementations • 22 Feb 2021 • Shaoxiong Feng, Xuancheng Ren, Kan Li, Xu sun

The finding of general knowledge is further hindered by the unidirectional distillation, as the student should obey the teacher and may discard some knowledge that is truly general but refuted by the teacher.

Dialogue Generation General Knowledge +1

Paper
Add Code

A Gradient-based Kernel Approach for Efficient Network Architecture Search

no code implementations • 1 Jan 2021 • Jingjing Xu, Liang Zhao, Junyang Lin, Xu sun, Hongxia Yang

Inspired by our new finding, we explore a simple yet effective network architecture search (NAS) approach that leverages gradient correlation and gradient values to find well-performing architectures.

Image Classification text-classification +1

Paper
Add Code

High-Likelihood Area Matters --- Rewarding Near-Correct Predictions Under Imbalanced Distributions

no code implementations • 1 Jan 2021 • Guangxiang Zhao, Lei LI, Xuancheng Ren, Xu sun, Bin He

We find in practice that the high-likelihood area contains correct predictions for tail classes and it plays a vital role in learning imbalanced class distributions.

Vocal Bursts Intensity Prediction

Paper
Add Code

CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade

1 code implementation • Findings (EMNLP) 2021 • Lei LI, Yankai Lin, Deli Chen, Shuhuai Ren, Peng Li, Jie zhou, Xu sun

On the other hand, the exiting decisions made by internal classifiers are unreliable, leading to wrongly emitted early predictions.

Knowledge Distillation Model Selection

Paper
Code

Learning Robust Representation for Clustering through Locality Preserving Variational Discriminative Network

1 code implementation • 25 Dec 2020 • Ruixuan Luo, Wei Li, Zhiyuan Zhang, Ruihan Bao, Keiko Harimoto, Xu sun

Recent deep learning based methods focus on learning clustering oriented representations.

Clustering

Paper
Code

Rethinking the Promotion Brought by Contrastive Learning to Semi-Supervised Node Classification

no code implementations • 14 Dec 2020 • Deli Chen, Yankai Lin, Lei LI, Xuancheng Ren, Peng Li, Jie zhou, Xu sun

Graph Contrastive Learning (GCL) has proven highly effective in promoting the performance of Semi-Supervised Node Classification (SSNC).

Contrastive Learning Graph Learning +1

Paper
Add Code

EQG-RACE: Examination-Type Question Generation

1 code implementation • 11 Dec 2020 • Xin Jia, Wenjie Zhou, Xu sun, Yunfang Wu

Question Generation (QG) is an essential component of automatic intelligent tutoring systems, which aims to generate high-quality questions for facilitating the reading practice and assessments.

Question Generation Question-Generation +2

Paper
Code

Prophet Attention: Predicting Attention with Future Attention

no code implementations • NeurIPS 2020 • Fenglin Liu, Xuancheng Ren, Xian Wu, Shen Ge, Wei Fan, Yuexian Zou, Xu sun

Especially for image captioning, the attention based models are expected to ground correct image regions with proper generated words.

Image Captioning

Paper
Add Code

Rethinking Skip Connection with Layer Normalization

no code implementations • COLING 2020 • Fenglin Liu, Xuancheng Ren, Zhiyuan Zhang, Xu sun, Yuexian Zou

In this work, we investigate how the scale factors in the effectiveness of the skip connection and reveal that a trivial adjustment of the scale will lead to spurious gradient exploding or vanishing in line with the deepness of the models, which could be addressed by normalization, in particular, layer normalization, which induces consistent improvements over the plain skip connection.

Image Classification Machine Translation +1

Paper
Add Code

Pretrain-KGE: Learning Knowledge Representation from Pretrained Language Models

no code implementations • Findings of the Association for Computational Linguistics 2020 • Zhiyuan Zhang, Xiaoqian Liu, Yi Zhang, Qi Su, Xu sun, Bin He

Conventional knowledge graph embedding (KGE) often suffers from limited knowledge representation, leading to performance degradation especially on the low-resource problem.

Knowledge Graph Embedding World Knowledge

Paper
Add Code

A Backbone Replaceable Fine-tuning Framework for Stable Face Alignment

no code implementations • 19 Oct 2020 • Xu sun, Zhenfeng Fan, Zihao Zhang, Yingjie Guo, Shihong Xia

The proposed framework achieves at least 40% improvement on stability evaluation metrics while enhancing detection accuracy versus state-of-the-art methods.

Attribute Face Alignment

Paper
Add Code

CAPT: Contrastive Pre-Training for Learning Denoised Sequence Representations

no code implementations • 13 Oct 2020 • Fuli Luo, Pengcheng Yang, Shicheng Li, Xuancheng Ren, Xu sun

Pre-trained self-supervised models such as BERT have achieved striking success in learning sequence representations, especially for natural language processing.

Natural Language Understanding Sentence

Paper
Add Code

Regularizing Dialogue Generation by Imitating Implicit Scenarios

no code implementations • EMNLP 2020 • Shaoxiong Feng, Xuancheng Ren, Hongshen Chen, Bin Sun, Kan Li, Xu sun

Human dialogues are scenario-based and appropriate responses generally relate to the latent context knowledge entailed by the specific scenario.

Dialogue Generation Imitation Learning

Paper
Add Code

Graph-based Multi-hop Reasoning for Long Text Generation

no code implementations • 28 Sep 2020 • Liang Zhao, Jingjing Xu, Junyang Lin, Yichang Zhang, Hongxia Yang, Xu sun

The reasoning module is responsible for searching skeleton paths from a knowledge graph to imitate the imagination process in the human writing for semantic transfer.

Review Generation Sentence +1

Paper
Add Code

Collaborative Group Learning

no code implementations • 16 Sep 2020 • Shaoxiong Feng, Hongshen Chen, Xuancheng Ren, Zhuoye Ding, Kan Li, Xu sun

Collaborative learning has successfully applied knowledge transfer to guide a pool of small student networks towards robust local minima.

Computational Efficiency Inductive Bias +1

Paper
Add Code

Robust Retinal Vessel Segmentation from a Data Augmentation Perspective

1 code implementation • 31 Jul 2020 • Xu Sun, Huihui Fang, Yehui Yang, Dongwei Zhu, Lei Wang, Junwei Liu, Yanwu Xu

In this paper, we propose two new data augmentation modules, namely, channel-wise random Gamma correction and channel-wise random vessel augmentation.

Data Augmentation Retinal Vessel Segmentation

1,694

Paper
Code

How to Ask Good Questions? Try to Leverage Paraphrases

no code implementations • ACL 2020 • Xin Jia, Wenjie Zhou, Xu sun, Yunfang Wu

Given a sentence and its relevant answer, how to ask good questions is a challenging task, which has many real applications.

Multi-Task Learning Paraphrase Generation +4

Paper
Add Code

Exploring the Vulnerability of Deep Neural Networks: A Study of Parameter Corruption

1 code implementation • 10 Jun 2020 • Xu Sun, Zhiyuan Zhang, Xuancheng Ren, Ruixuan Luo, Liangyou Li

We argue that the vulnerability of model parameters is of crucial value to the study of model robustness and generalization but little research has been devoted to understanding this matter.

Paper
Code

Building BROOK: A Multi-modal and Facial Video Database for Human-Vehicle Interaction Research

no code implementations • 18 May 2020 • Xiangjun Peng, Zhentao Huang, Xu sun

Finally, we discuss related issues when building such a database and our future directions in the context of BROOK.

Autonomous Vehicles

Paper
Add Code

Rethinking and Improving Natural Language Generation with Layer-Wise Multi-View Decoding

no code implementations • 16 May 2020 • Fenglin Liu, Xuancheng Ren, Guangxiang Zhao, Chenyu You, Xuewei Ma, Xian Wu, Xu sun

While it is common practice to draw information from only the last encoder layer, recent work has proposed to use representations from different encoder layers for diversified levels of information.

Abstractive Text Summarization Decoder +6

Paper
Add Code

Parallel Data Augmentation for Formality Style Transfer

1 code implementation • ACL 2020 • Yi Zhang, Tao Ge, Xu sun

The main barrier to progress in the task of Formality Style Transfer is the inadequacy of training data.

Data Augmentation Formality Style Transfer +2

Paper
Code

AGE Challenge: Angle Closure Glaucoma Evaluation in Anterior Segment Optical Coherence Tomography

no code implementations • 5 May 2020 • Huazhu Fu, Fei Li, Xu sun, Xingxing Cao, Jingan Liao, Jose Ignacio Orlando, Xing Tao, Yuexiang Li, Shihao Zhang, Mingkui Tan, Chenglang Yuan, Cheng Bian, Ruitao Xie, Jiongcheng Li, Xiaomeng Li, Jing Wang, Le Geng, Panming Li, Huaying Hao, Jiang Liu, Yan Kong, Yongyong Ren, Hrvoje Bogunovic, Xiulan Zhang, Yanwu Xu

To address this, we organized the Angle closure Glaucoma Evaluation challenge (AGE), held in conjunction with MICCAI 2019.

Paper
Add Code

Query-Variant Advertisement Text Generation with Association Knowledge

1 code implementation • 14 Apr 2020 • Siyu Duan, Wei Li, Cai Jing, Yancheng He, Yunfang Wu, Xu sun

In this paper, we propose the query-variant advertisement text generation task that aims to generate candidate advertisement texts for different web search queries with various needs based on queries and item keywords.

Text Generation

Paper
Code

Jointly Modeling Aspect and Sentiment with Dynamic Heterogeneous Graph Neural Networks

2 code implementations • 14 Apr 2020 • Shu Liu, Wei Li, Yunfang Wu, Qi Su, Xu sun

Target-Based Sentiment Analysis aims to detect the opinion aspects (aspect extraction) and the sentiment polarities (sentiment detection) towards them.

Aspect Extraction Sentiment Analysis

386

Paper
Code

Exploring and Distilling Cross-Modal Information for Image Captioning

no code implementations • 28 Feb 2020 • Fenglin Liu, Xuancheng Ren, Yuanxin Liu, Kai Lei, Xu sun

Recently, attention-based encoder-decoder models have been used extensively in image captioning.

Attribute Decoder +1

Paper
Add Code

Mining Commonsense Facts from the Physical World

no code implementations • 8 Feb 2020 • Yanyan Zou, Wei Lu, Xu sun

In this paper, we propose a new task of mining commonsense facts from the raw text that describes the physical world.

Knowledge Base Completion

Paper
Add Code

Visual Agreement Regularized Training for Multi-Modal Machine Translation

no code implementations • 27 Dec 2019 • Pengcheng Yang, Boxing Chen, Pei Zhang, Xu sun

Further analysis demonstrates that the proposed regularized training can effectively improve the agreement of attention on the image, leading to better use of visual information.

Machine Translation Sentence +1

Paper
Add Code

Explicit Sparse Transformer: Concentrated Attention Through Explicit Selection

2 code implementations • 25 Dec 2019 • Guangxiang Zhao, Junyang Lin, Zhiyuan Zhang, Xuancheng Ren, Qi Su, Xu sun

Self-attention based Transformer has demonstrated the state-of-the-art performances in a number of natural language processing tasks.

Image Captioning Language Modelling +2

Paper
Code

Pretrain-KGEs: Learning Knowledge Representation from Pretrained Models for Knowledge Graph Embeddings

no code implementations • 1 Dec 2019 • Zhiyuan Zhang, Xiaoqian Liu, Yi Zhang, Qi Su, Xu sun, Bin He

Learning knowledge graph embeddings (KGEs) is an efficient approach to knowledge graph completion.

Knowledge Graph Embeddings Link Prediction +1

Paper
Add Code

MUSE: Parallel Multi-Scale Attention for Sequence to Sequence Learning

2 code implementations • 17 Nov 2019 • Guangxiang Zhao, Xu sun, Jingjing Xu, Zhiyuan Zhang, Liangchen Luo

In this work, we explore parallel multi-scale representation learning on sequence data, striving to capture both long-range and short-range language structures.

Ranked #8 on Machine Translation on WMT2014 English-French

Machine Translation Representation Learning +1

Paper
Code

Understanding and Improving Layer Normalization

2 code implementations • NeurIPS 2019 • Jingjing Xu, Xu sun, Zhiyuan Zhang, Guangxiang Zhao, Junyang Lin

Unlike them, we find that the derivatives of the mean and variance are more important than forward normalization by re-centering and re-scaling backward gradients.

Ranked #5 on Machine Translation on IWSLT2015 English-Vietnamese

Machine Translation Translation

Paper
Code

HighwayGraph: Modelling Long-distance Node Relations for Improving General Graph Neural Network

no code implementations • 10 Nov 2019 • Deli Chen, Xiaoqian Liu, Yankai Lin, Peng Li, Jie zhou, Qi Su, Xu sun

To address this issue, we propose to model long-distance node relations by simply relying on shallow GNN architectures with two solutions: (1) Implicitly modelling by learning to predict node pair relations (2) Explicitly modelling by adding edges between nodes that potentially have the same label.

General Classification Node Classification

Paper
Add Code

Asking Clarification Questions in Knowledge-Based Question Answering

no code implementations • IJCNLP 2019 • Jingjing Xu, Yuechen Wang, Duyu Tang, Nan Duan, Pengcheng Yang, Qi Zeng, Ming Zhou, Xu sun

We provide representative baselines for these tasks and further introduce a coarse-to-fine model for clarification question generation.

Question Answering Question Generation +1

Paper
Add Code

Specificity-Driven Cascading Approach for Unsupervised Sentiment Modification

no code implementations • IJCNLP 2019 • Pengcheng Yang, Junyang Lin, Jingjing Xu, Jun Xie, Qi Su, Xu sun

The task of unsupervised sentiment modification aims to reverse the sentiment polarity of the input text while preserving its semantic content without any parallel data.

Specificity

Paper
Add Code

LexicalAT: Lexical-Based Adversarial Reinforcement Training for Robust Sentiment Classification

no code implementations • IJCNLP 2019 • Jingjing Xu, Liang Zhao, Hanqi Yan, Qi Zeng, Yun Liang, Xu sun

The generator learns to generate examples to attack the classifier while the classifier learns to defend against these attacks.

General Classification Sentiment Analysis +3

Paper
Add Code

Blast-wave description of $Υ$ elliptic flow at energies available at the CERN Large Hadron Collider

1 code implementation • 31 Oct 2019 • Klaus Reygers, Alexander Schmah, Anastasia Berdnikova, Xu sun

A simultaneous blast-wave fit to particle yields and elliptic flow ($v_{2}$) measured as a function of transverse momentum in Pb-Pb collisions at LHC energies is presented.

High Energy Physics - Phenomenology Nuclear Experiment Nuclear Theory

Paper
Code

An Adaptive and Momental Bound Method for Stochastic Learning

2 code implementations • 27 Oct 2019 • Jianbang Ding, Xuancheng Ren, Ruixuan Luo, Xu sun

The dynamic learning rate bounds are based on the exponential moving averages of the adaptive learning rates themselves, which smooth out unexpected large learning rates and stabilize the training of deep neural networks.

Stochastic Optimization

125

Paper
Code

Pun-GAN: Generative Adversarial Network for Pun Generation

1 code implementation • IJCNLP 2019 • Fuli Luo, Shunyao Li, Pengcheng Yang, Lei LI, Baobao Chang, Zhifang Sui, Xu sun

It consists of a generator to produce pun sentences, and a discriminator to distinguish between the generated pun sentences and the real sentences with specific word senses.

Generative Adversarial Network Sentence

Paper
Code

Aligning Cross-Lingual Entities with Multi-Aspect Information

1 code implementation • IJCNLP 2019 • Hsiu-Wei Yang, Yanyan Zou, Peng Shi, Wei Lu, Jimmy Lin, Xu sun

Multilingual knowledge graphs (KGs), such as YAGO and DBpedia, represent entities in different languages.

Entity Alignment Entity Embeddings +1

Paper
Code

Group, Extract and Aggregate: Summarizing a Large Amount of Finance News for Forex Movement Prediction

no code implementations • WS 2019 • Deli Chen, Shuming Ma, Keiko Harimoto, Ruihan Bao, Qi Su, Xu sun

In this work, we propose a BERT-based Hierarchical Aggregation Model to summarize a large amount of finance news to predict forex movement.

Extractive Summarization Stock Market Prediction

Paper
Add Code

Sparse Transformer: Concentrated Attention Through Explicit Selection

no code implementations • 25 Sep 2019 • Guangxiang Zhao, Junyang Lin, Zhiyuan Zhang, Xuancheng Ren, Xu sun

Extensive experimental results on a series of natural language processing tasks, including neural machine translation, image captioning, and language modeling, all demonstrate the advantages of Sparse Transformer in model performance.

Image Captioning Language Modelling +2

Paper
Add Code

Recursive Graphical Neural Networks for Text Classification

no code implementations • 18 Sep 2019 • Wei Li, Shuheng Li, Shuming Ma, Yancheng He, Deli Chen, Xu sun

Graph is a natural structure to describe the complicated relation between tokens.

General Classification text-classification +1

Paper
Add Code

Sequence-to-sequence Pre-training with Data Augmentation for Sentence Rewriting

no code implementations • 13 Sep 2019 • Yi Zhang, Tao Ge, Furu Wei, Ming Zhou, Xu sun

We study sequence-to-sequence (seq2seq) pre-training with data augmentation for sentence rewriting.

Data Augmentation Formality Style Transfer +4

Paper
Add Code

Measuring and Relieving the Over-smoothing Problem for Graph Neural Networks from the Topological View

no code implementations • 7 Sep 2019 • Deli Chen, Yankai Lin, Wei Li, Peng Li, Jie zhou, Xu sun

Graph Neural Networks (GNNs) have achieved promising performance on a wide range of graph-based tasks.

Ranked #52 on Node Classification on Cora

Node Classification

Paper
Add Code

Key Fact as Pivot: A Two-Stage Model for Low Resource Table-to-Text Generation

1 code implementation • ACL 2019 • Shuming Ma, Pengcheng Yang, Tianyu Liu, Peng Li, Jie zhou, Xu sun

We propose a novel model to separate the generation into two stages: key fact prediction and surface realization.

Decoder Table-to-Text Generation

Paper
Code

Coherent Comments Generation for Chinese Articles with a Graph-to-Sequence Model

1 code implementation • ACL 2019 • Wei Li, Jingjing Xu, Yancheng He, ShengLi Yan, Yunfang Wu, Xu sun

In this paper, we propose to generate comments with a graph-to-sequence model that models the input news as a topic interaction graph.

Decoder Graph-to-Sequence

174

Paper
Code

Towards Fine-grained Text Sentiment Transfer

1 code implementation • ACL 2019 • Fuli Luo, Peng Li, Pengcheng Yang, Jie zhou, Yutong Tan, Baobao Chang, Zhifang Sui, Xu sun

In this paper, we focus on the task of fine-grained text sentiment transfer (FGST).

Decoder

Paper
Code

MAAM: A Morphology-Aware Alignment Model for Unsupervised Bilingual Lexicon Induction

no code implementations • ACL 2019 • Pengcheng Yang, Fuli Luo, Peng Chen, Tianyu Liu, Xu sun

The task of unsupervised bilingual lexicon induction (UBLI) aims to induce word translations from monolingual corpora in two languages.

Bilingual Lexicon Induction Denoising +2

Paper
Add Code

Enhancing Topic-to-Essay Generation with External Commonsense Knowledge

no code implementations • ACL 2019 • Pengcheng Yang, Lei LI, Fuli Luo, Tianyu Liu, Xu sun

Experiments show that with external commonsense knowledge and adversarial training, the generated essays are more novel, diverse, and topic-consistent than existing methods in terms of both automatic and human evaluation.

Concept-To-Text Generation

Paper
Add Code

Learning to Control the Fine-grained Sentiment for Story Ending Generation

no code implementations • ACL 2019 • Fuli Luo, Damai Dai, Pengcheng Yang, Tianyu Liu, Baobao Chang, Zhifang Sui, Xu sun

Therefore, we propose a generic and novel framework which consists of a sentiment analyzer and a sentimental generator, respectively addressing the two challenges.

Decoder Text Generation

Paper
Add Code

Cross-Modal Commentator: Automatic Machine Commenting Based on Cross-Modal Information

1 code implementation • ACL 2019 • Pengcheng Yang, Zhihan Zhang, Fuli Luo, Lei LI, Chengyang Huang, Xu sun

Automatic commenting of online articles can provide additional opinions and facts to the reader, which improves user experience and engagement on social media platforms.

Comment Generation

Paper
Code

A Deep Reinforced Sequence-to-Set Model for Multi-Label Classification

1 code implementation • ACL 2019 • Pengcheng Yang, Fuli Luo, Shuming Ma, Junyang Lin, Xu sun

In this way, we can reduce the dependence of the model on the label order, as well as capture high-order correlations between labels.

General Classification Multi-Label Classification

Paper
Code

PKUSEG: A Toolkit for Multi-Domain Chinese Word Segmentation

4 code implementations • 27 Jun 2019 • Ruixuan Luo, Jingjing Xu, Yi Zhang, Zhiyuan Zhang, Xuancheng Ren, Xu sun

Through this method, we generate synthetic data using a large amount of unlabeled data in the target domain and then obtain a word segmentation model for the target domain.

Chinese Word Segmentation Domain Adaptation +3

6,440

Paper
Code

Imitation Learning for Non-Autoregressive Neural Machine Translation

no code implementations • ACL 2019 • Bingzhen Wei, Mingxuan Wang, Hao Zhou, Junyang Lin, Jun Xie, Xu sun

Non-autoregressive translation models (NAT) have achieved impressive inference speedup.

Imitation Learning Machine Translation +2

Paper
Add Code

A Hierarchical Reinforced Sequence Operation Method for Unsupervised Text Style Transfer

1 code implementation • ACL 2019 • Chen Wu, Xuancheng Ren, Fuli Luo, Xu sun

Unsupervised text style transfer aims to alter text styles while preserving the content, without aligned data for supervision.

Sentence Style Transfer +2

Paper
Code

Coherent Comment Generation for Chinese Articles with a Graph-to-Sequence Model

1 code implementation • 4 Jun 2019 • Wei Li, Jingjing Xu, Yancheng He, ShengLi Yan, Yunfang Wu, Xu sun

In this paper, we propose to generate comments with a graph-to-sequence model that models the input news as a topic interaction graph.

Comment Generation Decoder +1

174

Paper
Code

Memorized Sparse Backpropagation

no code implementations • 24 May 2019 • Zhiyuan Zhang, Pengcheng Yang, Xuancheng Ren, Qi Su, Xu sun

Neural network learning is usually time-consuming since backpropagation needs to compute full gradients and backpropagate them across multiple layers.

Paper
Add Code

A Dual Reinforcement Learning Framework for Unsupervised Text Style Transfer

2 code implementations • 24 May 2019 • Fuli Luo, Peng Li, Jie zhou, Pengcheng Yang, Baobao Chang, Zhifang Sui, Xu sun

Therefore, in this paper, we propose a dual reinforcement learning framework to directly transfer the style of the text via a one-step mapping model, without any separation of content and style.

Ranked #1 on Unsupervised Text Style Transfer on GYAFC

reinforcement-learning Reinforcement Learning (RL) +2

262

Paper
Code

Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations

1 code implementation • NeurIPS 2019 • Fenglin Liu, Yuanxin Liu, Xuancheng Ren, Xiaodong He, Xu sun

In vision-and-language grounding problems, fine-grained representations of the image are considered to be of paramount importance.

Image Captioning Question Answering +2

Paper
Code

Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling

1 code implementation • IJCAI 2019 • Pengcheng Yang, Fuli Luo, Peng Chen, Lei LI, Zhiyi Yin, Xiaodong He, Xu sun

The visual storytelling (VST) task aims at generating a reasonable and coherent paragraph-level story with the image stream as input.

Ranked #21 on Visual Storytelling on VIST

Knowledge Graphs Semantic Similarity +2

Paper
Code

Adaptive Gradient Methods with Dynamic Bound of Learning Rate

5 code implementations • ICLR 2019 • Liangchen Luo, Yuanhao Xiong, Yan Liu, Xu sun

Recent work has put forward some algorithms such as AMSGrad to tackle this issue but they failed to achieve considerable improvement over existing methods.

2,910

Paper
Code

Learning Personalized End-to-End Goal-Oriented Dialog

no code implementations • 12 Nov 2018 • Liangchen Luo, Wenhao Huang, Qi Zeng, Zaiqing Nie, Xu sun

Most existing works on dialog systems only consider conversation content while neglecting the personality of the user the bot is interacting with, which begets several unsolved issues.

Goal-Oriented Dialog

Paper
Add Code

Learning Unsupervised Word Mapping by Maximizing Mean Discrepancy

no code implementations • 1 Nov 2018 • Pengcheng Yang, Fuli Luo, Shuangzhi Wu, Jingjing Xu, Dong-dong Zhang, Xu sun

In order to avoid such sophisticated alternate optimization, we propose to learn unsupervised word mapping by directly maximizing the mean discrepancy between the distribution of transferred embedding and target embedding.

Cross-Lingual Word Embeddings Density Estimation +4

Paper
Add Code

Auto-Dialabel: Labeling Dialogue Data with Unsupervised Learning

no code implementations • EMNLP 2018 • Chen Shi, Qi Chen, Lei Sha, Sujian Li, Xu Sun, Houfeng Wang, Lintao Zhang

The lack of labeled data is one of the main challenges when building a task-oriented dialogue system.

Active Learning Clustering

Paper
Add Code

Diversity-Promoting GAN: A Cross-Entropy Based Generative Adversarial Network for Diversified Text Generation

1 code implementation • EMNLP 2018 • Jingjing Xu, Xuancheng Ren, Junyang Lin, Xu sun

Existing text generation methods tend to produce repeated and "boring" expressions.

Dialogue Generation Generative Adversarial Network +4

Paper
Code

Unsupervised Machine Commenting with Neural Variational Topic Model

no code implementations • 13 Sep 2018 • Shuming Ma, Lei Cui, Furu Wei, Xu sun

To fully exploit the unpaired data, we completely remove the need for parallel data and propose a novel unsupervised approach to train an automatic article commenting model, relying on nothing but unpaired articles and comments.

Retrieval

Paper
Add Code

LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts

3 code implementations • 13 Sep 2018 • Shuming Ma, Lei Cui, Damai Dai, Furu Wei, Xu sun

We introduce the task of automatic live commenting.

Retrieval

124

Paper
Code

Evaluating Semantic Rationality of a Sentence: A Sememe-Word-Matching Neural Network based on HowNet

no code implementations • 11 Sep 2018 • Shu Liu, Jingjing Xu, Xuancheng Ren, Xu sun

To evaluate the effectiveness of the proposed model, we build a large-scale rationality evaluation dataset.

Language Modelling Sentence

Paper
Add Code

A Deep Reinforced Sequence-to-Set Model for Multi-Label Text Classification

no code implementations • 10 Sep 2018 • Pengcheng Yang, Shuming Ma, Yi Zhang, Junyang Lin, Qi Su, Xu sun

However, the Seq2Seq model is not suitable for the MLTC task in essence.

General Classification Multi Label Text Classification +2

Paper
Add Code

simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions

1 code implementation • EMNLP 2018 • Fenglin Liu, Xuancheng Ren, Yuanxin Liu, Houfeng Wang, Xu sun

The encoder-decoder framework has shown recent success in image captioning.

Decoder Image Captioning

Paper
Code

An Auto-Encoder Matching Model for Learning Utterance-Level Semantic Dependency in Dialogue Generation

1 code implementation • EMNLP 2018 • Liangchen Luo, Jingjing Xu, Junyang Lin, Qi Zeng, Xu sun

Different from conventional text generation tasks, the mapping between inputs and responses in conversations is more complicated, which highly demands the understanding of utterance-level semantic dependency, a relation between the whole meanings of inputs and outputs.

Ranked #1 on Text Generation on DailyDialog

Dialogue Generation

Paper
Code

Semantic-Unit-Based Dilated Convolution for Multi-Label Text Classification

1 code implementation • EMNLP 2018 • Junyang Lin, Qi Su, Pengcheng Yang, Shuming Ma, Xu sun

We propose a novel model for multi-label text classification, which is based on sequence-to-sequence learning.

General Classification Multi Label Text Classification +2

154

Paper
Code

Identifying High-Quality Chinese News Comments Based on Multi-Target Text Matching Model

no code implementations • 22 Aug 2018 • Deli Chen, Shuming Ma, Pengcheng Yang, Xu sun

In this work, we introduce a novel task: high-quality comment identification (HQCI), which aims to automatically assess the quality of online comments.

Informativeness Text Matching

Paper
Add Code

Learning When to Concentrate or Divert Attention: Self-Adaptive Attention Temperature for Neural Machine Translation

1 code implementation • EMNLP 2018 • Junyang Lin, Xu sun, Xuancheng Ren, Muyu Li, Qi Su

Most of the Neural Machine Translation (NMT) models are based on the sequence-to-sequence (Seq2Seq) model with an encoder-decoder framework equipped with the attention mechanism.

Ranked #7 on Machine Translation on IWSLT2015 English-Vietnamese

Decoder Machine Translation +2

Paper
Code

Learning Sentiment Memories for Sentiment Modification without Parallel Data

1 code implementation • EMNLP 2018 • Yi Zhang, Jingjing Xu, Pengcheng Yang, Xu sun

The task of sentiment modification requires reversing the sentiment of the input and preserving the sentiment-independent content.

Text Style Transfer

Paper
Code

A Skeleton-Based Model for Promoting Coherence Among Sentences in Narrative Story Generation

1 code implementation • EMNLP 2018 • Jingjing Xu, Xuancheng Ren, Yi Zhang, Qi Zeng, Xiaoyan Cai, Xu sun

Compared to the state-of-the-art models, our skeleton-based model can generate significantly more coherent text according to human evaluation and automatic evaluation.

Sentence Story Generation

Paper
Code

Sememe Prediction: Learning Semantic Knowledge from Unstructured Textual Wiki Descriptions

no code implementations • 16 Aug 2018 • Wei Li, Xuancheng Ren, Damai Dai, Yunfang Wu, Houfeng Wang, Xu sun

In the experiments, we take a real-world sememe knowledge base HowNet and the corresponding descriptions of the words in Baidu Wiki for training and evaluation.

Paper
Add Code

Primal Meaning Recommendation via On-line Encyclopedia

no code implementations • 14 Aug 2018 • Zhiyuan Zhang, Wei Li, Jingjing Xu, Xu sun

We define the primal meaning of an expression to be a frequently used sense of that expression from which its other frequent senses can be deduced.

Paper
Add Code

A Neural Question Answering Model Based on Semi-Structured Tables

no code implementations • COLING 2018 • Hao Wang, Xiaodong Zhang, Shuming Ma, Xu sun, Houfeng Wang, Mengxiang Wang

Then the system measures the relevance between each question and candidate table cells, and chooses the most related cell as the source of the answer.

Knowledge Graphs Multiple-choice +1

Paper
Add Code

Question Condensing Networks for Answer Selection in Community Question Answering

1 code implementation • ACL 2018 • Wei Wu, Xu sun, Houfeng Wang

Answer selection is an important subtask of community question answering (CQA).

Answer Selection Community Question Answering

Paper
Code

SGM: Sequence Generation Model for Multi-label Classification

1 code implementation • COLING 2018 • Pengcheng Yang, Xu sun, Wei Li, Shuming Ma, Wei Wu, Houfeng Wang

Further analysis of experimental results demonstrates that the proposed methods not only capture the correlations between labels, but also select the most informative words automatically when predicting different labels.

Classification Decoder +2

429

Paper
Code

Deconvolution-Based Global Decoding for Neural Machine Translation

1 code implementation • COLING 2018 • Junyang Lin, Xu sun, Xuancheng Ren, Shuming Ma, Jinsong Su, Qi Su

A great proportion of sequence-to-sequence (Seq2Seq) models for Neural Machine Translation (NMT) adopt Recurrent Neural Network (RNN) to generate translation word by word following a sequential order.

Ranked #9 on Machine Translation on IWSLT2015 English-Vietnamese

Machine Translation NMT +1

Paper
Code

Unpaired Sentiment-to-Sentiment Translation: A Cycled Reinforcement Learning Approach

1 code implementation • ACL 2018 • Jingjing Xu, Xu sun, Qi Zeng, Xuancheng Ren, Xiaodong Zhang, Houfeng Wang, Wenjie Li

We evaluate our approach on two review datasets, Yelp and Amazon.

Ranked #6 on Unsupervised Text Style Transfer on Yelp

reinforcement-learning Reinforcement Learning (RL) +3

108

Paper
Code

Autoencoder as Assistant Supervisor: Improving Text Representation for Chinese Social Media Text Summarization

1 code implementation • ACL 2018 • Shuming Ma, Xu sun, Junyang Lin, Houfeng Wang

In this work, we supervise the learning of the representation of the source content with that of the summary.

Abstractive Text Summarization

136

Paper
Code

Bag-of-Words as Target for Neural Machine Translation

1 code implementation • ACL 2018 • Shuming Ma, Xu sun, Yizhong Wang, Junyang Lin

However, most of the existing neural machine translation models only use one of the correct translations as the targets, and the other correct sentences are punished as the incorrect sentences in the training stage.

Machine Translation Sentence +1

Paper
Code

Automatic Academic Paper Rating Based on Modularized Hierarchical Convolutional Neural Network

1 code implementation • ACL 2018 • Pengcheng Yang, Xu sun, Wei Li, Shuming Ma

As more and more academic papers are being submitted to conferences and journals, evaluating all these papers by professionals is time-consuming and can cause inequality due to the personal factors of the reviewers.

Paper
Code

Global Encoding for Abstractive Summarization

4 code implementations • ACL 2018 • Junyang Lin, Xu sun, Shuming Ma, Qi Su

To tackle the problem, we propose a global encoding framework, which controls the information flow from the encoder to the decoder based on the global information of the source context.

Ranked #29 on Text Summarization on GigaWord

Abstractive Text Summarization Decoder

273

Paper
Code

Regularizing Output Distribution of Abstractive Chinese Social Media Text Summarization for Improved Semantic Consistency

no code implementations • 10 May 2018 • Bingzhen Wei, Xuancheng Ren, Xu sun, Yi Zhang, Xiaoyan Cai, Qi Su

Especially, the proposed approach improves the semantic consistency by 4% in terms of human evaluation.

Abstractive Text Summarization

Paper
Add Code

A Hierarchical End-to-End Model for Jointly Improving Text Summarization and Sentiment Classification

no code implementations • 3 May 2018 • Shuming Ma, Xu sun, Junyang Lin, Xuancheng Ren

Text summarization and sentiment classification both aim to capture the main ideas of the text but at different levels.

Abstractive Text Summarization Classification +3

Paper
Add Code

Structure Regularized Neural Network for Entity Relation Classification for Chinese Literature Text

no code implementations • NAACL 2018 • Ji Wen, Xu sun, Xuancheng Ren, Qi Su

In this paper, we propose the task of relation classification for Chinese literature text.

General Classification Relation +1

Paper
Add Code

Query and Output: Generating Words by Querying Distributed Word Representations for Paraphrase Generation

1 code implementation • NAACL 2018 • Shuming Ma, Xu sun, Wei Li, Sujian Li, Wenjie Li, Xuancheng Ren

The existing sequence-to-sequence model tends to memorize the words and the patterns in the training dataset instead of learning the meaning of the words.

Abstractive Text Summarization Decoder +3

Paper
Code

Tag-Enhanced Tree-Structured Neural Networks for Implicit Discourse Relation Classification

no code implementations • IJCNLP 2017 • Yizhong Wang, Sujian Li, Jingfeng Yang, Xu sun, Houfeng Wang

Identifying implicit discourse relations between text spans is a challenging task because it requires understanding the meaning of the text.

General Classification Implicit Discourse Relation Classification +3

Paper
Add Code

Decoding-History-Based Adaptive Control of Attention for Neural Machine Translation

no code implementations • 6 Feb 2018 • Junyang Lin, Shuming Ma, Qi Su, Xu sun

ACA learns to control the attention by keeping track of the decoding history and the current information with a memory vector, so that the model can take the translated contents and the current information into consideration.

Decoder Machine Translation +2

Paper
Add Code

DP-GAN: Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text

3 code implementations • 5 Feb 2018 • Jingjing Xu, Xuancheng Ren, Junyang Lin, Xu sun

Existing text generation methods tend to produce repeated and "boring" expressions.

Dialogue Generation Generative Adversarial Network +2

145

Paper
Code

Exploration on Generating Traditional Chinese Medicine Prescription from Symptoms with an End-to-End method

no code implementations • 27 Jan 2018 • Wei Li, Zheng Yang, Xu sun

Traditional Chinese Medicine (TCM) is an influential form of medical treatment in China and surrounding areas.

Decoder

Paper
Add Code

Building an Ellipsis-aware Chinese Dependency Treebank for Web Text

1 code implementation • LREC 2018 • Xuancheng Ren, Xu sun, Ji Wen, Bingzhen Wei, Weidong Zhan, Zhiyuan Zhang

Web 2.0 has brought with it a wealth of user-produced data revealing users' thoughts, experiences, and knowledge, which is a great source for many tasks, such as information extraction and knowledge base construction.

Dependency Parsing Sentence

Paper
Code

A Chinese Dataset with Negative Full Forms for General Abbreviation Prediction

1 code implementation • LREC 2018 • Yi Zhang, Xu sun

However, owing to the scarcity of abbreviation corpora, this task has received limited attention in current studies, especially given that general abbreviation prediction should also cover full form expressions that have no valid abbreviation, namely the negative full forms (NFFs).

valid

Paper
Code

Hybrid Oracle: Making Use of Ambiguity in Transition-based Chinese Dependency Parsing

1 code implementation • 28 Nov 2017 • Xuancheng Ren, Xu sun

In the training of transition-based dependency parsers, an oracle is used to predict a transition sequence for a sentence and its gold tree.

Chinese Dependency Parsing Dependency Parsing +1

Paper
Code

Complex Structure Leads to Overfitting: A Structure Regularization Decoding Method for Natural Language Processing

no code implementations • 25 Nov 2017 • Xu Sun, Weiwei Sun, Shuming Ma, Xuancheng Ren, Yi Zhang, Wenjie Li, Houfeng Wang

The decoding of the complex structure model is regularized by the additionally trained simple structure model.

Structured Prediction

Paper
Add Code

Does Higher Order LSTM Have Better Accuracy for Segmenting and Labeling Sequence Data?

1 code implementation • COLING 2018 • Yi Zhang, Xu sun, Shuming Ma, Yang Yang, Xuancheng Ren

In our work, we first design a new model called "high order LSTM" to predict multiple tags for the current token, comprising not only the current tag but also the several preceding tags.

Chunking NER +1

Paper
Code

A Discourse-Level Named Entity Recognition and Relation Extraction Dataset for Chinese Literature Text

2 code implementations • 19 Nov 2017 • Jingjing Xu, Ji Wen, Xu sun, Qi Su

To build a high quality dataset, we propose two tagging methods to solve the problem of data inconsistency, including a heuristic tagging method and a machine auxiliary tagging method.

named-entity-recognition Named Entity Recognition +3

400

Paper
Code

Training Simplification and Model Simplification for Deep Learning: A Minimal Effort Back Propagation Method

3 code implementations • 17 Nov 2017 • Xu Sun, Xuancheng Ren, Shuming Ma, Bingzhen Wei, Wei Li, Jingjing Xu, Houfeng Wang, Yi Zhang

Based on the sparsified gradients, we further simplify the model by eliminating the rows or columns that are seldom updated, which reduces the computational cost of both training and decoding and can potentially accelerate decoding in real-world applications.

110

Paper
Code

Deep Stacking Networks for Low-Resource Chinese Word Segmentation with Transfer Learning

no code implementations • 4 Nov 2017 • Jingjing Xu, Xu sun, Sujian Li, Xiaoyan Cai, Bingzhen Wei

In this paper, we propose a deep stacking framework to improve the performance on word segmentation tasks with insufficient data by integrating datasets from diverse domains.

Chinese Word Segmentation Transfer Learning

Paper
Add Code

Cascading Multiway Attentions for Document-level Sentiment Classification

no code implementations • IJCNLP 2017 • Dehong Ma, Sujian Li, Xiaodong Zhang, Houfeng Wang, Xu sun

Document-level sentiment classification aims to assign the user reviews a sentiment polarity.

Ranked #5 on Sentiment Analysis on User and product information

Classification General Classification +4

Paper
Add Code

Addressing Domain Adaptation for Chinese Word Segmentation with Global Recurrent Structure

no code implementations • IJCNLP 2017 • Shen Huang, Xu sun, Houfeng Wang

Boundary features are widely used in traditional Chinese Word Segmentation (CWS) methods as they can utilize unlabeled data to help improve the Out-of-Vocabulary (OOV) word recognition performance.

Chinese Word Segmentation Domain Adaptation +2

Paper
Add Code

Label Embedding Network: Learning Label Representation for Soft Training of Deep Networks

1 code implementation • ICLR 2018 • Xu Sun, Bingzhen Wei, Xuancheng Ren, Shuming Ma

We propose a method, called Label Embedding Network, which can learn label representation (label embedding) during the training process of deep networks.

Paper
Code

A Semantic Relevance Based Neural Network for Text Summarization and Text Simplification

1 code implementation • 6 Oct 2017 • Shuming Ma, Xu sun

In this work, our goal is to improve semantic relevance between source texts and simplified texts for text summarization and text simplification.

Decoder Semantic Similarity +4

Paper
Code

Minimal Effort Back Propagation for Convolutional Neural Networks

no code implementations • 18 Sep 2017 • Bingzhen Wei, Xu sun, Xuancheng Ren, Jingjing Xu

As traditional neural networks consume a significant amount of computing resources during back propagation, Sun et al. (2017) propose a simple yet effective technique to alleviate this problem.

Paper
Add Code

meProp: Sparsified Back Propagation for Accelerated Deep Learning with Reduced Overfitting

2 code implementations • ICML 2017 • Xu Sun, Xuancheng Ren, Shuming Ma, Houfeng Wang

In back propagation, only a small subset of the full gradient is computed to update the model parameters.

110

Paper
Code

Improving Semantic Relevance for Sequence-to-Sequence Learning of Chinese Social Media Text Summarization

1 code implementation • ACL 2017 • Shuming Ma, Xu sun, Jingjing Xu, Houfeng Wang, Wenjie Li, Qi Su

In this work, our goal is to improve semantic relevance between source texts and summaries for Chinese social media summarization.

Decoder Semantic Similarity +2

Paper
Code

Lock-Free Parallel Perceptron for Graph-based Dependency Parsing

no code implementations • 2 Mar 2017 • Xu Sun, Shuming Ma

To deal with this problem, we propose a parallel algorithm called parallel perceptron.

Dependency Parsing

Paper
Add Code

A Generic Online Parallel Learning Framework for Large Margin Models

no code implementations • 2 Mar 2017 • Shuming Ma, Xu sun

To speed up the training process, many existing systems use parallel technology for online learning algorithms.

Paper
Add Code

Transfer Deep Learning for Low-Resource Chinese Word Segmentation with a Novel Neural Network

no code implementations • 15 Feb 2017 • Jingjing Xu, Xu sun

First, we train a teacher model on high-resource corpora and then use the learned knowledge to initialize a student model.

Chinese Word Segmentation Segmentation +1

Paper
Add Code

Asynchronous Parallel Learning for Neural Networks and Structured Models with Dense Features

no code implementations • COLING 2016 • Xu Sun

Existing asynchronous parallel learning methods are designed only for sparse feature models, and they face new challenges with dense feature models such as neural networks (e.g., LSTM, RNN).

Low-Rank Matrix Completion

Paper
Add Code

F-Score Driven Max Margin Neural Network for Named Entity Recognition in Chinese Social Media

no code implementations • EACL 2017 • Hangfeng He, Xu sun

We focus on named entity recognition (NER) for Chinese social media.

named-entity-recognition Named Entity Recognition +1

Paper
Add Code

A New Recurrent Neural CRF for Learning Non-linear Edge Features

no code implementations • 14 Nov 2016 • Shuming Ma, Xu sun

Conditional Random Field (CRF) and recurrent neural models have achieved success in structured prediction.

Chinese Word Segmentation Chunking +3

Paper
Add Code

Dependency-based Gated Recursive Neural Network for Chinese Word Segmentation

no code implementations • ACL 2016 • Jingjing Xu, Xu sun

Chinese Word Segmentation Feature Engineering

Paper
Add Code

Knowledge-Based Semantic Embedding for Machine Translation

no code implementations • ACL 2016 • Chen Shi, Shujie Liu, Shuo Ren, Shi Feng, Mu Li, Ming Zhou, Xu sun, Houfeng Wang

Machine Translation Translation

Paper
Add Code

Multi-label Text Categorization with Joint Learning Predictions-as-Features Method

no code implementations • EMNLP 2015 • Li Li, Houfeng Wang, Xu sun, Baobao Chang, Shi Zhao, Lei Sha

Multi-Label Learning Text Categorization

Paper
Add Code

Towards Easier and Faster Sequence Labeling for Natural Language Processing: A Search-based Probabilistic Online Learning Framework (SAPO)

4 code implementations • 29 Mar 2015 • Xu Sun, Shuming Ma, Yi Zhang, Xuancheng Ren

We show that this method, which offers fast training and a theoretical guarantee of convergence and is easy to implement, can support search-based optimization and obtain top accuracy.