Search Results for author: Xiaodong He

Found 153 papers, 58 papers with code

Tracking Satisfaction States for Customer Satisfaction Prediction in E-commerce Service Chatbots

no code implementations • COLING 2022 • Yang Sun, Liangqing Wu, Shuangyong Song, Xiaoguang Yu, Xiaodong He, Guohong Fu

In this work, we investigate the problem of satisfaction states tracking and its effects on CSP in E-commerce service chatbots.

Chatbot

Paper
Add Code

Learn to Copy from the Copying History: Correlational Copy Network for Abstractive Summarization

1 code implementation • EMNLP 2021 • Haoran Li, Song Xu, Peng Yuan, Yujia Wang, Youzheng Wu, Xiaodong He, BoWen Zhou

It thereby takes advantage of prior copying distributions and, at each time step, explicitly encourages the model to copy the input word that is relevant to the previously copied one.

Ranked #11 on Abstractive Text Summarization on CNN / Daily Mail (using extra training data)

Abstractive Text Summarization News Summarization

Paper
Code

Few-Shot Table Understanding: A Benchmark Dataset and Pre-Training Baseline

no code implementations • COLING 2022 • Ruixue Liu, Shaozu Yuan, Aijun Dai, Lei Shen, Tiangang Zhu, Meng Chen, Xiaodong He

Since there is no large number of public Chinese tables, we also collect a large-scale, multi-domain tabular corpus to facilitate future Chinese table pre-training, which includes one million tables and related natural language text with auxiliary supervised interaction signals.

Paper
Add Code

Don’t Take It Literally: An Edit-Invariant Sequence Loss for Text Generation

1 code implementation • NAACL 2022 • Guangyi Liu, Zichao Yang, Tianhua Tao, Xiaodan Liang, Junwei Bao, Zhen Li, Xiaodong He, Shuguang Cui, Zhiting Hu

Such training objective is sub-optimal when the target sequence is not perfect, e. g., when the target sequence is corrupted with noises, or when only weak sequence supervision is available.

Machine Translation Style Transfer +2

Paper
Code

OPERA: Operation-Pivoted Discrete Reasoning over Text

1 code implementation • NAACL 2022 • Yongwei Zhou, Junwei Bao, Chaoqun Duan, Haipeng Sun, Jiahui Liang, Yifan Wang, Jing Zhao, Youzheng Wu, Xiaodong He, Tiejun Zhao

To inherit the advantages of these two types of methods, we propose OPERA, an operation-pivoted discrete reasoning framework, where lightweight symbolic operations (compared with logical forms) as neural modules are utilized to facilitate the reasoning ability and interpretability.

Machine Reading Comprehension Semantic Parsing

Paper
Code

E-ConvRec: A Large-Scale Conversational Recommendation Dataset for E-Commerce Customer Service

no code implementations • LREC 2022 • Meihuizi Jia, Ruixue Liu, Peiying Wang, Yang song, Zexi Xi, Haobin Li, Xin Shen, Meng Chen, Jinhui Pang, Xiaodong He

There has been a growing interest in developing conversational recommendation system (CRS), which provides valuable recommendations to users through conversations.

Dialogue Management Management

Paper
Add Code

A Group Fairness Lens for Large Language Models

no code implementations • 24 Dec 2023 • Guanqun Bi, Lei Shen, Yuqiang Xie, Yanan Cao, Tiangang Zhu, Xiaodong He

The rapid advancement of large language models has revolutionized various applications but also raised crucial concerns about their potential to perpetuate biases and unfairness when deployed in social media contexts.

Attribute Fairness +1

Paper
Add Code

Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld

1 code implementation • 28 Nov 2023 • Yijun Yang, Tianyi Zhou, Kanxue Li, Dapeng Tao, Lusong Li, Li Shen, Xiaodong He, Jing Jiang, Yuhui Shi

While large language models (LLMs) excel in a simulated world of texts, they struggle to interact with the more realistic world without perceptions of other modalities such as visual or audio signals.

Imitation Learning

Paper
Code

Leveraging Label Information for Multimodal Emotion Recognition

no code implementations • 5 Sep 2023 • Peiying Wang, Sunlu Zeng, Junqing Chen, Lu Fan, Meng Chen, Youzheng Wu, Xiaodong He

Finally, we devise a novel label-guided attentive fusion module to fuse the label-aware text and speech representations for emotion classification.

Emotion Classification Multimodal Emotion Recognition

Paper
Add Code

AUGUST: an Automatic Generation Understudy for Synthesizing Conversational Recommendation Datasets

no code implementations • 16 Jun 2023 • Yu Lu, Junwei Bao, Zichen Ma, Xiaoguang Han, Youzheng Wu, Shuguang Cui, Xiaodong He

High-quality data is essential for conversational recommendation systems and serves as the cornerstone of the network architecture development and training strategy design.

Knowledge Graphs Recommendation Systems

Paper
Add Code

OTF: Optimal Transport based Fusion of Supervised and Self-Supervised Learning Models for Automatic Speech Recognition

no code implementations • 5 Jun 2023 • Li Fu, Siqi Li, Qingtao Li, Fangzhu Li, Liping Deng, Lu Fan, Meng Chen, Youzheng Wu, Xiaodong He

Self-Supervised Learning (SSL) Automatic Speech Recognition (ASR) models have shown great promise over Supervised Learning (SL) ones in low-resource settings.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

DiffusEmp: A Diffusion Model-Based Framework with Multi-Grained Control for Empathetic Response Generation

no code implementations • 2 Jun 2023 • Guanqun Bi, Lei Shen, Yanan Cao, Meng Chen, Yuqiang Xie, Zheng Lin, Xiaodong He

Empathy is a crucial factor in open-domain conversations, which naturally shows one's caring and understanding to others.

Attribute Empathetic Response Generation +3

Paper
Add Code

Learning to Generate Poetic Chinese Landscape Painting with Calligraphy

no code implementations • 8 May 2023 • Shaozu Yuan, Aijun Dai, Zhiling Yan, Ruixue Liu, Meng Chen, Baoyang Chen, Zhijie Qiu, Xiaodong He

In this paper, we present a novel system (denoted as Polaca) to generate poetic Chinese landscape painting with calligraphy.

Paper
Add Code

Roto-Translation Invariant Formation of Fixed-Wing UAVs in 3D: Feasibility and Control

no code implementations • 23 Feb 2023 • Xiaodong He, Zhongkui Li, Xiangke Wang, Zhiyong Geng

Secondly, given feasible formations, we design a formation controller by introducing a virtual leader and employing the compensation of rotation, followed by proving the stability of the closed-loop system.

Translation

Paper
Add Code

SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation

1 code implementation • 27 Nov 2022 • Huaishao Luo, Junwei Bao, Youzheng Wu, Xiaodong He, Tianrui Li

The pre-trained model can capture enriched visual concepts for images by learning from a large scale of text-image data.

Ranked #1 on Semantic Segmentation on PASCAL VOC

Open Vocabulary Semantic Segmentation Segmentation +1

Paper
Code

MNER-QG: An End-to-End MRC framework for Multimodal Named Entity Recognition with Query Grounding

no code implementations • 27 Nov 2022 • Meihuizi Jia, Lei Shen, Xin Shen, Lejian Liao, Meng Chen, Xiaodong He, Zhendong Chen, Jiaqi Li

Multimodal named entity recognition (MNER) is a critical step in information extraction, which aims to detect entity spans and classify them to corresponding entity types given a sentence-image pair.

named-entity-recognition Named Entity Recognition +4

Paper
Add Code

MoNET: Tackle State Momentum via Noise-Enhanced Training for Dialogue State Tracking

no code implementations • 10 Nov 2022 • Haoning Zhang, Junwei Bao, Haipeng Sun, Youzheng Wu, Wenye Li, Shuguang Cui, Xiaodong He

Then, the noised previous state is used as the input to learn to predict the current state, improving the model's ability to update and correct slot values.

Dialogue State Tracking

Paper
Add Code

UFO2: A unified pre-training framework for online and offline speech recognition

no code implementations • 26 Oct 2022 • Li Fu, Siqi Li, Qingtao Li, Liping Deng, Fangzhu Li, Lu Fan, Meng Chen, Xiaodong He

In this paper, we propose a Unified pre-training Framework for Online and Offline (UFO2) Automatic Speech Recognition (ASR), which 1) simplifies the two separate training workflows for online and offline modes into one process, and 2) improves the Word Error Rate (WER) performance with limited utterance annotating.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

P$^3$LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training

no code implementations • 22 Oct 2022 • Junwei Bao, Yifan Wang, Jiangyong Ying, Yeyun Gong, Jing Zhao, Youzheng Wu, Xiaodong He

Conventional autoregressive left-to-right (L2R) sequence generation faces two issues during decoding: limited to unidirectional target sequence modeling, and constrained on strong local dependencies.

Conversational Question Answering Language Modelling +3

Paper
Add Code

MuGER$^2$: Multi-Granularity Evidence Retrieval and Reasoning for Hybrid Question Answering

1 code implementation • 19 Oct 2022 • Yingyao Wang, Junwei Bao, Chaoqun Duan, Youzheng Wu, Xiaodong He, Tiejun Zhao

To preserve the advantage and eliminate the disadvantage of different granularity evidence, we propose MuGER$^2$, a Multi-Granularity Evidence Retrieval and Reasoning approach.

Navigate Question Answering +1

201

Paper
Code

Mars: Modeling Context & State Representations with Contrastive Learning for End-to-End Task-Oriented Dialog

1 code implementation • 17 Oct 2022 • Haipeng Sun, Junwei Bao, Youzheng Wu, Xiaodong He

Traditional end-to-end task-oriented dialog systems first convert dialog context into belief state and action state before generating the system response.

Contrastive Learning

818

Paper
Code

UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation

1 code implementation • 15 Oct 2022 • Yongwei Zhou, Junwei Bao, Chaoqun Duan, Youzheng Wu, Xiaodong He, Tiejun Zhao

Question answering requiring discrete reasoning, e. g., arithmetic computing, comparison, and counting, over knowledge is a challenging task.

Question Answering Semantic Parsing

Paper
Code

AutoQGS: Auto-Prompt for Low-Resource Knowledge-based Question Generation from SPARQL

1 code implementation • 26 Aug 2022 • Guanming Xiong, Junwei Bao, Wen Zhao, Youzheng Wu, Xiaodong He

This study investigates the task of knowledge-based question generation (KBQG).

Question Generation Question-Generation

Paper
Code

Composable Text Controls in Latent Space with ODEs

1 code implementation • 1 Aug 2022 • Guangyi Liu, Zeyu Feng, Yuan Gao, Zichao Yang, Xiaodan Liang, Junwei Bao, Xiaodong He, Shuguang Cui, Zhen Li, Zhiting Hu

This paper proposes a new efficient approach for composable text operations in the compact latent space of text.

Ranked #2 on Unsupervised Text Style Transfer on Yelp

Attribute Language Modelling +2

Paper
Code

Flow Completion Network: Inferring the Fluid Dynamics from Incomplete Flow Information using Graph Neural Networks

no code implementations • 10 May 2022 • Xiaodong He, Yinan Wang, Juan Li

This paper introduces a novel neural network - flow completion network (FCN) - to infer the fluid dynamics, includ-ing the flow field and the force acting on the body, from the incomplete data based on Graph Convolution AttentionNetwork.

Paper
Add Code

BORT: Back and Denoising Reconstruction for End-to-End Task-Oriented Dialog

1 code implementation • Findings (NAACL) 2022 • Haipeng Sun, Junwei Bao, Youzheng Wu, Xiaodong He

To enhance the denoising capability of the model to reduce the impact of error propagation, denoising reconstruction is used to reconstruct the corrupted dialog state and response.

Denoising

Paper
Code

LUNA: Learning Slot-Turn Alignment for Dialogue State Tracking

1 code implementation • NAACL 2022 • Yifan Wang, Jing Zhao, Junwei Bao, Chaoqun Duan, Youzheng Wu, Xiaodong He

Dialogue state tracking (DST) aims to predict the current dialogue state given the dialogue history.

Dialogue State Tracking

Paper
Code

OPERA:Operation-Pivoted Discrete Reasoning over Text

no code implementations • 29 Apr 2022 • Yongwei Zhou, Junwei Bao, Chaoqun Duan, Haipeng Sun, Jiahui Liang, Yifan Wang, Jing Zhao, Youzheng Wu, Xiaodong He, Tiejun Zhao

Machine Reading Comprehension Semantic Parsing

Paper
Add Code

The low-entropy hydration shell at the binding site of spike RBD determines the contagiousness of SARS-CoV-2 variants

no code implementations • 27 Apr 2022 • Lin Yang, Shuai Guo, Chengyu Houc, Jiacheng Lia, Liping Shi, Chenchen Liao, Rongchun Shi, Xiaoliang Ma, Bing Zheng, Yi Fang, Lin Ye, Xiaodong He

The low-entropy level of hydration shells at the binding site of a spike protein is found to be an important indicator of the contagiousness of the coronavirus.

Paper
Add Code

Label Anchored Contrastive Learning for Language Understanding

no code implementations • NAACL 2022 • Zhenyu Zhang, Yuming Zhao, Meng Chen, Xiaodong He

Motivated by this, we propose a novel label anchored contrastive learning approach (denoted as LaCon) for language understanding.

Benchmarking Contrastive Learning +3

Paper
Add Code

SE-GAN: Skeleton Enhanced GAN-based Model for Brush Handwriting Font Generation

no code implementations • 22 Apr 2022 • Shaozu Yuan, Ruixue Liu, Meng Chen, Baoyang Chen, Zhijie Qiu, Xiaodong He

There is rare research on brush handwriting font generation, which involves holistic structure changes and complex strokes transfer.

Font Generation

Paper
Add Code

Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue

no code implementations • 18 Apr 2022 • Jiudong Yang, Peiying Wang, Yi Zhu, Mingchao Feng, Meng Chen, Xiaodong He

Turn-taking, aiming to decide when the next speaker can start talking, is an essential component in building human-robot spoken dialogue systems.

Contrastive Learning Data Augmentation +2

Paper
Add Code

A Roadmap for Big Model

no code implementations • 26 Mar 2022 • Sha Yuan, Hanyu Zhao, Shuai Zhao, Jiahong Leng, Yangxiao Liang, Xiaozhi Wang, Jifan Yu, Xin Lv, Zhou Shao, Jiaao He, Yankai Lin, Xu Han, Zhenghao Liu, Ning Ding, Yongming Rao, Yizhao Gao, Liang Zhang, Ming Ding, Cong Fang, Yisen Wang, Mingsheng Long, Jing Zhang, Yinpeng Dong, Tianyu Pang, Peng Cui, Lingxiao Huang, Zheng Liang, HuaWei Shen, HUI ZHANG, Quanshi Zhang, Qingxiu Dong, Zhixing Tan, Mingxuan Wang, Shuo Wang, Long Zhou, Haoran Li, Junwei Bao, Yingwei Pan, Weinan Zhang, Zhou Yu, Rui Yan, Chence Shi, Minghao Xu, Zuobai Zhang, Guoqiang Wang, Xiang Pan, Mengjie Li, Xiaoyu Chu, Zijun Yao, Fangwei Zhu, Shulin Cao, Weicheng Xue, Zixuan Ma, Zhengyan Zhang, Shengding Hu, Yujia Qin, Chaojun Xiao, Zheni Zeng, Ganqu Cui, Weize Chen, Weilin Zhao, Yuan YAO, Peng Li, Wenzhao Zheng, Wenliang Zhao, Ziyi Wang, Borui Zhang, Nanyi Fei, Anwen Hu, Zenan Ling, Haoyang Li, Boxi Cao, Xianpei Han, Weidong Zhan, Baobao Chang, Hao Sun, Jiawen Deng, Chujie Zheng, Juanzi Li, Lei Hou, Xigang Cao, Jidong Zhai, Zhiyuan Liu, Maosong Sun, Jiwen Lu, Zhiwu Lu, Qin Jin, Ruihua Song, Ji-Rong Wen, Zhouchen Lin, LiWei Wang, Hang Su, Jun Zhu, Zhifang Sui, Jiajun Zhang, Yang Liu, Xiaodong He, Minlie Huang, Jian Tang, Jie Tang

With the rapid development of deep learning, training Big Models (BMs) for multiple downstream tasks becomes a popular paradigm.

Language Modelling Machine Translation +1

Paper
Add Code

Building Robust Spoken Language Understanding by Cross Attention between Phoneme Sequence and ASR Hypothesis

no code implementations • 22 Mar 2022 • Zexun Wang, Yuquan Le, Yi Zhu, Yuming Zhao, Mingchao Feng, Meng Chen, Xiaodong He

Building Spoken Language Understanding (SLU) robust to Automatic Speech Recognition (ASR) errors is an essential issue for various voice-enabled virtual assistants.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Fine- and Coarse-Granularity Hybrid Self-Attention for Efficient BERT

1 code implementation • ACL 2022 • Jing Zhao, Yifan Wang, Junwei Bao, Youzheng Wu, Xiaodong He

To confront this, we propose FCA, a fine- and coarse-granularity hybrid self-attention that reduces the computation cost through progressively shortening the computational sequence length in self-attention.

Informativeness

Paper
Code

Space Layout of Low-entropy Hydration Shells Guides Protein Binding

no code implementations • 22 Feb 2022 • Lin Yang, Shuai Guo, Chengyu Hou, Chencheng Liao, Jiacheng Li, Liping Shi, Xiaoliang Ma, Shenda Jiang, Bing Zheng, Yi Fang, Lin Ye, Xiaodong He

According to an analysis of determined protein complex structures, shape matching between the largest low-entropy hydration shell region of a protein and that of its partner at the binding sites is revealed as a regular pattern.

Paper
Add Code

Cross-modal Contrastive Distillation for Instructional Activity Anticipation

no code implementations • 18 Jan 2022 • Zhengyuan Yang, Jingen Liu, Jing Huang, Xiaodong He, Tao Mei, Chenliang Xu, Jiebo Luo

In this study, we aim to predict the plausible future action steps given an observation of the past and study the task of instructional activity anticipation.

Knowledge Distillation

Paper
Add Code

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark

no code implementations • 27 Dec 2021 • Yuan YAO, Qingxiu Dong, Jian Guan, Boxi Cao, Zhengyan Zhang, Chaojun Xiao, Xiaozhi Wang, Fanchao Qi, Junwei Bao, Jinran Nie, Zheni Zeng, Yuxian Gu, Kun Zhou, Xuancheng Huang, Wenhao Li, Shuhuai Ren, Jinliang Lu, Chengqiang Xu, Huadong Wang, Guoyang Zeng, Zile Zhou, Jiajun Zhang, Juanzi Li, Minlie Huang, Rui Yan, Xiaodong He, Xiaojun Wan, Xin Zhao, Xu sun, Yang Liu, Zhiyuan Liu, Xianpei Han, Erhong Yang, Zhifang Sui, Maosong Sun

We argue that for general-purpose language intelligence evaluation, the benchmark itself needs to be comprehensive and systematic.

Paper
Add Code

ViDA-MAN: Visual Dialog with Digital Humans

no code implementations • 26 Oct 2021 • Tong Shen, Jiawei Zuo, Fan Shi, Jin Zhang, Liqin Jiang, Meng Chen, Zhengchen Zhang, Wei zhang, Xiaodong He, Tao Mei

We demonstrate ViDA-MAN, a digital-human agent for multi-modal interaction, which offers realtime audio-visual responses to instant speech inquiries.

speech-recognition Speech Recognition +2

Paper
Add Code

SCaLa: Supervised Contrastive Learning for End-to-End Speech Recognition

no code implementations • 8 Oct 2021 • Li Fu, Xiaoxiao Li, Runyu Wang, Lu Fan, Zhengchen Zhang, Meng Chen, Youzheng Wu, Xiaodong He

End-to-end Automatic Speech Recognition (ASR) models are usually trained to optimize the loss of the whole token sequence, while neglecting explicit phonemic-granularity supervision.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

The JDDC 2.0 Corpus: A Large-Scale Multimodal Multi-Turn Chinese Dialogue Dataset for E-commerce Customer Service

no code implementations • 27 Sep 2021 • Nan Zhao, Haoran Li, Youzheng Wu, Xiaodong He, BoWen Zhou

We present the solutions of top-5 teams participating in the JDDC multimodal dialogue challenge based on this dataset, which provides valuable insights for further researches on the multimodal dialogue task.

Paper
Add Code

RoR: Read-over-Read for Long Document Machine Reading Comprehension

1 code implementation • Findings (EMNLP) 2021 • Jing Zhao, Junwei Bao, Yifan Wang, Yongwei Zhou, Youzheng Wu, Xiaodong He, BoWen Zhou

To address this problem, we propose RoR, a read-over-read method, which expands the reading field from chunk to document.

Machine Reading Comprehension TriviaQA

Paper
Code

CUSTOM: Aspect-Oriented Product Summarization for E-Commerce

1 code implementation • 18 Aug 2021 • Jiahui Liang, Junwei Bao, Yifan Wang, Youzheng Wu, Xiaodong He, BoWen Zhou

To address the problem, we propose CUSTOM, aspect-oriented product summarization for e-commerce, which generates diverse and controllable summaries towards different product aspects.

Paper
Code

EviDR: Evidence-Emphasized Discrete Reasoning for Reasoning Machine Reading Comprehension

1 code implementation • 18 Aug 2021 • Yongwei Zhou, Junwei Bao, Haipeng Sun, Jiahui Liang, Youzheng Wu, Xiaodong He, BoWen Zhou, Tiejun Zhao

Reasoning machine reading comprehension (R-MRC) aims to answer complex questions that require discrete reasoning based on text.

Attribute Machine Reading Comprehension +1

Paper
Code

Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation

1 code implementation • 29 Jun 2021 • Guangyi Liu, Zichao Yang, Tianhua Tao, Xiaodan Liang, Junwei Bao, Zhen Li, Xiaodong He, Shuguang Cui, Zhiting Hu

Such training objective is sub-optimal when the target sequence is not perfect, e. g., when the target sequence is corrupted with noises, or when only weak sequence supervision is available.

Machine Translation Style Transfer +3

Paper
Code

Joint System-Wise Optimization for Pipeline Goal-Oriented Dialog System

no code implementations • 9 Jun 2021 • Zichuan Lin, Jing Huang, BoWen Zhou, Xiaodong He, Tengyu Ma

Recent work (Takanobu et al., 2020) proposed the system-wise evaluation on dialog systems and found that improvement on individual components (e. g., NLU, policy) in prior work may not necessarily bring benefit to pipeline systems in system-wise evaluation.

Data Augmentation Goal-Oriented Dialog

Paper
Add Code

RevCore: Review-augmented Conversational Recommendation

1 code implementation • Findings (ACL) 2021 • Yu Lu, Junwei Bao, Yan Song, Zichen Ma, Shuguang Cui, Youzheng Wu, Xiaodong He

Existing conversational recommendation (CR) systems usually suffer from insufficient item information when conducted on short dialogue history and unfamiliar items.

Response Generation

Paper
Code

Conversational AI Systems for Social Good: Opportunities and Challenges

no code implementations • 13 May 2021 • Peng Qi, Jing Huang, Youzheng Wu, Xiaodong He, BoWen Zhou

Conversational artificial intelligence (ConvAI) systems have attracted much academic and commercial attention recently, making significant progress on both fronts.

Paper
Add Code

SGG: Learning to Select, Guide, and Generate for Keyphrase Generation

1 code implementation • NAACL 2021 • Jing Zhao, Junwei Bao, Yifan Wang, Youzheng Wu, Xiaodong He, BoWen Zhou

Keyphrases, that concisely summarize the high-level topics discussed in a document, can be categorized into present keyphrase which explicitly appears in the source text, and absent keyphrase which does not match any contiguous subsequence but is highly semantically related to the source.

Keyphrase Generation Text Generation

Paper
Code

K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce

1 code implementation • Findings (EMNLP) 2021 • Song Xu, Haoran Li, Peng Yuan, Yujia Wang, Youzheng Wu, Xiaodong He, Ying Liu, BoWen Zhou

K-PLUG achieves new state-of-the-art results on a suite of domain-specific NLP tasks, including product knowledge base completion, abstractive product summarization, and multi-turn dialogue, significantly outperforms baselines across the board, which demonstrates that the proposed method effectively learns a diverse set of domain-specific knowledge for both language understanding and generation tasks.

Knowledge Base Completion Language Modelling +2

Paper
Code

Graph Ensemble Learning over Multiple Dependency Trees for Aspect-level Sentiment Classification

no code implementations • NAACL 2021 • Xiaochen Hou, Peng Qi, Guangtao Wang, Rex Ying, Jing Huang, Xiaodong He, BoWen Zhou

Recent work on aspect-level sentiment classification has demonstrated the efficacy of incorporating syntactic structures such as dependency trees with graph neural networks(GNN), but these approaches are usually vulnerable to parsing errors.

Ensemble Learning General Classification +2

Paper
Add Code

Hydrophobic interaction determines docking affinity of SARS CoV 2 variants with antibodies

no code implementations • 28 Feb 2021 • Jiacheng Li, Chengyu Hou, Menghao Wang, Chencheng Liao, Shuai Guo, Liping Shi, Xiaoliang Ma, Hongchi Zhang, Shenda Jiang, Bing Zheng, Lin Ye, Lin Yang, Xiaodong He

Preliminary epidemiologic, phylogenetic and clinical findings suggest that several novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants have increased transmissibility and decreased efficacy of several existing vaccines.

Paper
Add Code

Conversational Query Rewriting with Self-supervised Learning

no code implementations • 9 Feb 2021 • Hang Liu, Meng Chen, Youzheng Wu, Xiaodong He, BoWen Zhou

Conversational Query Rewriting (CQR) aims to simplify the multi-turn dialogue modeling into a single-turn problem by explicitly rewriting the conversational query into a self-contained utterance.

Self-Supervised Learning

Paper
Add Code

K-PLUG: KNOWLEDGE-INJECTED PRE-TRAINED LANGUAGE MODEL FOR NATURAL LANGUAGE UNDERSTANDING AND GENERATION

1 code implementation • 1 Jan 2021 • Song Xu, Haoran Li, Peng Yuan, Yujia Wang, Youzheng Wu, Xiaodong He, Ying Liu, BoWen Zhou

Chatbot Knowledge Base Completion +4

Paper
Code

Multimodal Sentence Summarization via Multimodal Selective Encoding

no code implementations • COLING 2020 • Haoran Li, Junnan Zhu, Jiajun Zhang, Xiaodong He, Chengqing Zong

Thus, we propose a multimodal selective gate network that considers reciprocal relationships between textual and multi-level visual features, including global image descriptor, activation grids, and object proposals, to select highlights of the event when encoding the source sentence.

Sentence Sentence Summarization

Paper
Add Code

On the Faithfulness for E-commerce Product Summarization

1 code implementation • COLING 2020 • Peng Yuan, Haoran Li, Song Xu, Youzheng Wu, Xiaodong He, BoWen Zhou

In this work, we present a model to generate e-commerce product summaries.

Attribute

Paper
Code

Group Contextual Encoding for 3D Point Clouds

1 code implementation • NeurIPS 2020 • Xu Liu, Chengtao Li, Jian Wang, Jingbo Wang, Boxin Shi, Xiaodong He

In this work, we extended the contextual encoding layer that was originally designed for 2D tasks to 3D Point Cloud scenarios.

Scene Understanding

Paper
Code

Improving Prosody Modelling with Cross-Utterance BERT Embeddings for End-to-end Speech Synthesis

no code implementations • 6 Nov 2020 • Guanghui Xu, Wei Song, Zhengchen Zhang, Chao Zhang, Xiaodong He, BoWen Zhou

Despite prosody is related to the linguistic information up to the discourse structure, most text-to-speech (TTS) systems only take into account that within each sentence, which makes it challenging when converting a paragraph of texts into natural and expressive speech.

Sentence Sentence Embeddings +1

Paper
Add Code

Enhancing Automated Essay Scoring Performance via Fine-tuning Pre-trained Language Models with Combination of Regression and Ranking

no code implementations • Findings of the Association for Computational Linguistics 2020 • Ruosong Yang, Jiannong Cao, Zhiyuan Wen, Youzheng Wu, Xiaodong He

However, to solve the AES task, previous works utilize shallow neural networks to learn essay representations and constrain calculated scores with regression loss or ranking loss, respectively.

Automated Essay Scoring Language Modelling +3

Paper
Add Code

Learning to Decouple Relations: Few-Shot Relation Classification with Entity-Guided Attention and Confusion-Aware Training

no code implementations • COLING 2020 • Yingyao Wang, Junwei Bao, Guangyi Liu, Youzheng Wu, Xiaodong He, BoWen Zhou, Tiejun Zhao

This paper aims to enhance the few-shot relation classification especially for sentences that jointly describe multiple relations.

Few-Shot Relation Classification Relation +1

Paper
Add Code

The role of hydrophobic interactions in folding of $β$-sheets

no code implementations • 16 Sep 2020 • Jiacheng Li, Xiaoliang Ma, Hongchi Zhang, Chengyu Hou, Liping Shi, Shuai Guo, Chenchen Liao, Bing Zheng, Lin Ye, Lin Yang, Xiaodong He

Exploring the protein-folding problem has been a long-standing challenge in molecular biology.

Protein Folding

Paper
Add Code

Multimodal Joint Attribute Prediction and Value Extraction for E-commerce Product

2 code implementations • EMNLP 2020 • Tiangang Zhu, Yue Wang, Haoran Li, Youzheng Wu, Xiaodong He, Bo-Wen Zhou

We annotate a multimodal product attribute value dataset that contains 87, 194 instances, and the experimental results on this dataset demonstrate that explicitly modeling the relationship between attributes and values facilitates our method to establish the correspondence between them, and selectively utilizing visual product information is necessary for the task.

Attribute Attribute Value Extraction +1

Paper
Code

A hydrophobic-interaction-based mechanism trigger docking between the SARS CoV 2 spike and angiotensin-converting enzyme 2

no code implementations • 27 Aug 2020 • Jiacheng Li, Xiaoliang Ma, Shuai Guo, Chengyu Hou, Liping Shi, Hongchi Zhang, Bing Zheng, Chencheng Liao, Lin Yang, Lin Ye, Xiaodong He

The hydrophobic interaction between the SARS-CoV-2 S and ACE2 protein is found to be significantly greater than that between SARS-CoV S and ACE2.

Paper
Add Code

Neural Kalman Filtering for Speech Enhancement

no code implementations • 28 Jul 2020 • Wei Xue, Gang Quan, Chao Zhang, Guohong Ding, Xiaodong He, BoWen Zhou

Statistical signal processing based speech enhancement methods adopt expert knowledge to design the statistical models and linear filters, which is complementary to the deep neural network (DNN) based methods which are data-driven.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Self-Attention Guided Copy Mechanism for Abstractive Summarization

no code implementations • ACL 2020 • Song Xu, Haoran Li, Peng Yuan, Youzheng Wu, Xiaodong He, Bo-Wen Zhou

Copy module has been widely equipped in the recent abstractive summarization models, which facilitates the decoder to extract words from the source into the summary.

Abstractive Text Summarization

Paper
Add Code

Incremental Learning for End-to-End Automatic Speech Recognition

no code implementations • 11 May 2020 • Li Fu, Xiaoxiao Li, Libo Zi, Zhengchen Zhang, Youzheng Wu, Xiaodong He, BoWen Zhou

In this paper, we propose an incremental learning method for end-to-end Automatic Speech Recognition (ASR) which enables an ASR system to perform well on new tasks while maintaining the performance on its originally learned ones.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Speaker Diarization with Lexical Information

no code implementations • 13 Apr 2020 • Tae Jin Park, Kyu J. Han, Jing Huang, Xiaodong He, Bo-Wen Zhou, Panayiotis Georgiou, Shrikanth Narayanan

This work presents a novel approach for speaker diarization to leverage lexical information provided by automatic speech recognition.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Graph Sequential Network for Reasoning over Sequences

no code implementations • 4 Apr 2020 • Ming Tu, Jing Huang, Xiaodong He, Bo-Wen Zhou

We validate the proposed GSN on two NLP tasks: interpretable multi-hop reading comprehension on HotpotQA and graph based fact verification on FEVER.

Fact Verification Machine Reading Comprehension +1

Paper
Add Code

DR-GAN: Conditional Generative Adversarial Network for Fine-Grained Lesion Synthesis on Diabetic Retinopathy Images

no code implementations • 10 Dec 2019 • Yi Zhou, Boyang Wang, Xiaodong He, Shanshan Cui, Ling Shao

In this paper, we propose a diabetic retinopathy generative adversarial network (DR-GAN) to synthesize high-resolution fundus images which can be manipulated with arbitrary grading and lesion information.

Data Augmentation Generative Adversarial Network +1

Paper
Add Code

The JDDC Corpus: A Large-Scale Multi-Turn Chinese Dialogue Dataset for E-commerce Customer Service

no code implementations • LREC 2020 • Meng Chen, Ruixue Liu, Lei Shen, Shaozu Yuan, Jingyan Zhou, Youzheng Wu, Xiaodong He, Bo-Wen Zhou

Human conversations are complicated and building a human-like dialogue agent is an extremely challenging task.

Question Answering Retrieval

Paper
Add Code

Multimodal Intelligence: Representation Learning, Information Fusion, and Applications

no code implementations • 10 Nov 2019 • Chao Zhang, Zichao Yang, Xiaodong He, Li Deng

This review provides a comprehensive analysis of recent works on multimodal deep learning from three perspectives: learning multimodal representations, fusing multimodal signals at various levels, and multimodal applications.

Caption Generation Multimodal Deep Learning +6

Paper
Add Code

Orthogonal Relation Transforms with Graph Context Modeling for Knowledge Graph Embedding

no code implementations • ACL 2020 • Yun Tang, Jing Huang, Guangtao Wang, Xiaodong He, Bo-Wen Zhou

Translational distance-based knowledge graph embedding has shown progressive improvements on the link prediction task, from TransE to the latest state-of-the-art RotatE.

Ranked #18 on Link Prediction on FB15k-237

Knowledge Graph Embedding Link Prediction +1

Paper
Add Code

Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents

1 code implementation • 1 Nov 2019 • Ming Tu, Kevin Huang, Guangtao Wang, Jing Huang, Xiaodong He, Bo-Wen Zhou

Interpretable multi-hop reading comprehension (RC) over multiple documents is a challenging problem because it demands reasoning over multiple information sources and explaining the answer prediction by providing supporting evidences.

Learning-To-Rank Multi-Hop Reading Comprehension +2

Paper
Code

Relation Module for Non-Answerable Predictions on Reading Comprehension

no code implementations • CONLL 2019 • Kevin Huang, Yun Tang, Jing Huang, Xiaodong He, Bo-Wen Zhou

We test the relation module on the SQuAD 2. 0 dataset using both the BiDAF and BERT models as baseline readers.

Machine Reading Comprehension Relation +2

Paper
Add Code

Selective Attention Based Graph Convolutional Networks for Aspect-Level Sentiment Classification

no code implementations • NAACL (TextGraphs) 2021 • Xiaochen Hou, Jing Huang, Guangtao Wang, Xiaodong He, BoWen Zhou

Aspect-level sentiment classification aims to identify the sentiment polarity towards a specific aspect term in a sentence.

General Classification Sentence +2

Paper
Add Code

Relation Module for Non-answerable Prediction on Question Answering

no code implementations • 23 Oct 2019 • Kevin Huang, Yun Tang, Jing Huang, Xiaodong He, Bo-Wen Zhou

In this paper, we aim to improve a MRC model's ability to determine whether a question has an answer in a given context (e. g. the recently proposed SQuAD 2. 0 task).

Machine Reading Comprehension Question Answering +3

Paper
Add Code

Zero-shot Text-to-SQL Learning with Auxiliary Task

1 code implementation • 29 Aug 2019 • Shuaichen Chang, PengFei Liu, Yun Tang, Jing Huang, Xiaodong He, Bo-Wen Zhou

Recent years have seen great success in the use of neural seq2seq models on the text-to-SQL task.

Text-To-SQL

Paper
Code

Multiple instance learning with graph neural networks

no code implementations • 12 Jun 2019 • Ming Tu, Jing Huang, Xiaodong He, Bo-Wen Zhou

In this paper, we propose a new end-to-end graph neural network (GNN) based algorithm for MIL: we treat each bag as a graph and use GNN to learn the bag embedding, in order to explore the useful structural information among instances in bags.

Multiple Instance Learning

Paper
Add Code

Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs

no code implementations • ACL 2019 • Ming Tu, Guangtao Wang, Jing Huang, Yun Tang, Xiaodong He, Bo-Wen Zhou

We introduce a heterogeneous graph with different types of nodes and edges, which is named as Heterogeneous Document-Entity (HDE) graph.

Multi-Hop Reading Comprehension

Paper
Add Code

Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations

1 code implementation • NeurIPS 2019 • Fenglin Liu, Yuanxin Liu, Xuancheng Ren, Xiaodong He, Xu sun

In vision-and-language grounding problems, fine-grained representations of the image are considered to be of paramount importance.

Image Captioning Question Answering +2

Paper
Code

Mappa Mundi: An Interactive Artistic Mind Map Generator with Artificial Imagination

no code implementations • 9 May 2019 • Ruixue Liu, Baoyang Chen, Meng Chen, Youzheng Wu, Zhijie Qiu, Xiaodong He

We present a novel real-time, collaborative, and interactive AI painting system, Mappa Mundi, for artistic Mind Map creation.

Paper
Add Code

Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling

1 code implementation • IJCAI 2019 2019 • Pengcheng Yang, Fuli Luo, Peng Chen, Lei LI, Zhiyi Yin, Xiaodong He, Xu sun

The visual storytelling (VST) task aims at generating a reasonable and coherent paragraph-level story with the image stream as input.

Ranked #21 on Visual Storytelling on VIST

Knowledge Graphs Semantic Similarity +2

Paper
Code

From Knowledge Map to Mind Map: Artificial Imagination

no code implementations • 4 Mar 2019 • Ruixue Liu, Baoyang Chen, XIAOYU GUO, Yan Dai, Meng Chen, Zhijie Qiu, Xiaodong He

Imagination is one of the most important factors which makes an artistic painting unique and impressive.

Paper
Add Code

Object-driven Text-to-Image Synthesis via Adversarial Training

1 code implementation • CVPR 2019 • Wenbo Li, Pengchuan Zhang, Lei Zhang, Qiuyuan Huang, Xiaodong He, Siwei Lyu, Jianfeng Gao

In this paper, we propose Object-driven Attentive Generative Adversarial Newtorks (Obj-GANs) that allow object-centered text-to-image synthesis for complex scenes.

Image Generation Object

283

Paper
Code

Deep Speaker Embedding Learning with Multi-Level Pooling for Text-Independent Speaker Verification

no code implementations • 21 Feb 2019 • Yun Tang, Guohong Ding, Jing Huang, Xiaodong He, Bo-Wen Zhou

This paper aims to improve the widely used deep speaker embedding x-vector model.

Text-Independent Speaker Verification

Paper
Add Code

End-to-end Structure-Aware Convolutional Networks for Knowledge Base Completion

1 code implementation • 11 Nov 2018 • Chao Shang, Yun Tang, Jing Huang, Jinbo Bi, Xiaodong He, Bo-Wen Zhou

The recent graph convolutional network (GCN) provides another way of learning graph node embedding by successfully utilizing graph connectivity structure.

Ranked #28 on Link Prediction on FB15k-237

Knowledge Base Completion Knowledge Graph Embedding +2

108

Paper
Code

Policy Shaping and Generalized Update Equations for Semantic Parsing from Denotations

no code implementations • EMNLP 2018 • Dipendra Misra, Ming-Wei Chang, Xiaodong He, Wen-tau Yih

Semantic parsing from denotations faces two key challenges in model training: (1) given only the denotations (e. g., answers), search for good candidate semantic parses, and (2) choose the best model update algorithm.

Question Answering Semantic Parsing

Paper
Add Code

Deep Reinforcement Learning for NLP

no code implementations • ACL 2018 • William Yang Wang, Jiwei Li, Xiaodong He

Many Natural Language Processing (NLP) tasks (including generation, language grounding, reasoning, information extraction, coreference resolution, and dialog) can be formulated as deep reinforcement learning (DRL) problems.

Atari Games coreference-resolution +7

Paper
Add Code

Hierarchically Structured Reinforcement Learning for Topically Coherent Visual Story Generation

no code implementations • 21 May 2018 • Qiuyuan Huang, Zhe Gan, Asli Celikyilmaz, Dapeng Wu, Jian-Feng Wang, Xiaodong He

We propose a hierarchically structured reinforcement learning approach to address the challenges of planning for generating coherent multi-sentence stories for the visual storytelling task.

Ranked #24 on Visual Storytelling on VIST

reinforcement-learning Reinforcement Learning (RL) +2

Paper
Add Code

Discourse-Aware Neural Rewards for Coherent Text Generation

no code implementations • NAACL 2018 • Antoine Bosselut, Asli Celikyilmaz, Xiaodong He, Jianfeng Gao, Po-Sen Huang, Yejin Choi

In this paper, we investigate the use of discourse-aware rewards with reinforcement learning to guide a model to generate long, coherent text.

reinforcement-learning Reinforcement Learning (RL) +3

Paper
Add Code

Generating Diverse and Accurate Visual Captions by Comparative Adversarial Learning

1 code implementation • 3 Apr 2018 • Dianqi Li, Qiuyuan Huang, Xiaodong He, Lei Zhang, Ming-Ting Sun

By contrasting with human-written captions and image-mismatched captions, the caption generator effectively exploits the inherent characteristics of human languages, and generates more discriminative captions.

Generative Adversarial Network

Paper
Code

Deep Communicating Agents for Abstractive Summarization

no code implementations • NAACL 2018 • Asli Celikyilmaz, Antoine Bosselut, Xiaodong He, Yejin Choi

We present deep communicating agents in an encoder-decoder architecture to address the challenges of representing a long document for abstractive summarization.

Ranked #31 on Abstractive Text Summarization on CNN / Daily Mail (using extra training data)

Abstractive Text Summarization reinforcement-learning +1

Paper
Add Code

Stacked Cross Attention for Image-Text Matching

6 code implementations • ECCV 2018 • Kuang-Huei Lee, Xi Chen, Gang Hua, Houdong Hu, Xiaodong He

Prior work either simply aggregates the similarity of all possible pairs of regions and words without attending differentially to more and less important words or regions, or uses a multi-step attentional process to capture limited number of semantic alignments which is less interpretable.

Ranked #4 on Image Retrieval on PhotoChat

Image Retrieval Image-text matching +5

516

Paper
Code

Natural Language to Structured Query Generation via Meta-Learning

1 code implementation • NAACL 2018 • Po-Sen Huang, Chenglong Wang, Rishabh Singh, Wen-tau Yih, Xiaodong He

In conventional supervised training, a model is trained to fit all the training examples.

Ranked #7 on Code Generation on WikiSQL

Meta-Learning

129

Paper
Code

Attentive Tensor Product Learning

no code implementations • 20 Feb 2018 • Qiuyuan Huang, Li Deng, Dapeng Wu, Chang Liu, Xiaodong He

This paper proposes a new architecture - Attentive Tensor Product Learning (ATPL) - to represent grammatical structures in deep learning models.

Constituency Parsing Image Captioning +4

Paper
Add Code

Constrained Convolutional-Recurrent Networks to Improve Speech Quality with Low Impact on Recognition Accuracy

no code implementations • 16 Feb 2018 • Rasool Fakoor, Xiaodong He, Ivan Tashev, Shuayb Zarar

For a speech-enhancement algorithm, it is highly desirable to simultaneously improve perceptual quality and recognition rate.

Language Modelling Speech Enhancement

Paper
Add Code

From Eliza to XiaoIce: Challenges and Opportunities with Social Chatbots

no code implementations • 6 Jan 2018 • Heung-Yeung Shum, Xiaodong He, Di Li

Conversational systems have come a long way since their inception in the 1960s.

Chatbot

Paper
Add Code

Reinforcement Learning To Adapt Speech Enhancement to Instantaneous Input Signal Quality

no code implementations • 29 Nov 2017 • Rasool Fakoor, Xiaodong He, Ivan Tashev, Shuayb Zarar

Today, the optimal performance of existing noise-suppression algorithms, both data-driven and those based on classic statistical methods, is range bound to specific levels of instantaneous input signal-to-noise ratios.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks

19 code implementations • CVPR 2018 • Tao Xu, Pengchuan Zhang, Qiuyuan Huang, Han Zhang, Zhe Gan, Xiaolei Huang, Xiaodong He

In this paper, we propose an Attentional Generative Adversarial Network (AttnGAN) that allows attention-driven, multi-stage refinement for fine-grained text-to-image generation.

Ranked #1 on Text-to-Image Generation on MS-COCO

Generative Adversarial Network Image-text matching +2

1,318

Paper
Code

CleanNet: Transfer Learning for Scalable Image Classifier Training with Label Noise

3 code implementations • CVPR 2018 • Kuang-Huei Lee, Xiaodong He, Lei Zhang, Linjun Yang

We demonstrate the effectiveness of the proposed algorithm on both of the label noise detection task and the image classification on noisy data task on several large-scale datasets.

Ranked #2 on Image Classification on Food-101N (using extra training data)

Classification General Classification +2

Paper
Code

On the Discrimination-Generalization Tradeoff in GANs

no code implementations • ICLR 2018 • Pengchuan Zhang, Qiang Liu, Dengyong Zhou, Tao Xu, Xiaodong He

When evaluated with neural distance, our bounds show that generalization is guaranteed as long as the discriminator set is small enough, regardless of the size of the generator or hypothesis set.

Generalization Bounds

Paper
Add Code

A Neural-Symbolic Approach to Design of CAPTCHA

no code implementations • 29 Oct 2017 • Qiuyuan Huang, Paul Smolensky, Xiaodong He, Li Deng, Dapeng Wu

To address this, this paper promotes image/visual captioning based CAPTCHAs, which is robust against machine-learning-based attacks.

BIG-bench Machine Learning Image Captioning +1

Paper
Add Code

Tensor Product Generation Networks for Deep NLP Modeling

2 code implementations • NAACL 2018 • Qiuyuan Huang, Paul Smolensky, Xiaodong He, Li Deng, Dapeng Wu

We present a new approach to the design of deep networks for natural language processing (NLP), based on the general technique of Tensor Product Representations (TPRs) for encoding and processing symbol structures in distributed neural networks.

Caption Generation

Paper
Code

Multiple-Kernel Based Vehicle Tracking Using 3D Deformable Model and Camera Self-Calibration

no code implementations • 22 Aug 2017 • Zheng Tang, Gaoang Wang, Tao Liu, Young-Gun Lee, Adwin Jahn, Xu Liu, Xiaodong He, Jenq-Neng Hwang

In this challenge, we propose a model-based vehicle localization method, which builds a kernel at each patch of the 3D deformable vehicle model and associates them with constraints in 3D space.

Ensemble Learning object-detection +1

Paper
Add Code

Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge

10 code implementations • CVPR 2018 • Damien Teney, Peter Anderson, Xiaodong He, Anton Van Den Hengel

This paper presents a state-of-the-art model for visual question answering (VQA), which won the first place in the 2017 VQA Challenge.

Ranked #30 on Visual Question Answering (VQA) on VQA v2 test-std

Visual Question Answering

1,401

Paper
Code

Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

65 code implementations • CVPR 2018 • Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, Lei Zhang

Top-down visual attention mechanisms have been used extensively in image captioning and visual question answering (VQA) to enable deeper image understanding through fine-grained analysis and even multiple steps of reasoning.

Ranked #29 on Visual Question Answering (VQA) on VQA v2 test-std

Image Captioning Visual Question Answering

5,414

Paper
Code

StyleNet: Generating Attractive Visual Captions With Styles

no code implementations • CVPR 2017 • Chuang Gan, Zhe Gan, Xiaodong He, Jianfeng Gao, Li Deng

We propose a novel framework named StyleNet to address the task of generating attractive captions for images and videos with different styles.

Caption Generation

Paper
Add Code

Two-Stage Synthesis Networks for Transfer Learning in Machine Comprehension

2 code implementations • EMNLP 2017 • David Golub, Po-Sen Huang, Xiaodong He, Li Deng

We develop a technique for transfer learning in machine comprehension (MC) using a novel two-stage synthesis network (SynNet).

Reading Comprehension Transfer Learning +1

110

Paper
Code

Adversarial Ranking for Language Generation

1 code implementation • NeurIPS 2017 • Kevin Lin, Dianqi Li, Xiaodong He, Zhengyou Zhang, Ming-Ting Sun

Rather than training the discriminator to learn and assign absolute binary predicate for individual data sample, the proposed RankGAN is able to analyze and rank a collection of human-written and machine-written sentences by giving a reference group.

Ranked #1 on Text Generation on Chinese Poems

Generative Adversarial Network Text Generation

Paper
Code

Question-Answering with Grammatically-Interpretable Representations

no code implementations • 23 May 2017 • Hamid Palangi, Paul Smolensky, Xiaodong He, Li Deng

In our application of TPRN, internal representations learned by end-to-end optimization in a deep neural network performing a textual question-answering (QA) task can be interpreted using basic concepts from linguistic theory.

Inductive Bias Question Answering

Paper
Add Code

Reinforcement Learning with External Knowledge and Two-Stage Q-functions for Predicting Popular Reddit Threads

no code implementations • 20 Apr 2017 • Ji He, Mari Ostendorf, Xiaodong He

This paper addresses the problem of predicting popularity of comments in an online discussion forum using reinforcement learning, particularly addressing two challenges that arise from having natural language state and action spaces.

Q-Learning reinforcement-learning +1

Paper
Add Code

Character-level Deep Conflation for Business Data Analytics

2 code implementations • 8 Feb 2017 • Zhe Gan, P. D. Singh, Ameet Joshi, Xiaodong He, Jianshu Chen, Jianfeng Gao, Li Deng

Connecting different text attributes associated with the same entity (conflation) is important in business data analytics since it could help merge two different tables in a database to provide a more comprehensive profile of an entity.

Paper
Code

Deep Learning with Low Precision by Half-wave Gaussian Quantization

1 code implementation • CVPR 2017 • Zhaowei Cai, Xiaodong He, Jian Sun, Nuno Vasconcelos

The problem of quantizing the activations of a deep neural network is considered.

Quantization

118

Paper
Code

Learning Generic Sentence Representations Using Convolutional Neural Networks

no code implementations • EMNLP 2017 • Zhe Gan, Yunchen Pu, Ricardo Henao, Chunyuan Li, Xiaodong He, Lawrence Carin

We propose a new encoder-decoder approach to learn distributed sentence representations that are applicable to multiple purposes.

Sentence

Paper
Add Code

Semantic Compositional Networks for Visual Captioning

1 code implementation • CVPR 2017 • Zhe Gan, Chuang Gan, Xiaodong He, Yunchen Pu, Kenneth Tran, Jianfeng Gao, Lawrence Carin, Li Deng

The degree to which each member of the ensemble is used to generate an image caption is tied to the image-dependent probability of the corresponding tag.

Image Captioning Semantic Composition +1

Paper
Code

Bi-directional Attention with Agreement for Dependency Parsing

1 code implementation • EMNLP 2016 • Hao Cheng, Hao Fang, Xiaodong He, Jianfeng Gao, Li Deng

We develop a novel bi-directional attention model for dependency parsing, which learns to agree on headword predictions from the forward and backward parsing directions.

Ranked #4 on Chinese Dependency Parsing on Chinese Pennbank

Dependency Parsing

Paper
Code

MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition

11 code implementations • 27 Jul 2016 • Yandong Guo, Lei Zhang, Yuxiao Hu, Xiaodong He, Jianfeng Gao

In this paper, we design a benchmark task and provide the associated datasets for recognizing face images and link them to corresponding entity keys in a knowledge base.

Face Recognition Image Captioning

21,168

Paper
Code

Unsupervised Learning of Predictors from Unpaired Input-Output Samples

no code implementations • 15 Jun 2016 • Jianshu Chen, Po-Sen Huang, Xiaodong He, Jianfeng Gao, Li Deng

In particular, we show that with regularization via a generative model, learning with the proposed unsupervised objective function converges to an optimal solution.

Paper
Add Code

Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads

1 code implementation • EMNLP 2016 • Ji He, Mari Ostendorf, Xiaodong He, Jianshu Chen, Jianfeng Gao, Lihong Li, Li Deng

We introduce an online popularity prediction and tracking task as a benchmark task for reinforcement learning with a combinatorial, natural language action space.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories

no code implementations • NAACL 2016 • Nasrin Mostafazadeh, Nathanael Chambers, Xiaodong He, Devi Parikh, Dhruv Batra, V, Lucy erwende, Pushmeet Kohli, James Allen

Question Answering Text Summarization

Paper
Add Code

Hierarchical Attention Networks for Document Classification

1 code implementation • NAACL 2016 • Zichao Yang, Diyi Yang, Chris Dyer, Xiaodong He, Alex Smola, Eduard Hovy

Ranked #4 on Text Classification on arXiv-10

Citation Intent Classification Document Classification +2

457

Paper
Code

Visual Storytelling

1 code implementation • NAACL 2016 • Ting-Hao, Huang, Francis Ferraro, Nasrin Mostafazadeh, Ishan Misra, Aishwarya Agrawal, Jacob Devlin, Ross Girshick, Xiaodong He, Pushmeet Kohli, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh, Lucy Vanderwende, Michel Galley, Margaret Mitchell

We introduce the first dataset for sequential vision-to-language, and explore how this data may be used for the task of visual storytelling.

Descriptive Visual Storytelling

Paper
Code

A Corpus and Evaluation Framework for Deeper Understanding of Commonsense Stories

no code implementations • 6 Apr 2016 • Nasrin Mostafazadeh, Nathanael Chambers, Xiaodong He, Devi Parikh, Dhruv Batra, Lucy Vanderwende, Pushmeet Kohli, James Allen

We created a new corpus of ~50k five-sentence commonsense stories, ROCStories, to enable this evaluation.

Cloze Test Sentence +1

Paper
Add Code

Character-Level Question Answering with Attention

1 code implementation • EMNLP 2016 • David Golub, Xiaodong He

We show that a character-level encoder-decoder framework can be successfully applied to question answering with a structured knowledge base.

Data Augmentation Question Answering

Paper
Code

Rich Image Captioning in the Wild

no code implementations • 30 Mar 2016 • Kenneth Tran, Xiaodong He, Lei Zhang, Jian Sun, Cornelia Carapcea, Chris Thrasher, Chris Buehler, Chris Sienkiewicz

We present an image caption system that addresses new challenges of automatically describing images in the wild.

Image Captioning

Paper
Add Code

Generating Natural Questions About an Image

2 code implementations • ACL 2016 • Nasrin Mostafazadeh, Ishan Misra, Jacob Devlin, Margaret Mitchell, Xiaodong He, Lucy Vanderwende

There has been an explosion of work in the vision & language community during the past few years from image captioning to video transcription, and answering questions about images.

Image Captioning Natural Questions +3

Paper
Code

Basic Reasoning with Tensor Product Representations

no code implementations • 12 Jan 2016 • Paul Smolensky, Moontae Lee, Xiaodong He, Wen-tau Yih, Jianfeng Gao, Li Deng

In this paper we present the initial development of a general theory for mapping inference in predicate logic to computation over Tensor Product Representations (TPRs; Smolensky (1990), Smolensky & Legendre (2006)).

Question Answering

Paper
Add Code

Reasoning in Vector Space: An Exploratory Study of Question Answering

no code implementations • 19 Nov 2015 • Moontae Lee, Xiaodong He, Wen-tau Yih, Jianfeng Gao, Li Deng, Paul Smolensky

Question answering tasks have shown remarkable progress with distributed vector representation.

Common Sense Reasoning Logical Reasoning +1

Paper
Add Code

Deep Reinforcement Learning with a Natural Language Action Space

3 code implementations • ACL 2016 • Ji He, Jianshu Chen, Xiaodong He, Jianfeng Gao, Lihong Li, Li Deng, Mari Ostendorf

This paper introduces a novel architecture for reinforcement learning with deep neural networks designed to handle state and action spaces characterized by natural language, as found in text-based games.

Q-Learning reinforcement-learning +2

Paper
Code

Stacked Attention Networks for Image Question Answering

16 code implementations • CVPR 2016 • Zichao Yang, Xiaodong He, Jianfeng Gao, Li Deng, Alex Smola

Thus, we develop a multiple-layer SAN in which we query an image multiple times to infer the answer progressively.

Ranked #5 on Visual Question Answering (VQA) on VQA v1 test-std

Visual Question Answering (VQA)

104

Paper
Code

Recurrent Reinforcement Learning: A Hybrid Approach

no code implementations • 10 Sep 2015 • Xiujun Li, Lihong Li, Jianfeng Gao, Xiaodong He, Jianshu Chen, Li Deng, Ji He

Successful applications of reinforcement learning in real-world problems often require dealing with partially observable states.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Data Selection With Fewer Words

no code implementations • WS 2015 • Amittai Axelrod, Philip Resnik, Xiaodong He, Mari Ostendorf

Language Modelling Machine Translation +1

Paper
Add Code

End-to-end Learning of LDA by Mirror-Descent Back Propagation over a Deep Architecture

1 code implementation • NeurIPS 2015 • Jianshu Chen, Ji He, Yelong Shen, Lin Xiao, Xiaodong He, Jianfeng Gao, Xinying Song, Li Deng

We develop a fully discriminative learning approach for supervised Latent Dirichlet Allocation (LDA) model using Back Propagation (i. e., BP-sLDA), which maximizes the posterior probability of the prediction variable given the input document.

General Classification Topic Models

Paper
Code

Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base

1 code implementation • IJCNLP 2015 • Wen-tau Yih, Ming-Wei Chang, Xiaodong He, Jianfeng Gao

Entity Linking Graph Generation +3

112

Paper
Code

A Multi-View Deep Learning Approach for Cross Domain User Modeling in Recommendation Systems

1 code implementation • WWW 2015 • Ali Elkahky, Yang song, Xiaodong He

We extend the model to jointly learn from features of items from different domains and user features by introducing a multi-view Deep Learning model.

News Recommendation Recommendation Systems

4,088

Paper
Code

Language Models for Image Captioning: The Quirks and What Works

no code implementations • IJCNLP 2015 • Jacob Devlin, Hao Cheng, Hao Fang, Saurabh Gupta, Li Deng, Xiaodong He, Geoffrey Zweig, Margaret Mitchell

Two recent approaches have achieved state-of-the-art results in image captioning.

Image Captioning Language Modelling +1

Paper
Add Code

Deep Learning and Continuous Representations for Natural Language Processing

no code implementations • HLT 2015 • Wen-tau Yih, Xiaodong He, Jianfeng Gao

Information Retrieval Language Modelling +9

Paper
Add Code

Representation Learning Using Multi-Task Deep Neural Networks for Semantic Classification and Information Retrieval

no code implementations • HLT 2015 • Jianfeng Gao, Kevin Duh, Ye-Yi Wang, Xiaodong He, Li Deng, Xiaodong Liu

Domain Adaptation domain classification +9

Paper
Add Code

Joint Learning of Distributed Representations for Images and Texts

no code implementations • 13 Apr 2015 • Xiaodong He, Rupesh Srivastava, Jianfeng Gao, Li Deng

The learned representations attempt to capture the combination of various visual concepts and cues.

Paper
Add Code

A Deep Embedding Model for Co-occurrence Learning

no code implementations • 11 Apr 2015 • Yelong Shen, Ruoming Jin, Jianshu Chen, Xiaodong He, Jianfeng Gao, Li Deng

Co-occurrence Data is a common and important information source in many areas, such as the word co-occurrence in the sentences, friends co-occurrence in social networks and products co-occurrence in commercial transaction data, etc, which contains rich correlation and clustering information about the items.

Clustering

Paper
Add Code

Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval

no code implementations • 24 Feb 2015 • Hamid Palangi, Li Deng, Yelong Shen, Jianfeng Gao, Xiaodong He, Jianshu Chen, Xinying Song, Rabab Ward

The results show that the proposed method in this paper significantly outperforms it for web document retrieval task.

Information Retrieval Retrieval +3

Paper
Add Code

Embedding Entities and Relations for Learning and Inference in Knowledge Bases

9 code implementations • 20 Dec 2014 • Bishan Yang, Wen-tau Yih, Xiaodong He, Jianfeng Gao, Li Deng

We consider learning representations of entities and relations in KBs using the neural-embedding approach.

Ranked #10 on Link Prediction on UMLS

Link Prediction

20,040

Paper
Code

From Captions to Visual Concepts and Back

1 code implementation • CVPR 2015 • Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh Srivastava, Li Deng, Piotr Dollár, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John C. Platt, C. Lawrence Zitnick, Geoffrey Zweig

The language model learns from a set of over 400, 000 image descriptions to capture the statistics of word usage.

Ranked #1 on Image Captioning on COCO Captions test

Image Captioning Language Modelling +3

150

Paper
Code

Learning Multi-Relational Semantics Using Neural-Embedding Models

no code implementations • 14 Nov 2014 • Bishan Yang, Wen-tau Yih, Xiaodong He, Jianfeng Gao, Li Deng

In this paper we present a unified framework for modeling multi-relational representations, scoring, and learning, and conduct an empirical study of several recent multi-relational embedding models under the framework.

Knowledge Base Completion

Paper
Add Code

Modeling Interestingness with Deep Neural Networks

no code implementations • EMNLP 2014 • Jianfeng Gao, Patrick Pantel, Michael Gamon, Xiaodong He, Li Deng

Recommendation Systems Semantic Textual Similarity +2

Paper
Add Code

Learning Continuous Phrase Representations for Translation Modeling

no code implementations • ACL 2014 • Jianfeng Gao, Xiaodong He, Wen-tau Yih, Li Deng

Machine Translation Translation

Paper
Add Code

Semantic Parsing for Single-Relation Question Answering

no code implementations • ACL 2014 • Wen-tau Yih, Xiaodong He, Christopher Meek

Open-Domain Question Answering Relation +2

Paper
Add Code

Learning Semantic Representations for the Phrase Translation Model

no code implementations • 28 Nov 2013 • Jianfeng Gao, Xiaodong He, Wen-tau Yih, Li Deng

The results show that the new semantic-based phrase translation model significantly improves the performance of a state-of-the-art phrase-based statistical machine translation sys-tem, leading to a gain of 0. 7-1. 0 BLEU points.

Learning Semantic Representations Machine Translation +1

Paper
Add Code

Learning deep structured semantic models for web search using clickthrough data

5 code implementations • CIKM 2013 • Po-Sen Huang, Xiaodong He, Jianfeng Gao, Li Deng, Alex Acero, Larry Heck

The proposed deep structured semantic models are discriminatively trained by maximizing the conditional likelihood of the clicked documents given a query using the clickthrough data.

Document Ranking

4,088

Paper
Code