no code implementations • 7 Aug 2020 • Tian Lan, Xian-Ling Mao, Wei Wei, He-Yan Huang
Thus, in this paper, we systematically evaluate nearly all representative hierarchical and non-hierarchical models under the same experimental settings to determine which kind performs better.
4 code implementations • NAACL 2021 • Zewen Chi, Li Dong, Furu Wei, Nan Yang, Saksham Singhal, Wenhui Wang, Xia Song, Xian-Ling Mao, He-Yan Huang, Ming Zhou
In this work, we present an information-theoretic framework that formulates cross-lingual language model pre-training as maximizing mutual information between multilingual-multi-granularity texts.
Ranked #16 on Zero-Shot Cross-Lingual Transfer on XTREME
no code implementations • 3 Jul 2020 • Heng-Da Xu, Xian-Ling Mao, Zewen Chi, Jing-Jing Zhu, Fanshu Sun, He-Yan Huang
Specifically, KW-Seq2Seq first uses a keywords decoder to predict some topic keywords, and then generates the final response guided by them.
1 code implementation • EMNLP (NLP-COVID19) 2020 • Yong Hu, He-Yan Huang, Anfan Chen, Xian-Ling Mao
Therefore, in this paper, we release Weibo-COV, a first large-scale COVID-19 social media dataset from Weibo, covering more than 30 million tweets from 1 November 2019 to 30 April 2020.
Social and Information Networks
1 code implementation • 8 May 2020 • Puhai Yang, He-Yan Huang, Xian-Ling Mao
As a key component in a dialogue system, dialogue state tracking plays an important role.
1 code implementation • 6 Apr 2020 • Tian Lan, Xian-Ling Mao, Wei Wei, Xiaoyan Gao, He-Yan Huang
Extensive experiments demonstrate that learning-based metrics are the most effective evaluation metrics for open-domain generative dialogue systems.
no code implementations • 4 Apr 2020 • Xiao Liu, He-Yan Huang, Yue Zhang, Changsen Yuan
Thanks to the use of attention over news events, our model is also more explainable.
no code implementations • 4 Mar 2020 • Hongzheng Li, He-Yan Huang
Back translation (BT) has been widely used and has become one of the standard techniques for data augmentation in Neural Machine Translation (NMT). BT has proven effective at improving translation performance, especially in low-resource scenarios.
no code implementations • 20 Dec 2019 • Tian Lan, Xian-Ling Mao, He-Yan Huang, Wei Wei
Intuitively, a dialogue model that can control the timing of talking autonomously based on the conversation context can chat with humans more naturally.
no code implementations • 30 Nov 2019 • Xuewen Shi, He-Yan Huang, Shuyang Zhao, Ping Jian, Yi-Kun Tang
In this paper, we transform tag recommendation into a word-based text generation problem and introduce a sequence-to-sequence model.
1 code implementation • 29 Nov 2019 • Xuewen Shi, He-Yan Huang, Ping Jian, Yuhang Guo, Xiaochi Wei, Yi-Kun Tang
In this paper, we cast Chinese word segmentation (CWS) as a sequence translation problem and propose a novel sequence-to-sequence CWS model with an attention-based encoder-decoder framework.
no code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Zewen Chi, Li Dong, Furu Wei, Xian-Ling Mao, He-Yan Huang
Multilingual pretrained language models (such as multilingual BERT) have achieved impressive results for cross-lingual transfer.
no code implementations • 8 Nov 2019 • Tan Yan, He-Yan Huang, Xian-Ling Mao
We introduce a new scientific named entity recognizer called SEPT, which stands for Span Extractor with Pre-trained Transformers.
no code implementations • CONLL 2019 • Xuewen Shi, He-Yan Huang, Wenguan Wang, Ping Jian, Yi-Kun Tang
To alleviate this problem, we propose an NMT approach that improves translation adequacy by transferring the semantic knowledge learned from bilingual sentence alignment.
1 code implementation • 23 Sep 2019 • Zewen Chi, Li Dong, Furu Wei, Wenhui Wang, Xian-Ling Mao, He-Yan Huang
In this work we focus on transferring supervision signals of natural language generation (NLG) tasks between multiple languages.
no code implementations • 17 Sep 2019 • Tian Lan, Xian-Ling Mao, He-Yan Huang
As far as we know, existing task-oriented dialogue systems obtain the dialogue policy through classification, which can assign to a dialogue action either a single dialogue act with its corresponding parameters or multiple dialogue acts without their corresponding parameters.
1 code implementation • 25 Aug 2019 • Yong Hu, He-Yan Huang, Tian Lan, Xiaochi Wei, Yuxiang Nie, Jiarui Qi, Liner Yang, Xian-Ling Mao
Second language acquisition (SLA) modeling aims to predict whether second language learners can correctly answer questions based on what they have learned.
1 code implementation • 13 Aug 2019 • Zewen Chi, He-Yan Huang, Heng-Da Xu, Houjin Yu, Wanxuan Yin, Xian-Ling Mao
Recognizing table structures in PDF files has also attracted considerable attention.
no code implementations • 12 Aug 2019 • Jia-Nan Guo, Xian-Ling Mao, Xiao-Jian Jiang, Ying-Xiang Sun, Wei Wei, He-Yan Huang
Network embedding is a promising way of network representation, facilitating many signed social network processing and analysis tasks such as link prediction and node classification.
no code implementations • 29 Jul 2019 • Rong-Cheng Tu, Xian-Ling Mao, Bing Ma, Yong Hu, Tan Yan, Wei Wei, He-Yan Huang
Specifically, by an iterative optimization algorithm, DCHUC jointly learns unified hash codes for image-text pairs in a database and a pair of hash functions for unseen query image-text pairs.
1 code implementation • ACL 2019 • Xiao Liu, He-Yan Huang, Yue Zhang
We consider open domain event extraction, the task of extracting unconstrained types of events from news clusters.
no code implementations • 19 May 2019 • Bowen Xing, Lejian Liao, Dandan Song, Jingang Wang, Fuzheng Zhang, Zhongyuan Wang, He-Yan Huang
This paper proposes a novel variant of LSTM, termed aspect-aware LSTM (AA-LSTM), which incorporates aspect information into LSTM cells in the context modeling stage before the attention mechanism.
no code implementations • 22 Dec 2018 • Changsen Yuan, He-Yan Huang, Chong Feng, Xiao Liu, Xiaochi Wei
Distant supervision for relation extraction is an efficient way to reduce labor costs and has been widely used to discover novel relational facts in large corpora; the task can be formulated as a multi-instance multi-label problem.
no code implementations • EMNLP 2018 • Ge Shi, Chong Feng, Lifu Huang, Boliang Zhang, Heng Ji, Lejian Liao, He-Yan Huang
Relation extraction suffers a dramatic performance drop when a model is trained on one genre and applied directly to a new genre, due to the distinct feature distributions.
3 code implementations • EMNLP 2018 • Xiao Liu, Zhunchen Luo, He-Yan Huang
Event extraction is of practical utility in natural language processing.
1 code implementation • COLING 2018 • Qian Liu, He-Yan Huang, Yang Gao, Xiaochi Wei, Yuxin Tian, Luyang Liu
In this paper, we propose a task-oriented word embedding method and apply it to the text classification task.
Ranked #21 on Text Classification on AG News
no code implementations • SEMEVAL 2018 • Zewen Chi, He-Yan Huang, Jiangui Chen, Hao Wu, Ran Wei
This paper presents a method for Affect in Tweets, the task of automatically determining the intensity of emotions and of sentiment in tweets.
no code implementations • SEMEVAL 2017 • Hao Wu, He-Yan Huang, Ping Jian, Yuhang Guo, Chao Su
This paper presents three systems for semantic textual similarity (STS) evaluation in the SemEval-2017 STS task.
no code implementations • SEMEVAL 2017 • Fanqing Meng, Wenpeng Lu, Yuteng Zhang, Ping Jian, Shumin Shi, He-Yan Huang
Our runs mainly rely on word embeddings and a knowledge-based method.
no code implementations • 7 Apr 2017 • Yi-Kun Tang, Xian-Ling Mao, He-Yan Huang, Guihua Wen
Recently, topic modeling has been widely used to discover the abstract topics in text corpora.
no code implementations • 7 Apr 2017 • Dan Wang, He-Yan Huang, Chi Lu, Bo-Si Feng, Liqiang Nie, Guihua Wen, Xian-Ling Mao
Specifically, we define a novel similarity formula for hierarchical labeled data by weighting each layer, and design a deep convolutional neural network to obtain a hash code for each data point.
no code implementations • COLING 2016 • Xian-Ling Mao, Yi-Jing Hao, Qiang Zhou, Wen-Qing Yuan, Liner Yang, He-Yan Huang
Recently, topic modeling has been widely applied in data mining due to its powerful ability.
no code implementations • 7 Sep 2016 • Hang Yang, Ming Zhu, Zhongbo Zhang, He-Yan Huang
In the denoising step, the guided filter is used with the two obtained images for efficient edge-preserving filtering.