1 code implementation • COLING 2022 • Ziyao Xu, Houfeng Wang, Bingdong Wang
However, previous work only used the biaffine method at the end of the dependency parser as a scorer, and its application in multi-layer form is ignored.
no code implementations • COLING 2022 • Liang Wen, Juan Li, Houfeng Wang, Yingwei Luo, Xiaolin Wang, Xiaodong Zhang, Zhicong Cheng, Dawei Yin
And their experiments show that leveraging the answer summaries helps to attend the essential information in original lengthy answers and improve the answer selection performance under certain circumstances.
1 code implementation • ACL 2022 • Xin Sun, Houfeng Wang
Modern writing assistance applications are always equipped with a Grammatical Error Correction (GEC) model to correct errors in user-entered sentences.
no code implementations • 30 Jun 2023 • Feifan Song, Bowen Yu, Minghao Li, Haiyang Yu, Fei Huang, Yongbin Li, Houfeng Wang
In this paper, we propose Preference Ranking Optimization (PRO) as an alternative to PPO for directly aligning LLMs with the Bradley-Terry comparison.
no code implementations • 2 Mar 2023 • Guangyue Peng, Tao Ge, Si-Qing Chen, Furu Wei, Houfeng Wang
We demonstrate that SeMem improves the scalability of semiparametric LMs for continual learning over streaming data in two ways: (1) data-wise scalability: as the model becomes stronger through continual learning, it will encounter fewer difficult cases that need to be memorized, causing the growth of the non-parametric memory to slow down over time rather than growing at a linear rate with the size of training data; (2) model-wise scalability: SeMem allows a larger model to memorize fewer samples than its smaller counterpart because it is rarer for a larger model to encounter incomprehensible cases, resulting in a non-parametric memory that does not scale linearly with model size.
no code implementations • 7 Oct 2022 • Feifan Song, Lianzhe Huang, Houfeng Wang
Multi-intent Spoken Language Understanding has great potential for widespread implementation.
1 code implementation • 28 Apr 2022 • Zihan Wang, Peiyi Wang, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui, Houfeng Wang
However, in this paradigm, there exists a huge gap between the classification tasks with sophisticated label hierarchy and the masked language model (MLM) pretraining tasks of PLMs and thus the potentials of PLMs can not be fully tapped.
no code implementations • 17 Mar 2022 • Yantao Gong, Cao Liu, Fan Yang, Xunliang Cai, Guanglu Wan, Jiansong Chen, Weipeng Zhang, Houfeng Wang
Experiments on the open datasets verify that our model outperforms the existing calibration methods and achieves a significant improvement on the calibration metric.
1 code implementation • ACL 2022 • Zihan Wang, Peiyi Wang, Lianzhe Huang, Xin Sun, Houfeng Wang
Hierarchical text classification is a challenging subtask of multi-label classification due to its complex label hierarchy.
no code implementations • 24 Feb 2022 • Jing Jin, Houfeng Wang
Machine Reading Comprehension(MRC) has achieved a remarkable result since some powerful models, such as BERT, are proposed.
1 code implementation • 23 Feb 2022 • Lianzhe Huang, Shuming Ma, Dongdong Zhang, Furu Wei, Houfeng Wang
To collocate with the unified prompt, we propose a new initialization method for the target label word to further improve the model's transferability across languages.
no code implementations • 26 Jan 2022 • Xin Sun, Tao Ge, Shuming Ma, Jingjing Li, Furu Wei, Houfeng Wang
Synthetic data construction of Grammatical Error Correction (GEC) for non-English languages relies heavily on human-designed and language-specific rules, which produce limited error-corrected patterns.
no code implementations • 24 Aug 2021 • Yantao Gong, Cao Liu, Jiazhen Yuan, Fan Yang, Xunliang Cai, Guanglu Wan, Jiansong Chen, Ruiyao Niu, Houfeng Wang
To handle this problem, we propose a density-based dynamic curriculum learning model.
no code implementations • 25 Jul 2021 • Linhao Zhang, Houfeng Wang
In this paper, we make the first step towards controllable generation of comments, by building a system that can explicitly control the emotion of the generated comments.
no code implementations • 25 Jul 2021 • Linhao Zhang, Yu Shi, Linjun Shou, Ming Gong, Houfeng Wang, Michael Zeng
In this paper, we attempt to bridge these two lines of research and propose a joint and domain adaptive approach to SLU.
no code implementations • 25 Jul 2021 • Linhao Zhang, Houfeng Wang
Recently, researchers have explored using the encoder-decoder framework to tackle dialogue state tracking (DST), which is a key component of task-oriented dialogue systems.
1 code implementation • ACL 2021 • Xin Sun, Tao Ge, Furu Wei, Houfeng Wang
In this paper, we propose Shallow Aggressive Decoding (SAD) to improve the online inference efficiency of the Transformer for instantaneous Grammatical Error Correction (GEC).
no code implementations • 17 Feb 2021 • Lianzhe Huang, Peiyi Wang, Sujian Li, Tianyu Liu, Xiaodong Zhang, Zhicong Cheng, Dawei Yin, Houfeng Wang
Aspect Sentiment Triplet Extraction (ASTE) aims to extract triplets from a sentence, including target entities, associated sentiment polarities, and opinion spans which rationalize the polarities.
Ranked #8 on
Aspect Sentiment Triplet Extraction
on ASTE-Data-V2
no code implementations • 1 Jan 2021 • Chen Yang, Houfeng Wang
To improve the efficiency of trying different skip connection architectures, we apply the idea of network morphism to add skip connections as a procedure of fine-tuning.
no code implementations • COLING 2020 • Lianzhe Huang, Xin Sun, Sujian Li, Linhao Zhang, Houfeng Wang
In this paper, we exploit syntactic awareness to the model by the graph attention network on the dependency tree structure and external pre-training knowledge by BERT language model, which helps to model the interaction between the context and aspect words better.
2 code implementations • IJCNLP 2019 • Lianzhe Huang, Dehong Ma, Sujian Li, Xiaodong Zhang, Houfeng Wang
Recently, researches have explored the graph neural network (GNN) techniques on text classification, since GNN does well in handling complex structures and preserving global information.
Ranked #2 on
Text Classification
on Ohsumed
no code implementations • ACL 2019 • Dehong Ma, Sujian Li, Fangzhao Wu, Xing Xie, Houfeng Wang
Aspect term extraction (ATE) aims at identifying all aspect terms in a sentence and is usually modeled as a sequence labeling problem.
Ranked #1 on
Term Extraction
on SemEval 2014 Task 4 Laptop
no code implementations • EMNLP 2018 • Wei Wu, Houfeng Wang, Tianyu Liu, Shuming Ma
As a result, the memory consumption can be reduced because the self-attention is performed at the phrase level instead of the sentence level.
no code implementations • EMNLP 2018 • Dehong Ma, Sujian Li, Houfeng Wang
Targeted sentiment analysis (TSA) aims at extracting targets and classifying their sentiment classes.
no code implementations • EMNLP 2018 • Chen Shi, Qi Chen, Lei Sha, Sujian Li, Xu Sun, Houfeng Wang, Lintao Zhang
The lack of labeled data is one of the main challenges when building a task-oriented dialogue system.
1 code implementation • EMNLP 2018 • Fenglin Liu, Xuancheng Ren, Yuanxin Liu, Houfeng Wang, Xu sun
The encode-decoder framework has shown recent success in image captioning.
no code implementations • 16 Aug 2018 • Wei Li, Xuancheng Ren, Damai Dai, Yunfang Wu, Houfeng Wang, Xu sun
In the experiments, we take a real-world sememe knowledge base HowNet and the corresponding descriptions of the words in Baidu Wiki for training and evaluation.
no code implementations • COLING 2018 • Hao Wang, Xiaodong Zhang, Shuming Ma, Xu sun, Houfeng Wang, Mengxiang Wang
Then the system measures the relevance between each question and candidate table cells, and choose the most related cell as the source of answer.
1 code implementation • ACL 2018 • Wei Wu, Xu sun, Houfeng Wang
Answer selection is an important subtask of community question answering (CQA).
1 code implementation • COLING 2018 • Pengcheng Yang, Xu sun, Wei Li, Shuming Ma, Wei Wu, Houfeng Wang
Further analysis of experimental results demonstrates that the proposed methods not only capture the correlations between labels, but also select the most informative words automatically when predicting different labels.
1 code implementation • ACL 2018 • Jingjing Xu, Xu sun, Qi Zeng, Xuancheng Ren, Xiaodong Zhang, Houfeng Wang, Wenjie Li
We evaluate our approach on two review datasets, Yelp and Amazon.
Ranked #6 on
Unsupervised Text Style Transfer
on Yelp
1 code implementation • ACL 2018 • Shuming Ma, Xu sun, Junyang Lin, Houfeng Wang
In this work, we supervise the learning of the representation of the source content with that of the summary.
no code implementations • IJCNLP 2017 • Yizhong Wang, Sujian Li, Jingfeng Yang, Xu sun, Houfeng Wang
Identifying implicit discourse relations between text spans is a challenging task because it requires understanding the meaning of the text.
General Classification
Implicit Discourse Relation Classification
+2
no code implementations • 25 Nov 2017 • Xu Sun, Weiwei Sun, Shuming Ma, Xuancheng Ren, Yi Zhang, Wenjie Li, Houfeng Wang
The decoding of the complex structure model is regularized by the additionally trained simple structure model.
3 code implementations • 17 Nov 2017 • Xu Sun, Xuancheng Ren, Shuming Ma, Bingzhen Wei, Wei Li, Jingjing Xu, Houfeng Wang, Yi Zhang
Based on the sparsified gradients, we further simplify the model by eliminating the rows or columns that are seldom updated, which will reduce the computational cost both in the training and decoding, and potentially accelerate decoding in real-world applications.
no code implementations • IJCNLP 2017 • Dehong Ma, Sujian Li, Xiaodong Zhang, Houfeng Wang, Xu sun
Document-level sentiment classification aims to assign the user reviews a sentiment polarity.
Ranked #5 on
Sentiment Analysis
on User and product information
no code implementations • IJCNLP 2017 • Shen Huang, Xu sun, Houfeng Wang
Boundary features are widely used in traditional Chinese Word Segmentation (CWS) methods as they can utilize unlabeled data to help improve the Out-of-Vocabulary (OOV) word recognition performance.
5 code implementations • 4 Sep 2017 • Dehong Ma, Sujian Li, Xiaodong Zhang, Houfeng Wang
In this paper, we argue that both targets and contexts deserve special treatment and need to be learned their own representations via interactive learning.
no code implementations • EMNLP 2017 • Qing Zhang, Houfeng Wang
For the task of relation extraction, distant supervision is an efficient approach to generate labeled data by aligning knowledge base with free texts.
no code implementations • EMNLP 2017 • Liang Wang, Sujian Li, Yajuan Lv, Houfeng Wang
Topic segmentation plays an important role for discourse parsing and information retrieval.
1 code implementation • ACL 2017 • Yizhong Wang, Sujian Li, Houfeng Wang
Previous work introduced transition-based algorithms to form a unified architecture of parsing rhetorical structures (including span, nuclearity and relation), but did not achieve satisfactory performance.
Ranked #5 on
Discourse Parsing
on RST-DT
2 code implementations • ICML 2017 • Xu Sun, Xuancheng Ren, Shuming Ma, Houfeng Wang
In back propagation, only a small subset of the full gradient is computed to update the model parameters.
1 code implementation • ACL 2017 • Shuming Ma, Xu sun, Jingjing Xu, Houfeng Wang, Wenjie Li, Qi Su
In this work, our goal is to improve semantic relevance between source texts and summaries for Chinese social media summarization.
no code implementations • WS 2016 • Shen Huang, Houfeng Wang
Grammatical Error Diagnosis for Chinese has always been a challenge for both foreign learners and NLP researchers, for the variousity of grammar and the flexibility of expression.
no code implementations • 5 Apr 2016 • Li Li, Houfeng Wang
To the best of our knowledge, we are the first to tackle the imbalance problem in multi-label classification with many labels.
no code implementations • IJCNLP 2015 • Yang Liu, Furu Wei, Sujian Li, Heng Ji, Ming Zhou, Houfeng Wang
Previous research on relation classification has verified the effectiveness of using dependency shortest paths or subtrees.
Ranked #5 on
Relation Classification
on SemEval 2010 Task 8