Search Results for author: Yunbo Cao

Found 54 papers, 23 papers with code

Pre-training and Fine-tuning Neural Topic Model: A Simple yet Effective Approach to Incorporating External Knowledge

no code implementations ACL 2022 Linhai Zhang, Xuemeng Hu, Boyu Wang, Deyu Zhou, Qian-Wen Zhang, Yunbo Cao

Recent years have witnessed growing interests in incorporating external knowledge such as pre-trained word embeddings (PWEs) or pre-trained language models (PLMs) into neural topic modeling.

Topic Models Word Embeddings

IEKM: A Model Incorporating External Keyword Matrices

no code implementations21 Nov 2023 Cheng Luo, Qin Li, Zhao Yan, Mengliang Rao, Yunbo Cao

In this paper, we propose an incorporation external keywords matrices model (IEKM) to address these challenges.

Semantic Similarity Semantic Textual Similarity +2

Making Large Language Models Better Reasoners with Alignment

no code implementations5 Sep 2023 Peiyi Wang, Lei LI, Liang Chen, Feifan Song, Binghuai Lin, Yunbo Cao, Tianyu Liu, Zhifang Sui

To address this problem, we introduce an \textit{Alignment Fine-Tuning (AFT)} paradigm, which involves three steps: 1) fine-tuning LLMs with COT training data; 2) generating multiple COT responses for each question, and categorizing them into positive and negative ones based on whether they achieve the correct answer; 3) calibrating the scores of positive and negative responses given by LLMs with a novel constraint alignment loss.

Unifying Token and Span Level Supervisions for Few-Shot Sequence Labeling

1 code implementation16 Jul 2023 Zifeng Cheng, Qingyu Zhou, Zhiwei Jiang, Xuemin Zhao, Yunbo Cao, Qing Gu

However, these methods are only trained at a single granularity (i. e., either token level or span level) and have some weaknesses of the corresponding granularity.

Metric Learning

SkillNet-X: A Multilingual Multitask Model with Sparsely Activated Skills

no code implementations28 Jun 2023 Zhangyin Feng, Yong Dai, Fan Zhang, Duyu Tang, Xiaocheng Feng, Shuangzhi Wu, Bing Qin, Yunbo Cao, Shuming Shi

Traditional multitask learning methods basically can only exploit common knowledge in task- or language-wise, which lose either cross-language or cross-task knowledge.

Natural Language Understanding

Soft Language Clustering for Multilingual Model Pre-training

no code implementations13 Jun 2023 Jiali Zeng, Yufan Jiang, Yongjing Yin, Yi Jing, Fandong Meng, Binghuai Lin, Yunbo Cao, Jie zhou

Multilingual pre-trained language models have demonstrated impressive (zero-shot) cross-lingual transfer abilities, however, their performance is hindered when the target language has distant typology from source languages or when pre-training data is limited in size.

Clustering Question Answering +5

Large Language Models are not Fair Evaluators

2 code implementations29 May 2023 Peiyi Wang, Lei LI, Liang Chen, Zefan Cai, Dawei Zhu, Binghuai Lin, Yunbo Cao, Qi Liu, Tianyu Liu, Zhifang Sui

In this paper, we uncover a systematic bias in the evaluation paradigm of adopting large language models~(LLMs), e. g., GPT-4, as a referee to score and compare the quality of responses generated by candidate models.

Language Modelling Large Language Model +1

DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade

no code implementations24 May 2023 Zefan Cai, Xin Zheng, Tianyu Liu, Xu Wang, Haoran Meng, Jiaqi Han, Gang Yuan, Binghuai Lin, Baobao Chang, Yunbo Cao

In the constant updates of the product dialogue systems, we need to retrain the natural language understanding (NLU) model as new data from the real users would be merged into the existent data accumulated in the last updates.

Intent Detection Multi-Label Classification +1

Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion

1 code implementation24 May 2023 Shaoxiang Wu, Damai Dai, Ziwei Qin, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui

However, unlike other image-text multimodal tasks, video has longer multimodal sequences with more redundancy and noise in both visual and audio modalities.

Denoising Multimodal Sentiment Analysis

Enhancing Continual Relation Extraction via Classifier Decomposition

1 code implementation8 May 2023 Heming Xia, Peiyi Wang, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui

In this work, we point out that there exist two typical biases after training of this vanilla strategy: classifier bias and representation bias, which causes the previous knowledge that the model learned to be shaded.

Continual Relation Extraction Relation

Finding Similar Exercises in Retrieval Manner

no code implementations15 Mar 2023 Tongwen Huang, Xihua Li, Chao Yi, Xuemin Zhao, Yunbo Cao

When students make a mistake in an exercise, they can consolidate it by ``similar exercises'' which have the same concepts, purposes and methods.

Representation Learning Retrieval

TQ-Net: Mixed Contrastive Representation Learning For Heterogeneous Test Questions

no code implementations9 Mar 2023 He Zhu, Xihua Li, Xuemin Zhao, Yunbo Cao, Shan Yu

Finally, supervised contrastive learning was conducted on relevance prediction-related downstream tasks, which helped the model to learn the representation of questions effectively.

Contrastive Learning Representation Learning

DialogQAE: N-to-N Question Answer Pair Extraction from Customer Service Chatlog

no code implementations14 Dec 2022 Xin Zheng, Tianyu Liu, Haoran Meng, Xu Wang, Yufan Jiang, Mengliang Rao, Binghuai Lin, Zhifang Sui, Yunbo Cao

Harvesting question-answer (QA) pairs from customer service chatlog in the wild is an efficient way to enrich the knowledge base for customer service chatbots in the cold start or continuous integration scenarios.

Retrieval

DualNER: A Dual-Teaching framework for Zero-shot Cross-lingual Named Entity Recognition

no code implementations15 Nov 2022 Jiali Zeng, Yufan Jiang, Yongjing Yin, Xu Wang, Binghuai Lin, Yunbo Cao

We present DualNER, a simple and effective framework to make full use of both annotated source language corpus and unlabeled target language text for zero-shot cross-lingual named entity recognition (NER).

named-entity-recognition Named Entity Recognition +1

Contrastive Learning with Prompt-derived Virtual Semantic Prototypes for Unsupervised Sentence Embedding

no code implementations7 Nov 2022 Jiali Zeng, Yongjing Yin, Yufan Jiang, Shuangzhi Wu, Yunbo Cao

Specifically, with the help of prompts, we construct virtual semantic prototypes to each instance, and derive negative prototypes by using the negative form of the prompts.

Clustering Contrastive Learning +5

Instance Segmentation for Chinese Character Stroke Extraction, Datasets and Benchmarks

1 code implementation25 Oct 2022 Lizhao Liu, Kunyang Lin, Shangxin Huang, Zhongli Li, Chao Li, Yunbo Cao, Qingyu Zhou

Moreover, there are no standardized benchmarks to provide a fair comparison between different stroke extraction methods, which, we believe, is a major impediment to the development of Chinese character stroke understanding and related tasks.

Font Generation Instance Segmentation +2

Linguistic Rules-Based Corpus Generation for Native Chinese Grammatical Error Correction

2 code implementations19 Oct 2022 Shirong Ma, Yinghui Li, Rongyi Sun, Qingyu Zhou, Shulin Huang, Ding Zhang, Li Yangning, Ruiyang Liu, Zhongli Li, Yunbo Cao, Haitao Zheng, Ying Shen

Extensive experiments and detailed analyses not only demonstrate that the training data constructed by our method effectively improves the performance of CGEC models, but also reflect that our benchmark is an excellent resource for further development of the CGEC field.

Grammatical Error Correction

Learning Robust Representations for Continual Relation Extraction via Adversarial Class Augmentation

1 code implementation10 Oct 2022 Peiyi Wang, YiFan Song, Tianyu Liu, Binghuai Lin, Yunbo Cao, Sujian Li, Zhifang Sui

In this paper, through empirical studies we argue that this assumption may not hold, and an important reason for catastrophic forgetting is that the learned representations do not have good robustness against the appearance of analogous relations in the subsequent learning process.

Continual Relation Extraction Relation

AiM: Taking Answers in Mind to Correct Chinese Cloze Tests in Educational Applications

1 code implementation COLING 2022 Yusen Zhang, Zhongli Li, Qingyu Zhou, Ziyi Liu, Chao Li, Mina Ma, Yunbo Cao, Hongzhi Liu

To automatically correct handwritten assignments, the traditional approach is to use an OCR model to recognize characters and compare them to answers.

Optical Character Recognition (OCR)

Automatic Context Pattern Generation for Entity Set Expansion

1 code implementation17 Jul 2022 Yinghui Li, Shulin Huang, Xinwei Zhang, Qingyu Zhou, Yangning Li, Ruiyang Liu, Yunbo Cao, Hai-Tao Zheng, Ying Shen

In addition, we propose the GAPA, a novel ESE framework that leverages the aforementioned GenerAted PAtterns to expand target entities.

Information Retrieval Retrieval +1

HPT: Hierarchy-aware Prompt Tuning for Hierarchical Text Classification

1 code implementation28 Apr 2022 Zihan Wang, Peiyi Wang, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui, Houfeng Wang

However, in this paradigm, there exists a huge gap between the classification tasks with sophisticated label hierarchy and the masked language model (MLM) pretraining tasks of PLMs and thus the potentials of PLMs can not be fully tapped.

Language Modelling Multi-Label Classification +2

SmartSales: Sales Script Extraction and Analysis from Sales Chatlog

no code implementations19 Apr 2022 Hua Liang, Tianyu Liu, Peiyi Wang, Mengliang Rao, Yunbo Cao

2) Customer objection response assists the salespeople to figure out the typical customer objections and corresponding winning sales scripts, as well as search for proper sales responses for a certain customer objection.

Management

Type-Driven Multi-Turn Corrections for Grammatical Error Correction

1 code implementation Findings (ACL) 2022 Shaopeng Lai, Qingyu Zhou, Jiali Zeng, Zhongli Li, Chao Li, Yunbo Cao, Jinsong Su

First, they simply mix additionally-constructed training instances and original ones to train models, which fails to help models be explicitly aware of the procedure of gradual corrections.

Data Augmentation Grammatical Error Correction +1

Exploring Student Representation For Neural Cognitive Diagnosis

no code implementations17 Nov 2021 Hengyao Bao, Xihua Li, Xuemin Zhao, Yunbo Cao

In this paper, we propose a method of student representation with the exploration of the hierarchical relations of knowledge concepts and student embedding.

cognitive diagnosis

A non-hierarchical attention network with modality dropout for textual response generation in multimodal dialogue systems

no code implementations19 Oct 2021 Rongyi Sun, Borun Chen, Qingyu Zhou, Yinghui Li, Yunbo Cao, Hai-Tao Zheng

Existing text- and image-based multimodal dialogue systems use the traditional Hierarchical Recurrent Encoder-Decoder (HRED) framework, which has an utterance-level encoder to model utterance representation and a context-level encoder to model context representation.

Response Generation

Hierarchical Curriculum Learning for AMR Parsing

1 code implementation ACL 2022 Peiyi Wang, Liang Chen, Tianyu Liu, Damai Dai, Yunbo Cao, Baobao Chang, Zhifang Sui

Abstract Meaning Representation (AMR) parsing aims to translate sentences to semantic representation with a hierarchical structure, and is recently empowered by pretrained sequence-to-sequence models.

AMR Parsing Representation Learning

An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling

1 code implementation NAACL 2022 Peiyi Wang, Runxin Xu, Tianyu Liu, Qingyu Zhou, Yunbo Cao, Baobao Chang, Zhifang Sui

Few-Shot Sequence Labeling (FSSL) is a canonical paradigm for the tagging models, e. g., named entity recognition and slot filling, to generalize on an emerging, resource-scarce domain.

Few-shot NER Meta-Learning +4

LANA: Towards Personalized Deep Knowledge Tracing Through Distinguishable Interactive Sequences

1 code implementation21 Apr 2021 Yuhao Zhou, Xihua Li, Yunbo Cao, Xuemin Zhao, Qing Ye, Jiancheng Lv

With pivot module reconstructed the decoder for individual students and leveled learning specialized encoders for groups, personalized DKT was achieved.

Knowledge Tracing

Improving BERT with Syntax-aware Local Attention

1 code implementation Findings (ACL) 2021 Zhongli Li, Qingyu Zhou, Chao Li, Ke Xu, Yunbo Cao

Pre-trained Transformer-based neural language models, such as BERT, have achieved remarkable results on varieties of NLP tasks.

Machine Translation Question Answering +3

Dialogue Response Selection with Hierarchical Curriculum Learning

1 code implementation ACL 2021 Yixuan Su, Deng Cai, Qingyu Zhou, Zibo Lin, Simon Baker, Yunbo Cao, Shuming Shi, Nigel Collier, Yan Wang

As for IC, it progressively strengthens the model's ability in identifying the mismatching information between the dialogue context and a response candidate.

Conversational Response Selection

Difference-aware Knowledge Selection for Knowledge-grounded Conversation Generation

1 code implementation Findings of the Association for Computational Linguistics 2020 Chujie Zheng, Yunbo Cao, Daxin Jiang, Minlie Huang

In a multi-turn knowledge-grounded dialog, the difference between the knowledge selected at different turns usually provides potential clues to knowledge selection, which has been largely neglected in previous research.

The SPPD System for Schema Guided Dialogue State Tracking Challenge

no code implementations16 Jun 2020 Miao Li, Haoqi Xiong, Yunbo Cao

This paper introduces one of our group's work on the Dialog System Technology Challenges 8 (DSTC8), the SPPD system for Schema Guided dialogue state tracking challenge.

Dialogue State Tracking Multi-domain Dialogue State Tracking

A Statistical Framework for Product Description Generation

no code implementations IJCNLP 2017 Jinpeng Wang, Yutai Hou, Jing Liu, Yunbo Cao, Chin-Yew Lin

We present in this paper a statistical framework that generates accurate and fluent product description from product attributes.

Attribute Data-to-Text Generation

Cannot find the paper you are looking for? You can Submit a new open access paper.