Search Results for author: Yubo Chen

Found 70 papers, 18 papers with code

CASIA at SemEval-2022 Task 11: Chinese Named Entity Recognition for Complex and Ambiguous Entities

no code implementations SemEval (NAACL) 2022 Jia Fu, Zhen Gan, Zhucong Li, Sirui Li, Dianbo Sui, Yubo Chen, Kang Liu, Jun Zhao

This paper describes our approach to develop a complex named entity recognition system in SemEval 2022 Task 11: MultiCoNER Multilingual Complex Named Entity Recognition, Track 9 - Chinese.

Chinese Named Entity Recognition Data Augmentation +3

CroAno : A Crowd Annotation Platform for Improving Label Consistency of Chinese NER Dataset

no code implementations EMNLP (ACL) 2021 Baoli Zhang, Zhucong Li, Zhen Gan, Yubo Chen, Jing Wan, Kang Liu, Jun Zhao, Shengping Liu, Yafei Shi

2) Inconsistency Detector: CroAno employs a detector to locate corpus-level label inconsistency and provides users an interface to correct inconsistent entities in batches.

Chinese Named Entity Recognition Management +3

Augmentation, Retrieval, Generation: Event Sequence Prediction with a Three-Stage Sequence-to-Sequence Approach

no code implementations COLING 2022 Bo Zhou, Chenhao Wang, Yubo Chen, Kang Liu, Jun Zhao, Jiexin Xu, XiaoJian Jiang, Qiuxia Li

Currently existing approach models this task as a statistical induction problem, to predict a sequence of events by exploring the similarity between the given goal and the known sequences of events.

Retrieval

Generating Temporally-ordered Event Sequences via Event Optimal Transport

no code implementations COLING 2022 Bo Zhou, Yubo Chen, Kang Liu, Jun Zhao, Jiexin Xu, XiaoJian Jiang, Qiuxia Li

The other issue is that the model adopts a word-level objective to model events in texts, failing to evaluate the predicted results of the model from the perspective of event sequence.

Uncertain Local-to-Global Networks for Document-Level Event Factuality Identification

1 code implementation EMNLP 2021 Pengfei Cao, Yubo Chen, Yuqing Yang, Kang Liu, Jun Zhao

Moreover, we propose an Uncertain Information Aggregation module to leverage the global structure for integrating the local information.

Sentence

Incremental Event Detection via Knowledge Consolidation Networks

no code implementations EMNLP 2020 Pengfei Cao, Yubo Chen, Jun Zhao, Taifeng Wang

However, existing incremental learning methods cannot handle semantic ambiguity and training data imbalance problems between old and new classes in the task of incremental event detection.

Event Detection Incremental Learning

Learning Reasoning Patterns for Relational Triple Extraction with Mutual Generation of Text and Graph

no code implementations Findings (ACL) 2022 Yubo Chen, Yunqi Zhang, Yongfeng Huang

To capture the relation type inference logic of the paths, we propose to understand the unlabeled conceptual expressions by reconstructing the sentence from the relational graph (graph-to-text generation) in a self-supervised manner.

Graph Generation Knowledge Graphs +3

FedED: Federated Learning via Ensemble Distillation for Medical Relation Extraction

no code implementations EMNLP 2020 Dianbo Sui, Yubo Chen, Jun Zhao, Yantao Jia, Yuantao Xie, Weijian Sun

In this paper, we propose a privacy-preserving medical relation extraction model based on federated learning, which enables training a central model with no single piece of private local data being shared or exchanged.

Federated Learning Knowledge Distillation +4

Event Extraction as Machine Reading Comprehension

no code implementations EMNLP 2020 Jian Liu, Yubo Chen, Kang Liu, Wei Bi, Xiaojiang Liu

ii) Our model is excelled in the data-scarce scenario, for example, obtaining 49. 8{\%} in F1 for event argument extraction with only 1{\%} data, compared with 2. 2{\%} of the previous method.

Event Argument Extraction Event Extraction +5

Continual Few-shot Event Detection via Hierarchical Augmentation Networks

1 code implementation26 Mar 2024 Chenlong Zhang, Pengfei Cao, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun, Jun Zhao

The CFED task is challenging as it involves memorizing previous event types and learning new event types with few-shot samples.

Event Detection

SimuCourt: Building Judicial Decision-Making Agents with Real-world Judgement Documents

1 code implementation5 Mar 2024 Zhitao He, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Jiexin Xu, Huaijun Li, XiaoJian Jiang, Kang Liu, Jun Zhao

In this paper, (1) we introduce SimuCourt, a judicial benchmark that encompasses 420 judgment documents from real-world, spanning the three most common types of judicial cases, and a novel task Judicial Decision-Making to evaluate the judicial analysis and decision-making power of agents.

Decision Making Information Retrieval

Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning

no code implementations28 Feb 2024 Jiachun Li, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao

Large language models exhibit high-level commonsense reasoning abilities, especially with enhancement methods like Chain-of-Thought (CoT).

Position

Cutting Off the Head Ends the Conflict: A Mechanism for Interpreting and Mitigating Knowledge Conflicts in Language Models

no code implementations28 Feb 2024 Zhuoran Jin, Pengfei Cao, Hongbang Yuan, Yubo Chen, Jiexin Xu, Huaijun Li, XiaoJian Jiang, Kang Liu, Jun Zhao

Moreover, we reveal that the pivotal point at which knowledge conflicts emerge in LMs is the integration of inconsistent information flows by memory heads and context heads.

Tug-of-War Between Knowledge: Exploring and Resolving Knowledge Conflicts in Retrieval-Augmented Language Models

no code implementations22 Feb 2024 Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, XiaoJian Jiang, Jiexin Xu, Qiuxia Li, Jun Zhao

Then, we investigate the behavior and preference of RALMs from the following two perspectives: (1) Conflicts between internal memory and external sources: We find that stronger RALMs emerge with the Dunning-Kruger effect, persistently favoring their faulty internal memory even when correct evidence is provided.

Retrieval

The Da Vinci Code of Large Pre-trained Language Models: Deciphering Degenerate Knowledge Neurons

no code implementations21 Feb 2024 YuHeng Chen, Pengfei Cao, Yubo Chen, Yining Wang, Shengping Liu, Kang Liu, Jun Zhao

This paper provides a comprehensive definition of DKNs that covers both structural and functional aspects, pioneering the study of structures in PLMs' factual knowledge storage units.

WilKE: Wise-Layer Knowledge Editor for Lifelong Knowledge Editing

no code implementations16 Feb 2024 Chenhui Hu, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao

Knowledge editing aims to rectify inaccuracies in large language models (LLMs) without costly retraining for outdated or erroneous knowledge.

knowledge editing

Enhancing Large Language Models with Pseudo- and Multisource- Knowledge Graphs for Open-ended Question Answering

no code implementations15 Feb 2024 Jiaxiang Liu, Tong Zhou, Yubo Chen, Kang Liu, Jun Zhao

In summary, our results pave the way for enhancing LLMs by incorporating Pseudo- and Multisource-KGs, particularly in the context of open-ended questions.

Graph Generation Knowledge Graphs +1

Oasis: Data Curation and Assessment System for Pretraining of Large Language Models

1 code implementation21 Nov 2023 Tong Zhou, Yubo Chen, Pengfei Cao, Kang Liu, Jun Zhao, Shengping Liu

To this end, we present a pretraining corpus curation and assessment platform called Oasis -- a one-stop system for data quality improvement and quantification with user-friendly interactive interfaces.

Language Modelling Large Language Model

ZhuJiu: A Multi-dimensional, Multi-faceted Chinese Benchmark for Large Language Models

no code implementations28 Aug 2023 Baoli Zhang, Haining Xie, Pengfan Du, JunHao Chen, Pengfei Cao, Yubo Chen, Shengping Liu, Kang Liu, Jun Zhao

To this end, we propose the ZhuJiu benchmark, which has the following strengths: (1) Multi-dimensional ability coverage: We comprehensively evaluate LLMs across 7 ability dimensions covering 51 tasks.

Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Neurons

1 code implementation25 Aug 2023 YuHeng Chen, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao

We design cross-lingual knowledge editing experiments, demonstrating that the PLMs can accomplish this task based on language-independent neurons; (2) The discovery of Degenerate Knowledge Neurons, a novel type of neuron showing that different knowledge neurons can store the same fact.

Fact Checking knowledge editing

Learning Through Guidance: Knowledge Distillation for Endoscopic Image Classification

no code implementations17 Aug 2023 Harshala Gammulle, Yubo Chen, Sridha Sridharan, Travis Klein, Clinton Fookes

However, there is a lack of focus on developing lightweight models which can run in low-resource environments, which are typically encountered in medical clinics.

Classification Feature Engineering +3

DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks

no code implementations15 Jun 2023 Caixin Kang, Yinpeng Dong, Zhengyi Wang, Shouwei Ruan, Yubo Chen, Hang Su, Xingxing Wei

In this paper, we propose DIFFender, a novel defense method that leverages a text-guided diffusion model to defend against adversarial patches.

Adversarial Defense Face Recognition +1

Programming Language Agnostic Mining of Code and Language Pairs with Sequence Labeling Based Question Answering

no code implementations21 Mar 2022 Changran Hu, Akshara Reddi Methukupalli, Yutong Zhou, Chen Wu, Yubo Chen

In particular, we propose to apply the BIO tagging scheme instead of the conventional binary scheme to mine the code solutions which are often composed of multiple blocks of a post.

Question Answering

Contrastive Vision-Language Pre-training with Limited Resources

1 code implementation17 Dec 2021 Quan Cui, Boyan Zhou, Yu Guo, Weidong Yin, Hao Wu, Osamu Yoshie, Yubo Chen

However, these works require a tremendous amount of data and computational resources (e. g., billion-level web data and hundreds of GPUs), which prevent researchers with limited resources from reproduction and further exploration.

Contrastive Learning

Document-level Event Extraction via Parallel Prediction Networks

2 code implementations ACL 2021 Hang Yang, Dianbo Sui, Yubo Chen, Kang Liu, Jun Zhao, Taifeng Wang

We argue that sentence-level extractors are ill-suited to the DEE task where event arguments always scatter across sentences and multiple events may co-exist in a document.

Document-level Event Extraction Event Extraction +1

Knowledge-Enriched Event Causality Identification via Latent Structure Induction Networks

no code implementations ACL 2021 Pengfei Cao, Xinyu Zuo, Yubo Chen, Kang Liu, Jun Zhao, Yuguang Chen, Weihua Peng

Specifically, to make use of the descriptive knowledge, we devise a Descriptive Graph Induction module to obtain and encode the graph-structured descriptive knowledge.

Descriptive Event Causality Identification

CogIE: An Information Extraction Toolkit for Bridging Texts and CogNet

1 code implementation ACL 2021 Zhuoran Jin, Yubo Chen, Dianbo Sui, Chenhao Wang, Zhipeng Xue, Jun Zhao

CogNet is a knowledge base that integrates three types of knowledge: linguistic knowledge, world knowledge and commonsense knowledge.

Entity Linking Entity Typing +7

A Large-Scale Chinese Multimodal NER Dataset with Speech Clues

1 code implementation ACL 2021 Dianbo Sui, Zhengkun Tian, Yubo Chen, Kang Liu, Jun Zhao

In this paper, we aim to explore an uncharted territory, which is Chinese multimodal named entity recognition (NER) with both textual and acoustic contents.

named-entity-recognition Named Entity Recognition +1

LearnDA: Learnable Knowledge-Guided Data Augmentation for Event Causality Identification

no code implementations ACL 2021 Xinyu Zuo, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Weihua Peng, Yuguang Chen

On the other hand, our approach employs a dual mechanism, which is a learnable augmentation framework and can interactively adjust the generation process to generate task-related sentences.

Data Augmentation Event Causality Identification

Jointly Extracting Explicit and Implicit Relational Triples with Reasoning Pattern Enhanced Binary Pointer Network

no code implementations NAACL 2021 Yubo Chen, Yunqi Zhang, Changran Hu, Yongfeng Huang

To explore entity pairs that may be implicitly connected by relations, we propose a binary pointer network to extract overlapping relational triples relevant to each word sequentially and retain the information of previously extracted triples in an external memory.

graph construction Implicit Relations +5

CogNet: Bridging Linguistic Knowledge, World Knowledge and Commonsense Knowledge

no code implementations3 Mar 2021 Chenhao Wang, Yubo Chen, Zhipeng Xue, Yang Zhou, Jun Zhao

In this paper, we present CogNet, a knowledge base (KB) dedicated to integrating three types of knowledge: (1) linguistic knowledge from FrameNet, which schematically describes situations, objects and events.

World Knowledge

Joint Entity and Relation Extraction with Set Prediction Networks

1 code implementation3 Nov 2020 Dianbo Sui, Yubo Chen, Kang Liu, Jun Zhao, Xiangrong Zeng, Shengping Liu

Compared with cross-entropy loss that highly penalizes small shifts in triple order, the proposed bipartite matching loss is invariant to any permutation of predictions; thus, it can provide the proposed networks with a more accurate training signal by ignoring triple order and focusing on relation types and entities.

Joint Entity and Relation Extraction Relation +1

KnowDis: Knowledge Enhanced Data Augmentation for Event Causality Detection via Distant Supervision

no code implementations COLING 2020 Xinyu Zuo, Yubo Chen, Kang Liu, Jun Zhao

Modern models of event causality detection (ECD) are mainly based on supervised learning from small hand-labeled corpora.

Data Augmentation

Event Coreference Resolution via a Multi-loss Neural Network without Using Argument Information

no code implementations22 Sep 2020 Xinyu Zuo, Yubo Chen, Kang Liu, Jun Zhao

Event coreference resolution(ECR) is an important task in Natural Language Processing (NLP) and nearly all the existing approaches to this task rely on event argument information.

coreference-resolution Event Argument Extraction +1

Towards Causal Explanation Detection with Pyramid Salient-Aware Network

no code implementations CCL 2020 Xinyu Zuo, Yubo Chen, Kang Liu, Jun Zhao

PSAN can assist in causal explanation detection via capturing the salient semantics of discourses contained in their keywords with a bottom graph-based word-level salient network.

THU\_NGN at SemEval-2019 Task 12: Toponym Detection and Disambiguation on Scientific Papers

no code implementations SEMEVAL 2019 Tao Qi, Suyu Ge, Chuhan Wu, Yubo Chen, Yongfeng Huang

First name: Tao Last name: Qi Email: taoqi. qt@gmail. com Affiliation: Department of Electronic Engineering, Tsinghua University First name: Suyu Last name: Ge Email: gesy17@mails. tsinghua. edu. cn Affiliation: Department of Electronic Engineering, Tsinghua University First name: Chuhan Last name: Wu Email: wuch15@mails. tsinghua. edu. cn Affiliation: Department of Electronic Engineering, Tsinghua University First name: Yubo Last name: Chen Email: chen-yb18@mails. tsinghua. edu. cn Affiliation: Department of Electronic Engineering, Tsinghua University First name: Yongfeng Last name: Huang Email: yfhuang@mail. tsinghua. edu. cn Affiliation: Department of Electronic Engineering, Tsinghua University Toponym resolution is an important and challenging task in the neural language processing field, and has wide applications such as emergency response and social media geographical event analysis.

POS Toponym Resolution +1

Collective Event Detection via a Hierarchical and Bias Tagging Networks with Gated Multi-level Attention Mechanisms

1 code implementation EMNLP 2018 Yubo Chen, Hang Yang, Kang Liu, Jun Zhao, Yantao Jia

Traditional approaches to the task of ACE event detection primarily regard multiple events in one sentence as independent ones and recognize them separately by using sentence-level information.

Event Detection Sentence

Event Detection via Gated Multilingual Attention Mechanism

no code implementations AAAI-18 2018 Jian Liu, Yubo Chen, Kang Liu, Jun Zhao

In specific, to alleviate data scarcity problem, we exploit the consistent information in multilingual data via context attention mechanism.

Event Detection

Neural Metaphor Detecting with CNN-LSTM Model

no code implementations WS 2018 Chuhan Wu, Fangzhao Wu, Yubo Chen, Sixing Wu, Zhigang Yuan, Yongfeng Huang

In addition, we compare the performance of the softmax classifier and conditional random field (CRF) for sequential labeling in this task.

Machine Translation POS +1

Cannot find the paper you are looking for? You can Submit a new open access paper.