Search Results for author: Dianbo Sui

Found 22 papers, 12 papers with code

Set Generation Networks for End-to-End Knowledge Base Population

no code implementations EMNLP 2021 Dianbo Sui, Chenhao Wang, Yubo Chen, Kang Liu, Jun Zhao, Wei Bi

In this paper, we formulate end-to-end KBP as a direct set generation problem, avoiding considering the order of multiple facts.

Decoder Knowledge Base Population +2

FedED: Federated Learning via Ensemble Distillation for Medical Relation Extraction

no code implementations EMNLP 2020 Dianbo Sui, Yubo Chen, Jun Zhao, Yantao Jia, Yuantao Xie, Weijian Sun

In this paper, we propose a privacy-preserving medical relation extraction model based on federated learning, which enables training a central model with no single piece of private local data being shared or exchanged.

Federated Learning Knowledge Distillation +4

CASIA at SemEval-2022 Task 11: Chinese Named Entity Recognition for Complex and Ambiguous Entities

no code implementations SemEval (NAACL) 2022 Jia Fu, Zhen Gan, Zhucong Li, Sirui Li, Dianbo Sui, Yubo Chen, Kang Liu, Jun Zhao

This paper describes our approach to develop a complex named entity recognition system in SemEval 2022 Task 11: MultiCoNER Multilingual Complex Named Entity Recognition, Track 9 - Chinese.

Chinese Named Entity Recognition Data Augmentation +3

TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs

1 code implementation14 Oct 2024 Haochuan Wang, Xiachong Feng, Lei LI, Zhanyue Qin, Dianbo Sui, Lingpeng Kong

The rapid advancement of large language models (LLMs) has accelerated their application in reasoning, with strategic reasoning drawing increasing attention.

Synthetic Data Generation

Mitigating Gender Bias in Code Large Language Models via Model Editing

no code implementations10 Oct 2024 Zhanyue Qin, Haochuan Wang, Zecheng Wang, Deyuan Liu, Cunhang Fan, Zhao Lv, Zhiying Tu, Dianhui Chu, Dianbo Sui

At the same time, the experimental results show that, considering both the gender bias of the model and its general code generation capability, MG-Editing is most effective when applied at the row and neuron levels of granularity.

Code Generation knowledge editing +2

Plug-and-Play Performance Estimation for LLM Services without Relying on Labeled Data

1 code implementation10 Oct 2024 Can Wang, Dianbo Sui, Hongliang Sun, Hao Ding, Bolin Zhang, Zhiying Tu

This paper introduces a novel method to estimate the performance of LLM services across different tasks and contexts, which can be "plug-and-play" utilizing only a few unlabeled samples like ICL.

In-Context Learning Language Modeling +2

HBot: A Chatbot for Healthcare Applications in Traditional Chinese Medicine Based on Human Body 3D Visualization

no code implementations1 Aug 2024 Bolin Zhang, Zhiwei Yi, Jiahao Wang, Dianbo Sui, Zhiying Tu, Dianhui Chu

However, concepts such as acupuncture points (acupoints) and meridians involved in TCM always appear in the consultation, which cannot be displayed intuitively.

Chatbot

To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models

1 code implementation2 Jul 2024 Bozhong Tian, Xiaozhuan Liang, Siyuan Cheng, Qingbin Liu, Mengru Wang, Dianbo Sui, Xi Chen, Huajun Chen, Ningyu Zhang

Large Language Models (LLMs) trained on extensive corpora inevitably retain sensitive data, such as personal privacy information and copyrighted material.

General Knowledge

UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models

no code implementations24 Jun 2024 Zhanyue Qin, Haochuan Wang, Deyuan Liu, Ziyang Song, Cunhang Fan, Zhao Lv, Jinlin Wu, Zhen Lei, Zhiying Tu, Dianhui Chu, Xiaoyan Yu, Dianbo Sui

In order to answer this question, we propose the UNO Arena based on the card game UNO to evaluate the sequential decision-making capability of LLMs and explain in detail why we choose UNO.

Decision Making Sequential Decision Making

VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

1 code implementation22 May 2024 Yongxin Guo, Jingyu Liu, Mingda Li, Dingxin Cheng, Xiaoying Tang, Dianbo Sui, Qingbin Liu, Xi Chen, Kevin Zhao

Video Temporal Grounding (VTG) strives to accurately pinpoint event timestamps in a specific video using linguistic queries, significantly impacting downstream tasks like video browsing and editing.

Dense Video Captioning Highlight Detection +2

Checkpoint Merging via Bayesian Optimization in LLM Pretraining

no code implementations28 Mar 2024 Deyuan Liu, Zecheng Wang, Bingning Wang, WeiPeng Chen, Chunshan Li, Zhiying Tu, Dianhui Chu, Bo Li, Dianbo Sui

The rapid proliferation of large language models (LLMs) such as GPT-4 and Gemini underscores the intense demand for resources during their training processes, posing significant challenges due to substantial computational and environmental costs.

Bayesian Optimization

Woodpecker: Hallucination Correction for Multimodal Large Language Models

1 code implementation24 Oct 2023 Shukang Yin, Chaoyou Fu, Sirui Zhao, Tong Xu, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun, Enhong Chen

Hallucination is a big shadow hanging over the rapidly evolving Multimodal Large Language Models (MLLMs), referring to the phenomenon that the generated text is inconsistent with the image content.

Hallucination

CogIE: An Information Extraction Toolkit for Bridging Texts and CogNet

1 code implementation ACL 2021 Zhuoran Jin, Yubo Chen, Dianbo Sui, Chenhao Wang, Zhipeng Xue, Jun Zhao

CogNet is a knowledge base that integrates three types of knowledge: linguistic knowledge, world knowledge and commonsense knowledge.

Entity Linking Entity Typing +7

Document-level Event Extraction via Parallel Prediction Networks

2 code implementations ACL 2021 Hang Yang, Dianbo Sui, Yubo Chen, Kang Liu, Jun Zhao, Taifeng Wang

We argue that sentence-level extractors are ill-suited to the DEE task where event arguments always scatter across sentences and multiple events may co-exist in a document.

Decoder Document-level Event Extraction +2

A Large-Scale Chinese Multimodal NER Dataset with Speech Clues

1 code implementation ACL 2021 Dianbo Sui, Zhengkun Tian, Yubo Chen, Kang Liu, Jun Zhao

In this paper, we aim to explore an uncharted territory, which is Chinese multimodal named entity recognition (NER) with both textual and acoustic contents.

named-entity-recognition Named Entity Recognition +1

Graph-Based Knowledge Integration for Question Answering over Dialogue

no code implementations COLING 2020 Jian Liu, Dianbo Sui, Kang Liu, Jun Zhao

Despite many advances, existing approaches for this task did not consider dialogue structure and background knowledge (e. g., relationships between speakers).

Machine Reading Comprehension Question Answering +1

Joint Entity and Relation Extraction with Set Prediction Networks

1 code implementation3 Nov 2020 Dianbo Sui, Yubo Chen, Kang Liu, Jun Zhao, Xiangrong Zeng, Shengping Liu

Compared with cross-entropy loss that highly penalizes small shifts in triple order, the proposed bipartite matching loss is invariant to any permutation of predictions; thus, it can provide the proposed networks with a more accurate training signal by ignoring triple order and focusing on relation types and entities.

Joint Entity and Relation Extraction Relation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.