Search Results for author: Hongyu Lin

Found 55 papers, 32 papers with code

ISCAS at SemEval-2022 Task 10: An Extraction-Validation Pipeline for Structured Sentiment Analysis

1 code implementation • SemEval (NAACL) 2022 • Xinyu Lu, Mengjie Ren, Yaojie Lu, Hongyu Lin

ISCAS participated in both sub-tasks in SemEval-2022 Task 10: Structured Sentiment competition.

CATAMARAN: A Cross-lingual Long Text Abstractive Summarization Dataset

no code implementations • LREC 2022 • Zheng Chen, Hongyu Lin

Cross-lingual summarization, which produces the summary in one language from a given source document in another language, could be extremely helpful for humans to obtain information across the world.

Abstractive Text Summarization Cross-Lingual Abstractive Summarization

Spiral of Silences: How is Large Language Model Killing Information Retrieval? -- A Case Study on Open Domain Question Answering

1 code implementation • 16 Apr 2024 • Xiaoyang Chen, Ben He, Hongyu Lin, Xianpei Han, Tianshu Wang, Boxi Cao, Le Sun, Yingfei Sun

The practice of Retrieval-Augmented Generation (RAG), which integrates Large Language Models (LLMs) with retrieval systems, has become increasingly prevalent.

Information Retrieval Language Modelling +3

Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation

2 code implementations • 10 Apr 2024 • Ruotong Pan, Boxi Cao, Hongyu Lin, Xianpei Han, Jia Zheng, Sirui Wang, Xunliang Cai, Le Sun

In this paper, we propose Credibility-aware Generation (CAG), a universally applicable framework designed to mitigate the impact of flawed information in RAG.

Retrieval

Few-shot Named Entity Recognition via Superposition Concept Discrimination

1 code implementation • 25 Mar 2024 • Jiawei Chen, Hongyu Lin, Xianpei Han, Yaojie Lu, Shanshan Jiang, Bin Dong, Le Sun

Then a superposition instance retriever is applied to retrieve corresponding instances of these superposition concepts from a large-scale text corpus.

Active Learning few-shot-ner +4

Meta-Cognitive Analysis: Evaluating Declarative and Procedural Knowledge in Datasets and Large Language Models

1 code implementation • 14 Mar 2024 • Zhuoqun Li, Hongyu Lin, Yaojie Lu, Hao Xiang, Xianpei Han, Le Sun

Declarative knowledge and procedural knowledge are two key parts of meta-cognitive theory, and both are important in the pre-training and inference of LLMs.

Academically intelligent LLMs are not necessarily socially intelligent

1 code implementation • 11 Mar 2024 • Ruoxi Xu, Hongyu Lin, Xianpei Han, Le Sun, Yingfei Sun

The academic intelligence of large language models (LLMs) has made remarkable progress in recent times, but their social intelligence performance remains unclear.

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

no code implementations • 6 Mar 2024 • Xin Men, Mingyu Xu, Qingyu Zhang, Bingning Wang, Hongyu Lin, Yaojie Lu, Xianpei Han, WeiPeng Chen

As Large Language Models (LLMs) continue to advance in performance, their size has escalated significantly, with current LLMs containing billions or even trillions of parameters.

Quantization

Learning or Self-aligning? Rethinking Instruction Fine-tuning

no code implementations • 28 Feb 2024 • Mengjie Ren, Boxi Cao, Hongyu Lin, Cao Liu, Xianpei Han, Ke Zeng, Guanglu Wan, Xunliang Cai, Le Sun

Instruction Fine-tuning (IFT) is a critical phase in building large language models (LLMs).

World Knowledge

SoFA: Shielded On-the-fly Alignment via Priority Rule Following

no code implementations • 27 Feb 2024 • Xinyu Lu, Bowen Yu, Yaojie Lu, Hongyu Lin, Haiyang Yu, Le Sun, Xianpei Han, Yongbin Li

The alignment problem in Large Language Models (LLMs) involves adapting them to the broad spectrum of human values.

Self-Retrieval: Building an Information Retrieval System with One Large Language Model

no code implementations • 23 Feb 2024 • Qiaoyu Tang, Jiawei Chen, Bowen Yu, Yaojie Lu, Cheng Fu, Haiyang Yu, Hongyu Lin, Fei Huang, Ben He, Xianpei Han, Le Sun, Yongbin Li

The rise of large language models (LLMs) has transformed the role of information retrieval (IR) systems in the way humans access information.

Information Retrieval Language Modelling +2

Executing Natural Language-Described Algorithms with Large Language Models: An Investigation

1 code implementation • 23 Feb 2024 • Xin Zheng, Qiming Zhu, Hongyu Lin, Yaojie Lu, Xianpei Han, Le Sun

In this paper, we seek to examine the capacity of present-day LLMs to comprehend and execute algorithms outlined in natural language.

Natural Language Understanding

Rule or Story, Which is a Better Commonsense Expression for Talking with Large Language Models?

no code implementations • 22 Feb 2024 • Ning Bian, Xianpei Han, Hongyu Lin, Yaojie Lu, Ben He, Le Sun

Building machines with commonsense has been a longstanding challenge in NLP due to the reporting bias of commonsense rules and the exposure bias of rule-based commonsense reasoning.

AI for social science and social science of AI: A Survey

no code implementations • 22 Jan 2024 • Ruoxi Xu, Yingfei Sun, Mengjie Ren, Shiguang Guo, Ruotong Pan, Hongyu Lin, Le Sun, Xianpei Han

Recent advancements in artificial intelligence, particularly with the emergence of large language models (LLMs), have sparked a rethinking of artificial general intelligence possibilities.

DBCopilot: Scaling Natural Language Querying to Massive Databases

1 code implementation • 6 Dec 2023 • Tianshu Wang, Hongyu Lin, Xianpei Han, Le Sun, Xiaoyang Chen, Hao Wang, Zhenyu Zeng

Text-to-SQL simplifies database interactions by enabling non-experts to convert their natural language (NL) questions into Structured Query Language (SQL) queries.

Navigate Question Generation +2

Mitigating Large Language Model Hallucinations via Autonomous Knowledge Graph-based Retrofitting

no code implementations • 22 Nov 2023 • Xinyan Guan, Yanjiang Liu, Hongyu Lin, Yaojie Lu, Ben He, Xianpei Han, Le Sun

Incorporating factual knowledge from knowledge graphs is regarded as a promising approach for mitigating the hallucination of large language models (LLMs).

Hallucination Language Modelling +1

Toward Unified Controllable Text Generation via Regular Expression Instruction

1 code implementation • 19 Sep 2023 • Xin Zheng, Hongyu Lin, Xianpei Han, Le Sun

Controllable text generation is a fundamental aspect of natural language generation, with numerous methods proposed for different constraint types.

In-Context Learning Text Generation

Benchmarking Large Language Models in Retrieval-Augmented Generation

1 code implementation • 4 Sep 2023 • Jiawei Chen, Hongyu Lin, Xianpei Han, Le Sun

In this paper, we systematically investigate the impact of Retrieval-Augmented Generation on large language models.

Benchmarking counterfactual +2

ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases

1 code implementation • 8 Jun 2023 • Qiaoyu Tang, Ziliang Deng, Hongyu Lin, Xianpei Han, Qiao Liang, Boxi Cao, Le Sun

Existing approaches to tool learning have either primarily relied on extremely large language models, such as GPT-4, to attain generalized tool-use abilities in a zero-shot manner, or utilized supervised learning to train limited scopes of tools on compact models.

Learning In-context Learning for Named Entity Recognition

2 code implementations • 18 May 2023 • Jiawei Chen, Yaojie Lu, Hongyu Lin, Jie Lou, Wei Jia, Dai Dai, Hua Wu, Boxi Cao, Xianpei Han, Le Sun

A new entity extractor can be implicitly constructed by applying new instruction and demonstrations to PLMs.

few-shot-ner Few-shot NER +4

Retentive or Forgetful? Diving into the Knowledge Memorizing Mechanism of Language Models

no code implementations • 16 May 2023 • Boxi Cao, Qiaoyu Tang, Hongyu Lin, Shanshan Jiang, Bin Dong, Xianpei Han, Jiawei Chen, Tianshu Wang, Le Sun

Memory is one of the most essential cognitive functions serving as a repository of world knowledge and episodes of activities.

World Knowledge

DLUE: Benchmarking Document Language Understanding

no code implementations • 16 May 2023 • Ruoxi Xu, Hongyu Lin, Xinyan Guan, Xianpei Han, Yingfei Sun, Le Sun

Understanding documents is central to many real-world tasks but remains a challenging topic.

Benchmarking Document Classification +1

Harvesting Event Schemas from Large Language Models

1 code implementation • 12 May 2023 • Jialong Tang, Hongyu Lin, Zhuoqun Li, Yaojie Lu, Xianpei Han, Le Sun

An event schema provides a conceptual, structural and formal language to represent events and model world event knowledge.

Influence of External Information on Large Language Models Mirrors Social Cognitive Patterns

no code implementations • 8 May 2023 • Ning Bian, Hongyu Lin, Peilin Liu, Yaojie Lu, Chunkang Zhang, Ben He, Xianpei Han, Le Sun

LLMs, as AI agents, can observe external information, which shapes their cognition and behaviors.

ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models

no code implementations • 29 Mar 2023 • Ning Bian, Xianpei Han, Le Sun, Hongyu Lin, Yaojie Lu, Ben He, Shanshan Jiang, Bin Dong

Can ChatGPT effectively leverage commonsense for answering questions?

Instruction Following

The Life Cycle of Knowledge in Big Language Models: A Survey

1 code implementation • 14 Mar 2023 • Boxi Cao, Hongyu Lin, Xianpei Han, Le Sun

Knowledge plays a critical role in artificial intelligence.

Universal Information Extraction as Unified Semantic Matching

no code implementations • 9 Jan 2023 • Jie Lou, Yaojie Lu, Dai Dai, Wei Jia, Hongyu Lin, Xianpei Han, Le Sun, Hua Wu

Based on this paradigm, we propose to universally model various IE tasks with the Unified Semantic Matching (USM) framework, which introduces three unified token linking operations to model the abilities of structuring and conceptualizing.

Bridging the Gap between Reality and Ideality of Entity Matching: A Revisiting and Benchmark Re-Construction

no code implementations • 12 May 2022 • Tianshu Wang, Hongyu Lin, Cheng Fu, Xianpei Han, Le Sun, Feiyu Xiong, Hui Chen, Minlong Lu, Xiuwen Zhu

Experimental results demonstrate that the assumptions made in the previous benchmark construction process do not hold in the open environment; they conceal the main challenges of the task and therefore lead to a significant overestimate of the current progress of entity matching.

Entity Resolution

Unified Structure Generation for Universal Information Extraction

1 code implementation • ACL 2022 • Yaojie Lu, Qing Liu, Dai Dai, Xinyan Xiao, Hongyu Lin, Xianpei Han, Le Sun, Hua Wu

Information extraction suffers from its varying targets, heterogeneous structures, and demand-specific schemas.

Ranked #4 on Aspect-Based Sentiment Analysis (ABSA) on ASTE (using extra training data)

Aspect-Based Sentiment Analysis (ABSA) UIE

Pre-training to Match for Unified Low-shot Relation Extraction

1 code implementation • ACL 2022 • Fangchao Liu, Hongyu Lin, Xianpei Han, Boxi Cao, Le Sun

Low-shot relation extraction (RE) aims to recognize novel relations with very few or even no samples, which is critical in real-world applications.

Meta-Learning Relation +1

ECO v1: Towards Event-Centric Opinion Mining

no code implementations • Findings (ACL) 2022 • Ruoxi Xu, Hongyu Lin, Meng Liao, Xianpei Han, Jin Xu, Wei Tan, Yingfei Sun, Le Sun

Events are considered the fundamental building blocks of the world.

Decision Making Opinion Mining

Can Prompt Probe Pretrained Language Models? Understanding the Invisible Risks from a Causal View

1 code implementation • ACL 2022 • Boxi Cao, Hongyu Lin, Xianpei Han, Fangchao Liu, Le Sun

Prompt-based probing has been widely used in evaluating the abilities of pretrained language models (PLMs).

Few-shot Named Entity Recognition with Self-describing Networks

1 code implementation • ACL 2022 • Jiawei Chen, Qing Liu, Hongyu Lin, Xianpei Han, Le Sun

In this paper, we propose a self-describing mechanism for few-shot NER, which can effectively leverage illustrative instances and precisely transfer knowledge from external resources by describing both entity types and mentions using a universal concept set.

Few-shot NER Named Entity Recognition

Procedural Text Understanding via Scene-Wise Evolution

no code implementations • 15 Mar 2022 • Jialong Tang, Hongyu Lin, Meng Liao, Yaojie Lu, Xianpei Han, Le Sun, Weijian Xie, Jin Xu

In this paper, we propose a new scene-wise paradigm for procedural text understanding, which jointly tracks states of all entities in a scene-by-scene manner.

Procedural Text Understanding

Fine-grained Entity Typing via Label Reasoning

no code implementations • EMNLP 2021 • Qing Liu, Hongyu Lin, Xinyan Xiao, Xianpei Han, Le Sun, Hua Wu

Conventional entity typing approaches are based on independent classification paradigms, which makes it difficult for them to recognize inter-dependent, long-tailed and fine-grained entity types.

Ranked #8 on Entity Typing on Open Entity

Attribute Entity Typing

Honey or Poison? Solving the Trigger Curse in Few-shot Event Detection via Causal Intervention

1 code implementation • EMNLP 2021 • Jiawei Chen, Hongyu Lin, Xianpei Han, Le Sun

In this paper, we identify and solve the trigger curse problem in few-shot event detection (FSED) from a causal view.

Event Detection

Bridging the Gap between Language Model and Reading Comprehension: Unsupervised MRC via Self-Supervision

no code implementations • 19 Jul 2021 • Ning Bian, Xianpei Han, Bo Chen, Hongyu Lin, Ben He, Le Sun

In this paper, we propose a new framework for unsupervised MRC.

Language Modelling Machine Reading Comprehension +4

Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases

1 code implementation • ACL 2021 • Boxi Cao, Hongyu Lin, Xianpei Han, Le Sun, Lingyong Yan, Meng Liao, Tong Xue, Jin Xu

Previous literature shows that pre-trained masked language models (MLMs) such as BERT can achieve competitive factual knowledge extraction performance on some datasets, indicating that MLMs can potentially be a reliable knowledge source.

Element Intervention for Open Relation Extraction

no code implementations • ACL 2021 • Fangchao Liu, Lingyong Yan, Hongyu Lin, Xianpei Han, Le Sun

Open relation extraction aims to cluster relation instances referring to the same underlying relation, which is a critical step for general relation extraction.

Relation Relation Extraction

Denoising Distantly Supervised Named Entity Recognition via a Hypergeometric Probabilistic Model

1 code implementation • 17 Jun 2021 • Wenkai Zhang, Hongyu Lin, Xianpei Han, Le Sun, Huidan Liu, Zhicheng Wei, Nicholas Jing Yuan

Specifically, during neural network training, we naturally model the noise samples in each batch following a hypergeometric distribution parameterized by the noise-rate.

Denoising named-entity-recognition +2

De-biasing Distantly Supervised Named Entity Recognition via Causal Intervention

1 code implementation • ACL 2021 • Wenkai Zhang, Hongyu Lin, Xianpei Han, Le Sun

Distant supervision tackles the data bottleneck in NER by automatically generating training instances via dictionary matching.

named-entity-recognition Named Entity Recognition +1

Text2Event: Controllable Sequence-to-Structure Generation for End-to-end Event Extraction

1 code implementation • ACL 2021 • Yaojie Lu, Hongyu Lin, Jin Xu, Xianpei Han, Jialong Tang, Annan Li, Le Sun, Meng Liao, Shaoyi Chen

Event extraction is challenging due to the complex structure of event records and the semantic gap between text and event.

Ranked #3 on Event Extraction on ACE2005

Event Extraction Transfer Learning

From Discourse to Narrative: Knowledge Projection for Event Relation Extraction

1 code implementation • ACL 2021 • Jialong Tang, Hongyu Lin, Meng Liao, Yaojie Lu, Xianpei Han, Le Sun, Weijian Xie, Jin Xu

Current event-centric knowledge graphs rely heavily on explicit connectives to mine relations between events.

Event Relation Extraction Knowledge Graphs +2

Syntactic and Semantic-driven Learning for Open Information Extraction

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Jialong Tang, Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun, Xinyan Xiao, Hua Wu

One of the biggest bottlenecks in building accurate, high coverage neural open IE systems is the need for large labelled corpora.

Open Information Extraction

ISCAS at SemEval-2020 Task 5: Pre-trained Transformers for Counterfactual Statement Modeling

1 code implementation • SEMEVAL 2020 • Yaojie Lu, Annan Li, Hongyu Lin, Xianpei Han, Le Sun

ISCAS participated in two subtasks of SemEval 2020 Task 5: detecting counterfactual statements and detecting antecedent and consequence.

counterfactual Question Answering

End-to-End Neural Event Coreference Resolution

1 code implementation • 17 Sep 2020 • Yaojie Lu, Hongyu Lin, Jialong Tang, Xianpei Han, Le Sun

Traditional event coreference systems usually rely on a pipeline framework and hand-crafted features, which often suffer from error propagation and generalize poorly.

coreference-resolution Event Coreference Resolution +1

A Rigorous Study on Named Entity Recognition: Can Fine-tuning Pretrained Model Lead to the Promised Land?

no code implementations • EMNLP 2020 • Hongyu Lin, Yaojie Lu, Jialong Tang, Xianpei Han, Le Sun, Zhicheng Wei, Nicholas Jing Yuan

Specifically, we erase name regularity, mention coverage and context diversity respectively from the benchmarks, in order to explore their impact on the generalization ability of models.

named-entity-recognition Named Entity Recognition +1

Gazetteer-Enhanced Attentive Neural Networks for Named Entity Recognition

no code implementations • IJCNLP 2019 • Hongyu Lin, Yaojie Lu, Xianpei Han, Le Sun, Bin Dong, Shanshan Jiang

Current region-based NER models rely only on fully-annotated training data to learn an effective region encoder, and therefore often face a training data bottleneck.

named-entity-recognition Named Entity Recognition +1

Distilling Discrimination and Generalization Knowledge for Event Detection via Delta-Representation Learning

1 code implementation • ACL 2019 • Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun

Event detection systems rely on discrimination knowledge to distinguish ambiguous trigger words and generalization knowledge to detect unseen/sparse trigger words.

Event Detection Representation Learning

Cost-sensitive Regularization for Label Confusion-aware Event Detection

1 code implementation • ACL 2019 • Hongyu Lin, Yaojie Lu, Xianpei Han, Le Sun

In supervised event detection, most of the mislabeling occurs between a small number of confusing type pairs, including trigger-NIL pairs and sibling sub-types of the same coarse type.

Event Detection Vocal Bursts Type Prediction

Sequence-to-Nuggets: Nested Entity Mention Detection via Anchor-Region Networks

1 code implementation • ACL 2019 • Hongyu Lin, Yaojie Lu, Xianpei Han, Le Sun

In this paper, we propose to resolve this problem by modeling and leveraging the head-driven phrase structures of entity mentions, i.e., although a mention can nest other mentions, they will not share the same head word.

Ranked #7 on Nested Mention Recognition on ACE 2005

NER Nested Mention Recognition +1

Adaptive Scaling for Sparse Detection in Information Extraction

1 code implementation • ACL 2018 • Hongyu Lin, Yaojie Lu, Xianpei Han, Le Sun

This paper focuses on detection tasks in information extraction, where positive instances are sparsely distributed and models are usually evaluated using F-measure on positive classes.

Nugget Proposal Networks for Chinese Event Detection

1 code implementation • ACL 2018 • Hongyu Lin, Yaojie Lu, Xianpei Han, Le Sun

Neural network based models commonly regard event detection as a word-wise classification task, which suffers from the mismatch problem between words and event triggers, especially in languages without natural word delimiters such as Chinese.

Event Detection General Classification

Reasoning with Heterogeneous Knowledge for Commonsense Machine Comprehension

no code implementations • EMNLP 2017 • Hongyu Lin, Le Sun, Xianpei Han

Then we propose a multi-knowledge reasoning model, which selects inference rules for a specific reasoning context using an attention mechanism, and reasons by summarizing all valid inference rules.

Natural Language Understanding Reading Comprehension +1

A Context-Aware Topic Model for Statistical Machine Translation

no code implementations • IJCNLP 2015 • Jinsong Su, Deyi Xiong, Yang Liu, Xianpei Han, Hongyu Lin, Junfeng Yao, Min Zhang

Machine Translation Translation
