Search Results for author: Lihu Chen

Found 9 papers, 8 papers with code

Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost

1 code implementation ACL 2022 Lihu Chen, Gael Varoquaux, Fabian Suchanek

State-of-the-art NLP systems represent inputs with word embeddings, but these are brittle when faced with Out-of-Vocabulary (OOV) words. To address this issue, we follow the principle of mimick-like models to generate vectors for unseen words, by learning the behavior of pre-trained embeddings using only the surface form of words. We present a simple contrastive learning framework, LOVE, which extends the word representation of an existing pre-trained language model (such as BERT) and makes it robust to OOV with few additional parameters. Extensive evaluations demonstrate that our lightweight model achieves similar or even better performances than prior competitors, both on original datasets and on corrupted variants.

Contrastive Learning Language Modelling +1

Reconfidencing LLMs from the Grouping Loss Perspective

no code implementations 7 Feb 2024 Lihu Chen, Alexandre Perez-Lebel, Fabian M. Suchanek, Gaël Varoquaux

In this work, we construct a new evaluation dataset derived from a knowledge base to assess confidence scores given to answers of Mistral and LLaMA.

Uncertainty Quantification

Learning High-Quality and General-Purpose Phrase Representations

1 code implementation 18 Jan 2024 Lihu Chen, Gaël Varoquaux, Fabian M. Suchanek

The framework employs phrase type classification as an auxiliary task and incorporates character-level information more effectively into the phrase representation.

Contrastive Learning Data Augmentation +1

The Locality and Symmetry of Positional Encodings

1 code implementation 19 Oct 2023 Lihu Chen, Gaël Varoquaux, Fabian M. Suchanek

Positional Encodings (PEs) are used to inject word-order information into transformer-based language models.

Sentence

YAGO 4.5: A Large and Clean Knowledge Base with a Rich Taxonomy

1 code implementation 23 Aug 2023 Fabian Suchanek, Mehwish Alam, Thomas Bonald, Lihu Chen, Pierre-Henri Paris, Jules Soria

Knowledge Bases (KBs) find applications in many knowledge-intensive tasks and, most notably, in information retrieval.

Information Retrieval Retrieval

Knowledge Base Completion for Long-Tail Entities

1 code implementation 30 Jun 2023 Lihu Chen, Simon Razniewski, Gerhard Weikum

To evaluate our method and various baselines, we introduce a novel dataset, called MALT, rooted in Wikidata.

Knowledge Base Completion Retrieval

GLADIS: A General and Large Acronym Disambiguation Benchmark

1 code implementation 3 Feb 2023 Lihu Chen, Gaël Varoquaux, Fabian M. Suchanek

Acronym Disambiguation (AD) is crucial for natural language understanding on various sources, including biomedical reports, scientific papers, and search engine queries.

Language Modelling Natural Language Understanding

Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost

1 code implementation 15 Mar 2022 Lihu Chen, Gaël Varoquaux, Fabian M. Suchanek

State-of-the-art NLP systems represent inputs with word embeddings, but these are brittle when faced with Out-of-Vocabulary (OOV) words.

Contrastive Learning Language Modelling +1

A Lightweight Neural Model for Biomedical Entity Linking

1 code implementation 16 Dec 2020 Lihu Chen, Gaël Varoquaux, Fabian M. Suchanek

Biomedical entity linking aims to map biomedical mentions, such as diseases and drugs, to standard entities in a given knowledge base.

Entity Linking
