Entity Resolution

49 papers with code • 10 benchmarks • 11 datasets

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Surveys on entity resolution:

The task of entity resolution is closely related to the task of entity alignment which focuses on matching entities between knowledge bases. The task of entity linking differs from entity resolution as entity linking focuses on identifying entity mentions in free text.

Libraries

Use these libraries to find Entity Resolution models and implementations

Subtasks


Latest papers with no code

Methods for Matching English Language Addresses

no code yet • 14 Mar 2024

Addresses occupy a niche location within the landscape of textual data, due to the positional importance carried by every word, and the geographical scope it refers to.

Neural Locality Sensitive Hashing for Entity Blocking

no code yet • 31 Jan 2024

We assess the effectiveness of this approach within the context of the entity resolution problem, which frequently involves the use of task-specific metrics in real-world applications.

Spatial Entity Resolution between Restaurant Locations and Transportation Destinations in Southeast Asia

no code yet • 16 Jan 2024

As a tech company, Grab has expanded from transportation to food delivery, aiming to serve Southeast Asia with hyperlocalized applications.

On Leveraging Large Language Models for Enhancing Entity Resolution

no code yet • 7 Jan 2024

Entity resolution, the task of identifying and consolidating records that pertain to the same real-world entity, plays a pivotal role in various sectors such as e-commerce, healthcare, and law enforcement.

Cost-Efficient Prompt Engineering for Unsupervised Entity Resolution

no code yet • 9 Oct 2023

We use an extensive set of experimental results to show that an LLM like GPT3. 5 is viable for high-performing unsupervised ER, and interestingly, that more complicated and detailed (and hence, expensive) prompting methods do not necessarily outperform simpler approaches.

Graph Representation Learning Towards Patents Network Analysis

no code yet • 25 Sep 2023

Patent analysis has recently been recognized as a powerful technique for large companies worldwide to lend them insight into the age of competition among various industries.

Labeling without Seeing? Blind Annotation for Privacy-Preserving Entity Resolution

no code yet • 7 Aug 2023

We propose a novel blind annotation protocol based on homomorphic encryption that allows domain oracles to collaboratively label ground truths without sharing data in plaintext with other parties.

Revisiting Prompt Engineering via Declarative Crowdsourcing

no code yet • 7 Aug 2023

Large language models (LLMs) are incredibly powerful at comprehending and generating data in the form of text, but are brittle and error-prone.

Named Entity Resolution in Personal Knowledge Graphs

no code yet • 22 Jul 2023

We begin with a formal definition of the problem, and the components necessary for doing high-quality and efficient ER.

Record Deduplication for Entity Distribution Modeling in ASR Transcripts

no code yet • 9 Jun 2023

Voice digital assistants must keep up with trending search queries.