Entity Resolution

49 papers with code • 10 benchmarks • 11 datasets

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Surveys on entity resolution:

The task of entity resolution is closely related to the task of entity alignment which focuses on matching entities between knowledge bases. The task of entity linking differs from entity resolution as entity linking focuses on identifying entity mentions in free text.

Benchmarks

Add a Result

These leaderboards are used to track progress in Entity Resolution

Dataset	Best Model	Compare
Amazon-Google	gpt4-0613_fewshot-10	See all
Abt-Buy	gpt4-0613_zeroshot	See all
WDC Computers-small	BERT	See all
WDC Computers-xlarge	RoBERTa-SupCon	See all
WDC Products-80%cc-seen-medium	gpt4-0613_zeroshot	See all
WDC Watches-small	HG	See all
WDC Products-50%cc-unseen-medium	RoBERTa-base	See all
WDC Watches-xlarge	JointBERT	See all
MusicBrainz20K	ALMSER-GB	See all
WDC Products-80%cc-seen-medium-multi	RoBERTa-SupCon	See all

Libraries

Use these libraries to find Entity Resolution models and implementations

megagonlabs/rotom

2 papers

Datasets

Subtasks

Blocking

Latest papers

Most implemented Social Latest No code

How to Evaluate Entity Resolution Systems: An Entity-Centric Framework with Application to Inventor Name Disambiguation

faceonlive/ai-research • 8 Apr 2024

These benchmark data sets can then be used for model training and a variety of evaluation tasks.

131

08 Apr 2024

Paper
Code

Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration

fmh1art/batcher • • 7 Dec 2023

However, existing ICL approaches to ER typically necessitate providing a task description and a set of demonstrations for each entity pair and thus have limitations on the monetary cost of interfacing LLMs.

07 Dec 2023

Paper
Code

Entity Matching using Large Language Models

wbsg-uni-mannheim/matchgpt • 17 Oct 2023

We show that for use cases that do not allow data to be shared with third parties, open-source LLMs can be a viable alternative to hosted LLMs given that a small amount of training data or matching knowledge...

17 Oct 2023

Paper
Code

A Critical Re-evaluation of Benchmark Datasets for (Deep) Learning-Based Matching Algorithms

gpapadis/dlmatchers • • 3 Jul 2023

Entity resolution (ER) is the process of identifying records that refer to the same entities within one or across multiple databases.

03 Jul 2023

Paper
Code

Using ChatGPT for Entity Matching

wbsg-uni-mannheim/matchgpt • 5 May 2023

Always using the same set of 10 handpicked demonstrations leads to an improvement of 4. 92% over the zero-shot performance.

05 May 2023

Paper
Code

Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration

ruc-datalab/Unicorn • • SIGMOD/PODS 2023

The widely used practice is to build task-specific or even dataset-specific solutions, which are hard to generalize and disable the opportunities of knowledge sharing that can be learned from different datasets and multiple tasks.

01 May 2023

Paper
Code

Pre-trained Embeddings for Entity Resolution: An Experimental Analysis [Experiment, Analysis & Benchmark]

alexZeakis/Embeddings4ER • • 24 Apr 2023

This is applied to both main steps of ER, i. e., blocking and matching.

24 Apr 2023

Paper
Code

SC-Block: Supervised Contrastive Blocking within Entity Resolution Pipelines

wbsg-uni-mannheim/sc-block • • 6 Mar 2023

To reduce these runtimes, entity resolution pipelines are constructed of two parts: a blocker that applies a computationally cheap method to select candidate record pairs, and a matcher that afterwards identifies matching pairs from this set using more expensive methods.

06 Mar 2023

Paper
Code

WDC Products: A Multi-Dimensional Entity Matching Benchmark

wbsg-uni-mannheim/wdcproducts • • 23 Jan 2023

It also shows that for entity matching contrastive learning is more training data efficient compared to cross-encoders.

23 Jan 2023

Paper
Code

PIZZA: A new benchmark for complex end-to-end task-oriented parsing

amazon-science/pizza-semantic-parsing-dataset • 1 Dec 2022

Much recent work in task-oriented parsing has focused on finding a middle ground between flat slots and intents, which are inexpressive but easy to annotate, and powerful representations such as the lambda calculus, which are expressive but costly to annotate.

01 Dec 2022

Paper
Code

Entity Resolution

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result