1 code implementation • 27 Nov 2023 • Bar Genossar, Avigdor Gal, Roee Shraga
Entity matching, a core data integration problem, is the task of deciding whether two data tuples refer to the same real-world entity.
1 code implementation • 19 Oct 2023 • Alexander Brinkmann, Roee Shraga, Christian Bizer
Vendors often provide unstructured product descriptions consisting only of an offer title and a textual description.
Ranked #1 on Attribute Value Extraction on AE-110k (F1-score metric)
1 code implementation • 7 Aug 2023 • Koyena Pal, Aamod Khatiwada, Roee Shraga, Renée J. Miller
We thoroughly evaluate recent table union search methods on existing benchmarks and on our new benchmark.
1 code implementation • 23 Jun 2023 • Alexander Brinkmann, Roee Shraga, Reng Chiz Der, Christian Bizer
Hence, extracting attribute/value pairs from textual product descriptions is an essential enabler for e-commerce applications.
1 code implementation • 6 Mar 2023 • Alexander Brinkmann, Roee Shraga, Christian Bizer
To reduce these runtimes, entity resolution pipelines consist of two parts: a blocker that applies a computationally cheap method to select candidate record pairs, and a matcher that then identifies the matching pairs within this candidate set using more expensive methods.
Ranked #1 on Blocking on Amazon-Google
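The two-stage blocker/matcher pipeline described in the abstract above can be illustrated with a minimal sketch. This is not the method from the listed paper: the token-overlap blocker, the `difflib`-based matcher, the function names `block` and `match`, the threshold of 0.8, and the sample records are all assumptions chosen for illustration.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations


def block(records):
    """Cheap blocker (illustrative): pair records that share at least one token."""
    index = defaultdict(set)
    for rid, text in records.items():
        for token in set(text.lower().split()):
            index[token].add(rid)
    candidates = set()
    for ids in index.values():
        # Every pair of records sharing this token becomes a candidate pair.
        candidates.update(combinations(sorted(ids), 2))
    return candidates


def match(records, candidates, threshold=0.8):
    """More expensive matcher (illustrative): character-level similarity on candidates only."""
    matches = set()
    for a, b in candidates:
        score = SequenceMatcher(None, records[a].lower(), records[b].lower()).ratio()
        if score >= threshold:
            matches.add((a, b))
    return matches


# Hypothetical product offers, keyed by record id.
records = {
    1: "Apple iPhone 13 128GB",
    2: "apple iphone 13 128 gb",
    3: "Samsung Galaxy S21",
    4: "Dell XPS 13 laptop",
}

candidates = block(records)            # subset of all possible pairs
matches = match(records, candidates)   # pairs judged to be the same entity
```

With these sample records, the blocker keeps three of the six possible pairs (those sharing the token `13` or `apple`), and the matcher is run only on those three. Real blockers and matchers are far more sophisticated; the point is only that the expensive comparison never sees pairs the cheap step has already discarded.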
1 code implementation • 23 Aug 2022 • Bar Genossar, Roee Shraga, Avigdor Gal
In what follows, we introduce the problem of multiple intents entity resolution (MIER), an extension to the universal (single intent) entity resolution task.
no code implementations • 6 May 2022 • Roee Shraga
This work offers a novel view on the use of human input as labels, acknowledging that humans may err.
no code implementations • 27 Apr 2022 • Avigdor Gal, Roee Shraga
Given the availability of data and the improvement of machine learning techniques, this blog discusses the respective roles of humans and machines in achieving cognitive tasks in matching, aiming to determine whether traditional roles of humans and machines are subject to change.
no code implementations • 26 Apr 2022 • Roee Shraga, Gil Katz, Yael Badian, Nitay Calderon, Avigdor Gal
In this paper we propose a design methodology, using active learning to enhance learning capabilities, for building a model of production outcome using a constrained amount of raw material training data.
1 code implementation • 15 Sep 2021 • Roee Shraga, Avigdor Gal
Schema matching is a core task of any data integration process.
no code implementations • 3 Dec 2020 • Diego Calvanese, Avigdor Gal, Davide Lanti, Marco Montali, Alessandro Mosca, Roee Shraga
Virtual Knowledge Graphs (VKGs) constitute one of the most promising paradigms for integrating and accessing legacy data sources.
1 code implementation • 2 Dec 2020 • Roee Shraga, Ofra Amir, Avigdor Gal
Matching is a task at the heart of any data integration process, aimed at identifying correspondences among data elements.