TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Noun Phrase Canonicalization	Base Dataset	Full ML	Macro Precision	99.4	# 1
Noun Phrase Canonicalization	Base Dataset	Full ML	Micro Precision	100	# 1
Noun Phrase Canonicalization	Base Dataset	Full ML	Pairwise Precision	100	# 1
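The three metrics in the table can be illustrated with a minimal sketch. It assumes the common clustering definitions: macro precision as the fraction of clusters whose members all map to one gold entity, micro precision as cluster purity, and pairwise precision as the fraction of same-cluster pairs that share a gold entity. The function names and the toy gold mapping are hypothetical, and the values are fractions rather than the percentages reported above.

```python
from collections import Counter
from itertools import combinations

def macro_precision(clusters, gold):
    # Fraction of clusters in which every member maps to the same gold entity.
    pure = sum(1 for c in clusters if len({gold[m] for m in c}) == 1)
    return pure / len(clusters)

def micro_precision(clusters, gold):
    # Purity: members that agree with their cluster's majority entity,
    # divided by the total number of members.
    total = sum(len(c) for c in clusters)
    majority = sum(Counter(gold[m] for m in c).most_common(1)[0][1] for c in clusters)
    return majority / total

def pairwise_precision(clusters, gold):
    # Fraction of same-cluster pairs whose members share a gold entity.
    hits = pairs = 0
    for c in clusters:
        for a, b in combinations(c, 2):
            pairs += 1
            hits += gold[a] == gold[b]
    return hits / pairs if pairs else 1.0

# Toy data (hypothetical): the second cluster wrongly contains "Hawaii".
clusters = [["Barack Obama", "Obama"], ["Honolulu", "Hawaii", "Honolulu, HI"]]
gold = {"Barack Obama": "e1", "Obama": "e1",
        "Honolulu": "e2", "Hawaii": "e3", "Honolulu, HI": "e2"}

print(macro_precision(clusters, gold))     # 0.5
print(micro_precision(clusters, gold))     # 0.8
print(pairwise_precision(clusters, gold))  # 0.5
```

A single impure cluster lowers all three scores, but by different amounts, which is why the table reports them separately.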

Canonicalizing Open Knowledge Bases

3 Nov 2014 · Luis Galárraga, Geremy Heitz

Open information extraction approaches have led to the creation of large knowledge bases from the Web. The problem with such methods is that their entities and relations are not canonicalized, leading to redundant and ambiguous facts. For example, they may store ⟨Barack Obama, was born in, Honolulu⟩ and ⟨Obama, place of birth, Honolulu⟩. In this paper, we present an approach based on machine learning methods that can canonicalize such Open IE triples, by clustering synonymous names and phrases. We also provide a detailed discussion about the different signals, features and design choices that influence the quality of synonym resolution for noun phrases in Open IE KBs, thus shedding light on the middle ground between “open” and “closed” information extraction systems.
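As a rough illustration of what clustering synonymous noun phrases can look like, the sketch below groups phrases whose token sets overlap strongly. It is not the paper's method: the Jaccard token similarity, the 0.5 threshold, the single-link merging, and the names `jaccard` and `cluster_phrases` are all assumptions made for this example.

```python
def tokens(phrase):
    # Lowercased whitespace tokens of a noun phrase.
    return set(phrase.lower().split())

def jaccard(a, b):
    # Token-set overlap between two phrases, in [0, 1].
    ta, tb = tokens(a), tokens(b)
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0

def cluster_phrases(phrases, threshold=0.5):
    # Single-link clustering with union-find: any pair of phrases whose
    # similarity reaches the threshold ends up in the same cluster.
    parent = list(range(len(phrases)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(phrases)):
        for j in range(i + 1, len(phrases)):
            if jaccard(phrases[i], phrases[j]) >= threshold:
                parent[find(j)] = find(i)

    groups = {}
    for i, phrase in enumerate(phrases):
        groups.setdefault(find(i), []).append(phrase)
    return list(groups.values())

phrases = ["Barack Obama", "Obama", "Honolulu", "Honolulu Hawaii"]
print(cluster_phrases(phrases))
# [['Barack Obama', 'Obama'], ['Honolulu', 'Honolulu Hawaii']]
```

Token overlap alone is a weak signal: it would also merge "Barack Obama" with "Michelle Obama", which is the kind of error that motivates combining several signals and features, as the abstract describes.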
