Word Sense Disambiguation: a Unified Evaluation Framework and Empirical Comparison

Introduced by Raganato et al. in Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison

The Evaluation framework of Raganato et al. 2017 includes two training sets (SemCor-Miller et al., 1993- and OMSTI-Taghipour and Ng, 2015-) and five test sets from the Senseval/SemEval series (Edmonds and Cotton, 2001; Snyder and Palmer, 2004; Pradhan et al., 2007; Navigli et al., 2013; Moro and Navigli, 2015), standardized to the same format and sense inventory (i.e. WordNet 3.0).

Typically, there are two kinds of approach for WSD: supervised (which make use of sense-annotated training data) and knowledge-based (which make use of the properties of lexical resources).

Supervised: The most widely used training corpus used is SemCor, with 226,036 sense annotations from 352 documents manually annotated. All supervised systems in the evaluation table are trained on SemCor. Some supervised methods, particularly neural architectures, usually employ the SemEval 2007 dataset as development set (marked by *). The most usual baseline is the Most Frequent Sense (MFS) heuristic, which selects for each target word the most frequent sense in the training data.

Knowledge-based: Knowledge-based systems usually exploit WordNet or BabelNet as semantic network. The first sense given by the underlying sense inventory (i.e. WordNet 3.0) is included as a baseline.

Description from NLP Progress

Homepage

Benchmarks

Add a new result Link an existing benchmark

Task	Dataset Variant	Best Model
Word Sense Disambiguation	Supervised:	ConSeC+WNGC
Word Sense Disambiguation	Knowledge-based:	KEF
Quantization	Knowledge-based:	3DCNN_VIVA_5

Papers

Paper	Code	Results	Date	Stars

Word Sense Disambiguation: a Unified Evaluation Framework and Empirical Comparison

Benchmarks

Add a new result Link an existing benchmark

Papers

Dataset Loaders

Add Remove

Tasks

Similar Datasets

AM2iCo

XL-WiC

CoarseWSD-20

Senseval-2

Usage

License

Modalities

Languages

Word Sense Disambiguation: a Unified Evaluation Framework and Empirical Comparison

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit

Similar Datasets

AM2iCo

XL-WiC

CoarseWSD-20

Senseval-2

Usage

License Edit

Modalities Edit

Languages Edit

Benchmarks

Add a new result Link an existing benchmark

Dataset Loaders

Add Remove

Tasks

License

Modalities

Languages