ADAPT at SemEval-2018 Task 9: Skip-Gram Word Embeddings for Unsupervised Hypernym Discovery in Specialised Corpora

This paper describes a simple but competitive unsupervised system for hypernym discovery. The system uses skip-gram word embeddings with negative sampling, trained on specialised corpora. Candidate hypernyms for an input word are predicted based based on cosine similarity scores. Two sets of word embedding models were trained separately on two specialised corpora: a medical corpus and a music industry corpus. Our system scored highest in the medical domain among the competing unsupervised systems but performed poorly on the music industry domain. Our system does not depend on any external data other than raw specialised corpora.

PDF Abstract

Datasets


Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Hypernym Discovery Medical domain ADAPT MAP 8.13 # 7
MRR 20.56 # 7
P@5 8.32 # 7
Hypernym Discovery Music domain ADAPT MAP 2.63 # 6
MRR 7.46 # 6
P@5 2.64 # 6

Methods


No methods listed for this paper. Add relevant methods here