Semi-supervised Word Sense Disambiguation with Neural Models

Determining the intended sense of words in text, known as word sense disambiguation (WSD), is a long-standing problem in natural language processing. Recently, researchers have shown promising results using word vectors extracted from a neural network language model as features in WSD algorithms. However, a simple average or concatenation of the word vectors in a text loses the text's sequential and syntactic information. In this paper, we study WSD with a sequence-learning neural network, the LSTM, to better capture these sequential and syntactic patterns. To alleviate the lack of training data in all-words WSD, we employ the same LSTM in a semi-supervised label propagation classifier. We demonstrate state-of-the-art results, especially on verbs.
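The semi-supervised step named in the abstract, label propagation, can be illustrated with a small toy sketch. This is not the authors' implementation: the 2-D vectors below are hand-made stand-ins for LSTM context vectors, the cosine-similarity graph and the clamping of labeled seeds are assumptions about one common formulation, and every name (`cosine`, `propagate_labels`, the sense labels) is hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def propagate_labels(vectors, seed_labels, senses, iterations=20):
    """Spread sense labels from labeled to unlabeled contexts.

    vectors     -- one context vector per occurrence (assumed LSTM output)
    seed_labels -- dict mapping an occurrence index to its known sense
    senses      -- list of candidate senses for the target word
    """
    n = len(vectors)
    # Similarity graph: non-negative cosine weights, no self-loops.
    weights = [
        [max(cosine(vectors[i], vectors[j]), 0.0) if i != j else 0.0
         for j in range(n)]
        for i in range(n)
    ]
    # Labeled nodes start one-hot; unlabeled nodes start with no mass.
    dist = [[1.0 if seed_labels.get(i) == s else 0.0 for s in senses]
            for i in range(n)]
    for _ in range(iterations):
        updated = []
        for i in range(n):
            total = sum(weights[i])
            if i in seed_labels or total == 0.0:
                updated.append(dist[i])  # clamp seeds; skip isolated nodes
                continue
            # Weighted average of the neighbors' sense distributions.
            updated.append([
                sum(weights[i][j] * dist[j][k] for j in range(n)) / total
                for k in range(len(senses))
            ])
        dist = updated
    # Pick the sense with the most accumulated mass for each occurrence.
    return [senses[max(range(len(senses)), key=lambda k: dist[i][k])]
            for i in range(n)]


# Toy occurrences of "bank": two labeled seeds, three unlabeled contexts.
contexts = [[1.0, 0.1], [0.1, 1.0], [0.9, 0.2], [0.2, 0.9], [0.8, 0.3]]
seeds = {0: "finance", 1: "river"}
predicted = propagate_labels(contexts, seeds, ["finance", "river"])
print(predicted)
```

The sketch keeps seed labels fixed on every iteration, so unlabeled occurrences inherit a sense from whichever labeled examples they sit closest to in the similarity graph.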

COLING 2016

Datasets


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Word Sense Disambiguation | SemEval 2007 Task 17 | LSTM (T:SemCor) | F1 | 64.2 | #3 |
| Word Sense Disambiguation | SemEval 2007 Task 17 | LSTMLP (T:SemCor, U:1K) | F1 | 63.5 | #5 |
| Word Sense Disambiguation | SemEval 2007 Task 17 | LSTMLP (T:SemCor, U:OMSTI) | F1 | 63.7 | #4 |
| Word Sense Disambiguation | SemEval 2007 Task 17 | LSTMLP (T:OMSTI, U:1K) | F1 | 63.3 | #6 |
| Word Sense Disambiguation | SemEval 2007 Task 17 | LSTM (T:OMSTI) | F1 | 60.7 | #9 |
| Word Sense Disambiguation | SemEval 2007 Task 7 | LSTMLP (T:OMSTI, U:1K) | F1 | 83.3 | #6 |
| Word Sense Disambiguation | SemEval 2007 Task 7 | LSTM (T:SemCor) | F1 | 82.8 | #7 |
| Word Sense Disambiguation | SemEval 2007 Task 7 | LSTMLP (T:SemCor, U:OMSTI) | F1 | 84.3 | #4 |
| Word Sense Disambiguation | SemEval 2007 Task 7 | LSTMLP (T:SemCor, U:1K) | F1 | 83.6 | #5 |
| Word Sense Disambiguation | SemEval 2007 Task 7 | LSTM (T:OMSTI) | F1 | 81.1 | #10 |
| Word Sense Disambiguation | SemEval 2013 Task 12 | LSTM (T:OMSTI) | F1 | 67.3 | #6 |
| Word Sense Disambiguation | SemEval 2013 Task 12 | LSTMLP (T:OMSTI, U:1K) | F1 | 68.1 | #4 |
| Word Sense Disambiguation | SemEval 2013 Task 12 | LSTMLP (T:SemCor, U:1K) | F1 | 69.5 | #3 |
| Word Sense Disambiguation | SemEval 2013 Task 12 | LSTMLP (T:SemCor, U:OMSTI) | F1 | 67.9 | #5 |
| Word Sense Disambiguation | SemEval 2013 Task 12 | LSTM (T:SemCor) | F1 | 67.0 | #9 |
| Word Sense Disambiguation | SensEval 2 | LSTMLP (T:OMSTI, U:1K) | F1 | 74.4 | #3 |
| Word Sense Disambiguation | SensEval 2 | LSTMLP (T:SemCor, U:OMSTI) | F1 | 73.9 | #4 |
| Word Sense Disambiguation | SensEval 2 | LSTMLP (T:SemCor, U:1K) | F1 | 73.8 | #5 |
| Word Sense Disambiguation | SensEval 2 | LSTM (T:SemCor) | F1 | 73.6 | #6 |
| Word Sense Disambiguation | SensEval 2 | LSTM (T:OMSTI) | F1 | 72.4 | #7 |
| Word Sense Disambiguation | SensEval 3 Task 1 | LSTMLP (T:SemCor, U:1K) | F1 | 71.8 | #2 |
| Word Sense Disambiguation | SensEval 3 Task 1 | LSTMLP (T:SemCor, U:OMSTI) | F1 | 71.1 | #3 |
| Word Sense Disambiguation | SensEval 3 Task 1 | LSTMLP (T:OMSTI, U:1K) | F1 | 71.0 | #4 |
| Word Sense Disambiguation | SensEval 3 Task 1 | LSTM (T:SemCor) | F1 | 69.2 | #10 |
| Word Sense Disambiguation | SensEval 3 Task 1 | LSTM (T:OMSTI) | F1 | 64.3 | #11 |

Methods