Learning to Compute Word Embeddings On the Fly

Words in natural language follow a Zipfian distribution whereby some words are frequent but most are rare. Learning representations for words in the "long tail" of this distribution requires enormous amounts of data... (read more)

PDF Abstract ICLR 2018 PDF ICLR 2018 Abstract

Results from the Paper


TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT BENCHMARK
Question Answering SQuAD1.1 OTF dict+spelling (single) EM 64.083 # 158
F1 73.056 # 163
Question Answering SQuAD1.1 OTF spelling (single) EM 62.897 # 160
F1 72.016 # 164
Question Answering SQuAD1.1 OTF spelling+lemma (single) EM 62.604 # 161
F1 71.968 # 165
Question Answering SQuAD1.1 dev OTF dict+spelling (single) EM 63.06 # 38

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet