A Simple Fully Connected Network for Composing Word Embeddings from Characters

ICLR 2018 · Michael Traynor, Thomas Trappenberg ·

This work introduces a simple network for producing character aware word embeddings. Position agnostic and position aware character embeddings are combined to produce an embedding vector for each word. The learned word representations are shown to be very sparse and facilitate improved results on language modeling tasks, despite using markedly fewer parameters, and without the need to apply dropout. A final experiment suggests that weight sharing contributes to sparsity, increases performance, and prevents overfitting.

PDF Abstract