The Development of Multimodal Lexical Resources

Human communication is a multimodal activity, involving not only speech and written expressions but also intonation, images, gestures, visual cues, and the interpretation of actions through perception. In this paper, we describe the design of a multimodal lexicon that is able to accommodate the diverse modalities encountered in NLP applications. We have been developing a multimodal semantic representation, VoxML, that integrates the encoding of semantic, visual, gestural, and action-based features associated with linguistic expressions.
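To make the idea of such an entry concrete, the following is a minimal Python sketch of what a multimodal lexical entry could look like. The attribute names (LEX, TYPE, HABITAT, AFFORD_STR, EMBODIMENT) follow those used in the VoxML literature, but this Python encoding, the class name MultimodalEntry, and all example values are illustrative assumptions rather than the actual VoxML format, which is a typed feature-structure markup.

```python
from dataclasses import dataclass
from typing import Dict, List

@dataclass
class MultimodalEntry:
    """A simplified multimodal lexical entry, loosely modeled on the
    attribute structure described for VoxML (LEX, TYPE, HABITAT,
    AFFORD_STR, EMBODIMENT). The encoding here is illustrative, not
    the actual VoxML markup."""
    lex: Dict[str, str]        # linguistic features: predicate, semantic type
    geom: Dict[str, str]       # visual/geometric features: shape, concavity, symmetry
    habitat: Dict[str, str]    # configurations in which the object affords interaction
    afford_str: List[str]      # action-based features: what the object affords
    embodiment: Dict[str, str] # scale relative to the agent, movability

# Hypothetical entry for "cup"; all values are made-up placeholders.
cup = MultimodalEntry(
    lex={"pred": "cup", "type": "physobj"},
    geom={"head": "cylindroid", "concavity": "concave"},
    habitat={"intrinsic": "upright along the world Y axis"},
    afford_str=["grasp", "contain", "drink_from"],
    embodiment={"scale": "smaller than agent", "movable": "true"},
)

print(cup.afford_str)  # ['grasp', 'contain', 'drink_from']
```

The point of the structure is that a single lexical entry bundles linguistic, visual, and action-based information, so an NLP system consulting the lexicon can reason jointly over what a word denotes, what its referent looks like, and how an agent can interact with it.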
