1 code implementation • 23 Apr 2018 • Akilesh B, Abhishek Sinha, Mausoom Sarkar, Balaji Krishnamurthy
We develop an attention mechanism for multi-modal fusion of visual and textual modalities that allows the agent to learn to complete the task and achieve language grounding.
no code implementations • ICLR 2018 • Abhishek Sinha, Akilesh B, Mausoom Sarkar, Balaji Krishnamurthy
In this work, we focus on the problem of grounding language by training an agent to follow a set of natural language instructions and navigate to a target object in a 2D grid environment.