1 code implementation • 19 Jan 2021 • Homagni Saha, Fateme Fotouhif, Qisai Liu, Soumik Sarkar
In this paper, we propose a new framework, MoViLan (Modular Vision and Language), for executing visually grounded natural language instructions for day-to-day indoor household tasks.