no code implementations • 13 Mar 2024 • Dingbang Li, Wenzhou Chen, Xin Lin
We evaluate the performance of our method on the Room-to-Room dataset.
Question Answering Vision-Language Navigation