no code implementations • 3 Apr 2024 • Tomoya Yoshida, Shuhei Kurita, Taichi Nishimura, Shinsuke Mori
The key idea of our approach is to employ textual instructions that target various affordances for a wide range of objects.
Referring Expression • Referring Expression Comprehension