1 code implementation • 27 Mar 2024 • Inhwan Bae, Junoh Lee, Hae-Gon Jeon
Next, to guide the language model in understanding and reasoning high-level knowledge, such as scene context and social relationships between pedestrians, we introduce an auxiliary multi-task question and answering.