1 code implementation • 29 Jul 2022 • Jiachang Hao, Haifeng Sun, Pengfei Ren, Jingyu Wang, Qi Qi, Jianxin Liao
Our framework introduces two auxiliary tasks, cross-modal matching and temporal order discrimination, to promote the grounding model training.
1 code implementation • CVPR 2022 • Pengfei Ren, Haifeng Sun, Jiachang Hao, Jingyu Wang, Qi Qi, Jianxin Liao
However, these methods ignore the rich semantic information in each view and ignore the complex dependencies between different regions of different views.