1 code implementation • 8 Jan 2023 • Shuailei Ma, Yuefeng Wang, Shanze Wang, Ying WEI
HSAM and TAM semantically align and merge the extracted features with the query embeddings, first from the hierarchical spatial perspective and then from the task perspective.
Ranked #6 on Human-Object Interaction Detection on HICO-DET