no code implementations • 6 Jan 2024 • Aniello Panariello, Gianluca Mancusi, Fedy Haj Ali, Angelo Porrello, Simone Calderara, Rita Cucchiara
Existing approaches rely on two scales: local information (i. e., the bounding box proportions) or global information, which encodes the semantics of the scene as well as the spatial relations with neighboring objects.