no code implementations • ICCV 2023 • Amit Kumar Rana, Sabarinath Mahadevan, Alexander Hermans, Bastian Leibe
We introduce a more efficient approach, called DynaMITe, in which we represent user interactions as spatio-temporal queries to a Transformer decoder with a potential to segment multiple object instances in a single iteration.