no code implementations • 15 Mar 2024 • Jinxia Xie, Bineng Zhong, Zhiyi Mo, Shengping Zhang, Liangtao Shi, Shuxiang Song, Rongrong Ji
Firstly, we introduce a set of learnable and autoregressive queries to capture the instantaneous target appearance changes in a sliding window fashion.
1 code implementation • 6 Jan 2024 • Liangtao Shi, Bineng Zhong, Qihua Liang, Ning li, Shengping Zhang, Xianxian Li
Specifically, we utilize spatio-temporal tokens to propagate information between consecutive frames without focusing on updating templates.