no code implementations • 14 Mar 2024 • Lingyi Hong, Shilin Yan, Renrui Zhang, Wanyun Li, Xinyu Zhou, Pinxue Guo, Kaixun Jiang, Yiting Chen, Jinglun Li, Zhaoyu Chen, Wenqiang Zhang
To evaluate the effectiveness of our general framework OneTracker, which is consisted of Foundation Tracker and Prompt Tracker, we conduct extensive experiments on 6 popular tracking tasks across 11 benchmarks and our OneTracker outperforms other models and achieves state-of-the-art performance.
no code implementations • 13 Mar 2024 • Wanyun Li, Pinxue Guo, Xinyu Zhou, Lingyi Hong, Yangji He, Xiangyu Zheng, Wei zhang, Wenqiang Zhang
Contemporary Video Object Segmentation (VOS) approaches typically consist stages of feature extraction, matching, memory management, and multiple objects aggregation.
no code implementations • 10 Mar 2024 • Pinxue Guo, Lingyi Hong, Xinyu Zhou, Shuyong Gao, Wanyun Li, Jinglun Li, Zhaoyu Chen, Xiaoqiang Li, Wei zhang, Wenqiang Zhang
To address these limitations, we propose the setting named Click Video Object Segmentation (ClickVOS) which segments objects of interest across the whole video according to a single click per object in the first frame.
no code implementations • NeurIPS 2023 • Xinyu Zhou, Pinxue Guo, Lingyi Hong, Jinglun Li, Wei zhang, Weifeng Ge, Wenqiang Zhang
Therefore, using all features in the template and memory can lead to redundancy and impair tracking performance.
1 code implementation • 26 May 2023 • Pinxue Guo, Tony Huang, Peiyang He, Xuefeng Liu, Tianjun Xiao, Zhaoyu Chen, Wenqiang Zhang
Open-vocabulary Video Instance Segmentation (OpenVIS) can simultaneously detect, segment, and track arbitrary object categories in a video, without being constrained to categories seen during training.
no code implementations • 21 Mar 2023 • Yuzheng Wang, Zhaoyu Chen, Dingkang Yang, Pinxue Guo, Kaixun Jiang, Wenqiang Zhang, Lizhe Qi
Adversarial Robustness Distillation (ARD) is a promising task to solve the issue of limited adversarial robustness of small capacity models while optimizing the expensive computational costs of Adversarial Training (AT).
no code implementations • ICCV 2023 • Jinglun Li, Xinyu Zhou, Pinxue Guo, Yixuan Sun, Yiwen Huang, Weifeng Ge, Wenqiang Zhang
We use one fold as the in-distribution dataset and the others as out-of-distribution datasets to evaluate the proposed method.
2 code implementations • 21 Nov 2022 • Jiafeng Wang, Zhaoyu Chen, Kaixun Jiang, Dingkang Yang, Lingyi Hong, Pinxue Guo, Haijing Guo, Wenqiang Zhang
To tackle these issues, we propose Global Momentum Initialization (GI) to suppress gradient elimination and help search for the global optimum.
1 code implementation • ICCV 2023 • Lingyi Hong, Wenchao Chen, Zhongying Liu, Wei zhang, Pinxue Guo, Zhaoyu Chen, Wenqiang Zhang
The videos in our LVOS last 1. 59 minutes on average, which is 20 times longer than videos in existing VOS datasets.