1 code implementation • 25 Jul 2024 • Shuming Liu, Lin Sui, Chen-Lin Zhang, Fangzhou Mu, Chen Zhao, Bernard Ghanem
As a fundamental task in long-form video understanding, temporal action detection (TAD) aims to capture inherent temporal relations in untrimmed videos and identify candidate actions with precise boundaries.
1 code implementation • 5 Jul 2023 • Lin Sui, Fangzhou Mu, Yin Li
This report describes our submission to the Ego4D Moment Queries Challenge 2023.
1 code implementation • 7 Jun 2022 • Lin Sui, Chen-Lin Zhang, Lixin Gu, Feng Han
Some existing methods build one-stage pipelines, but a large performance drop exists with the vanilla one-stage pipeline, and extra classification modules are needed to achieve comparable performance.
no code implementations • CVPR 2022 • Lin Sui, Chen-Lin Zhang, Jianxin Wu
However, the lack of bounding-box supervision makes its accuracy much lower than that of fully supervised object detection (FSOD), and modern FSOD techniques currently cannot be applied to WSOD.