1 code implementation • 16 Mar 2023 • Tuan N. Tang, Kwonyoung Kim, Kwanghoon Sohn
To this end, we introduce TemporalMaxer, which minimizes long-term temporal context modeling while maximizing information from the extracted video clip features with a basic, parameter-free, and local region operating max-pooling block.
Ranked #1 on
Temporal Action Localization
on MUSES
1 code implementation • 8 Nov 2022 • Tuan N. Tang, Jungin Park, Kwonyoung Kim, Kwanghoon Sohn
In addition, the evaluation for Online Detection of Action Start (ODAS) demonstrates the effectiveness and robustness of our method in the online setting.
4 code implementations • 24 Aug 2021 • Chuong H. Nguyen, Thuy C. Nguyen, Tuan N. Tang, Nam L. H. Phan
Using PAA-ResNet50 as a teacher, our LAD techniques can improve detectors PAA-ResNet101 and PAA-ResNeXt101 to $46 \rm AP$ and $47. 5\rm AP$ on the COCO test-dev set.
no code implementations • 12 Jun 2021 • Thuy C. Nguyen, Tuan N. Tang, Nam LH. Phan, Chuong H. Nguyen, Masayuki Yamazaki, Masao Yamanaka
Video Instance Segmentation (VIS) is a multi-task problem performing detection, segmentation, and tracking simultaneously.
Ranked #11 on
Video Instance Segmentation
on YouTube-VIS validation