Search Results for author: Shuming Liu

Found 10 papers, 5 papers with code

Dr$^2$Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning

1 code implementation • 8 Jan 2024 • Chen Zhao, Shuming Liu, Karttikeya Mangalam, Guocheng Qian, Fatimah Zohra, Abdulmohsen Alghannam, Jitendra Malik, Bernard Ghanem

We use two coefficients on either type of residual connections respectively, and introduce a dynamic training strategy that seamlessly transitions the pretrained model to a reversible network with much higher numerical precision.

object-detection Small Object Detection +1

End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames

2 code implementations • 28 Nov 2023 • Shuming Liu, Chen-Lin Zhang, Chen Zhao, Bernard Ghanem

In this paper, we reduce the memory consumption for end-to-end training, and manage to scale up the TAD backbone to 1 billion parameters and the input video to 1,536 frames, leading to significant detection performance.

Action Detection Temporal Action Localization

Boundary-Denoising for Video Activity Localization

1 code implementation • 6 Apr 2023 • Mengmeng Xu, Mattia Soldan, Jialin Gao, Shuming Liu, Juan-Manuel Pérez-Rúa, Bernard Ghanem

To alleviate the boundary ambiguity, we propose to study the video activity localization problem from a denoising perspective.

Action Detection Denoising +2

Look, Listen, and Attack: Backdoor Attacks Against Video Action Recognition

no code implementations • 3 Jan 2023 • Hasan Abed Al Kader Hammoud, Shuming Liu, Mohammed Alkhrashi, Fahad Albalawi, Bernard Ghanem

Although backdoor attacks have been extensively studied in the image domain, there are very few works that explore such attacks in the video domain, and they tend to conclude that image backdoor attacks are less effective in the video domain.

Action Recognition Temporal Action Localization

Re$^2$TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization

1 code implementation • 25 Nov 2022 • Chen Zhao, Shuming Liu, Karttikeya Mangalam, Bernard Ghanem

Temporal action localization (TAL) requires long-form reasoning to predict actions of various durations and complex content.

Temporal Action Localization

ETAD: Training Action Detection End to End on a Laptop

1 code implementation • 14 May 2022 • Shuming Liu, Mengmeng Xu, Chen Zhao, Xu Zhao, Bernard Ghanem

We propose to sequentially forward the snippet frame through the video encoder, and backward only a small necessary portion of gradients to update the encoder.

Action Detection Video Understanding

Hybrid Attention Networks for Flow and Pressure Forecasting in Water Distribution Systems

no code implementations • 13 Apr 2020 • Ziqing Ma, Shuming Liu, Guancheng Guo, Xipeng Yu

Specifically, a hybrid spatial attention mechanism that employs inputs along temporal and spatial axes is proposed.

Anomaly Detection Decision Making +2

Multi-Granularity Fusion Network for Proposal and Activity Localization: Submission to ActivityNet Challenge 2019 Task 1 and Task 2

no code implementations • 29 Jul 2019 • Haisheng Su, Xu Zhao, Shuming Liu

This technical report presents an overview of our solution used in the submission to ActivityNet Challenge 2019 Task 1 (temporal action proposal generation) and Task 2 (temporal action localization/detection).

Re-Ranking Task 2 +1
