Search Results for author: Shuming Liu

Found 10 papers, 5 papers with code

Dr$^2$Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning

1 code implementation • 8 Jan 2024 • Chen Zhao, Shuming Liu, Karttikeya Mangalam, Guocheng Qian, Fatimah Zohra, Abdulmohsen Alghannam, Jitendra Malik, Bernard Ghanem

We use two coefficients on either type of residual connections respectively, and introduce a dynamic training strategy that seamlessly transitions the pretrained model to a reversible network with much higher numerical precision.

Object Detection Small Object Detection +1

End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames

2 code implementations • 28 Nov 2023 • Shuming Liu, Chen-Lin Zhang, Chen Zhao, Bernard Ghanem

In this paper, we reduce the memory consumption for end-to-end training, and manage to scale up the TAD backbone to 1 billion parameters and the input video to 1,536 frames, leading to significant detection performance.

Ranked #1 on Temporal Action Localization on EPIC-KITCHENS-100

Action Detection Temporal Action Localization

Mindstorms in Natural Language-Based Societies of Mind

no code implementations • 26 May 2023 • Mingchen Zhuge, Haozhe Liu, Francesco Faccio, Dylan R. Ashley, Róbert Csordás, Anand Gopalakrishnan, Abdullah Hamdi, Hasan Abed Al Kader Hammoud, Vincent Herrmann, Kazuki Irie, Louis Kirsch, Bing Li, Guohao Li, Shuming Liu, Jinjie Mai, Piotr Piękos, Aditya Ramesh, Imanol Schlag, Weimin Shi, Aleksandar Stanić, Wenyi Wang, Yuhui Wang, Mengmeng Xu, Deng-Ping Fan, Bernard Ghanem, Jürgen Schmidhuber

What should be the social structure of an NLSOM?

3D Generation Image Captioning +2

Boundary-Denoising for Video Activity Localization

1 code implementation • 6 Apr 2023 • Mengmeng Xu, Mattia Soldan, Jialin Gao, Shuming Liu, Juan-Manuel Pérez-Rúa, Bernard Ghanem

To alleviate the boundary ambiguity, we propose to study the video activity localization problem from a denoising perspective.

Ranked #1 on Video Grounding on MAD

Action Detection Denoising +2

Look, Listen, and Attack: Backdoor Attacks Against Video Action Recognition

no code implementations • 3 Jan 2023 • Hasan Abed Al Kader Hammoud, Shuming Liu, Mohammed Alkhrashi, Fahad Albalawi, Bernard Ghanem

Although backdoor attacks have been extensively studied in the image domain, there are very few works that explore such attacks in the video domain, and they tend to conclude that image backdoor attacks are less effective in the video domain.

Action Recognition Temporal Action Localization

Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization

no code implementations • CVPR 2023 • Chen Zhao, Shuming Liu, Karttikeya Mangalam, Bernard Ghanem

Temporal action localization (TAL) requires long-form reasoning to predict actions of various durations and complex content.

Temporal Action Localization

Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization

1 code implementation • 25 Nov 2022 • Chen Zhao, Shuming Liu, Karttikeya Mangalam, Bernard Ghanem

Temporal action localization (TAL) requires long-form reasoning to predict actions of various durations and complex content.

Temporal Action Localization

ETAD: Training Action Detection End to End on a Laptop

1 code implementation • 14 May 2022 • Shuming Liu, Mengmeng Xu, Chen Zhao, Xu Zhao, Bernard Ghanem

We propose to sequentially forward the snippet frame through the video encoder, and backward only a small necessary portion of gradients to update the encoder.

Action Detection Video Understanding

Hybrid Attention Networks for Flow and Pressure Forecasting in Water Distribution Systems

no code implementations • 13 Apr 2020 • Ziqing Ma, Shuming Liu, Guancheng Guo, Xipeng Yu

Specifically, a hybrid spatial attention mechanism that employs inputs along temporal and spatial axes is proposed.

Anomaly Detection Decision Making +2

Multi-Granularity Fusion Network for Proposal and Activity Localization: Submission to ActivityNet Challenge 2019 Task 1 and Task 2

no code implementations • 29 Jul 2019 • Haisheng Su, Xu Zhao, Shuming Liu

This technical report presents an overview of our solution used in the submission to ActivityNet Challenge 2019 Task 1 (temporal action proposal generation) and Task 2 (temporal action localization/detection).

Re-Ranking Task 2 +1
