Search Results for author: SerNam Lim

Found 5 papers, 1 papers with code

Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions

no code implementations11 Mar 2024 Lan Wang, Vishnu Boddeti, SerNam Lim

While existing video editing tasks are limited to changes in attributes, backgrounds, and styles, our method aims to predict open-ended human action changes in video.

counterfactual Video Editing +1

UniMODE: Unified Monocular 3D Object Detection

no code implementations28 Feb 2024 Zhuoling Li, Xiaogang Xu, SerNam Lim, Hengshuang Zhao

To address these challenges, we build a detector based on the bird's-eye-view (BEV) detection paradigm, where the explicit feature projection is beneficial to addressing the geometry learning ambiguity when employing multiple scenarios of data to train detectors.

Monocular 3D Object Detection Object +2

Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding

no code implementations20 Sep 2023 Mohamed Afham, Satya Narayan Shukla, Omid Poursaeed, Pengchuan Zhang, Ashish Shah, SerNam Lim

While most modern video understanding models operate on short-range clips, real-world videos are often several minutes long with semantically consistent segments of variable length.

Temporal Action Localization Video Classification +1

Robustness and Generalization via Generative Adversarial Training

no code implementations ICCV 2021 Omid Poursaeed, Tianxing Jiang, Harry Yang, Serge Belongie, SerNam Lim

Adversarial training with these examples enable the model to withstand a wide range of attacks by observing a variety of input alterations during training.

object-detection Object Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.