Search Results for author: Huabin Liu

Found 8 papers, 3 papers with code

DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement

no code implementations • 3 Apr 2024 Hao Wu, Huabin Liu, Yu Qiao, Xiao Sun

We present Dive Into the BoundarieS (DIBS), a novel pretraining framework for dense video captioning (DVC) that improves the quality of event captions generated from unlabeled videos and of their associated pseudo event boundaries.

Dense Video Captioning

Few-shot Action Recognition via Intra- and Inter-Video Information Maximization

no code implementations • 10 May 2023 Huabin Liu, Weiyao Lin, Tieyuan Chen, Yuxi Li, Shuyuan Li, John See

The alignment model performs temporal and spatial action alignment sequentially at the feature level, leading to more precise measurements of inter-video similarity.

Few-Shot Action Recognition +2

Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition

1 code implementation • 20 Jul 2022 Huabin Liu, Weixian Lv, John See, Weiyao Lin

In this paper, we propose a novel video frame sampler for few-shot action recognition to address this issue, where task-specific spatial-temporal frame sampling is achieved via a temporal selector (TS) and a spatial amplifier (SA).

Few-Shot Action Recognition
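The temporal selector (TS) and spatial amplifier (SA) mentioned above can be sketched roughly as follows. This is a minimal illustration, not the paper's learned modules: the feature-norm scoring and fixed-zoom nearest-neighbour crop are hypothetical stand-ins for what TS and SA would learn end-to-end.

```python
import numpy as np

def temporal_selector(frame_features, n_select):
    """Keep the n most informative frames (here scored by feature norm,
    a stand-in for a learned scorer), preserving temporal order."""
    scores = np.linalg.norm(frame_features, axis=1)
    return np.sort(np.argsort(scores)[-n_select:])

def spatial_amplifier(frame, center, zoom=2.0):
    """Crop a window around `center` and upsample it back to the original
    resolution (nearest-neighbour), emulating a fixed spatial zoom."""
    h, w = frame.shape[:2]
    ch, cw = int(h / zoom), int(w / zoom)
    y0 = int(np.clip(center[0] - ch // 2, 0, h - ch))
    x0 = int(np.clip(center[1] - cw // 2, 0, w - cw))
    crop = frame[y0:y0 + ch, x0:x0 + cw]
    ys = np.arange(h) * ch // h   # map output rows back into the crop
    xs = np.arange(w) * cw // w
    return crop[np.ix_(ys, xs)]

# Toy run: pick 4 of 16 frames, then zoom into the center of a 32x32 frame.
feats = np.random.default_rng(0).normal(size=(16, 8))
kept = temporal_selector(feats, n_select=4)
frame = np.arange(32 * 32, dtype=float).reshape(32, 32)
zoomed = spatial_amplifier(frame, center=(16, 16), zoom=2.0)
print(kept.shape, zoomed.shape)
```

The point of the sketch is the two-stage, task-specific pipeline: frames are first filtered temporally, then each kept frame can be spatially amplified around a region of interest before feature extraction.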

Speed Up Object Detection on Gigapixel-Level Images With Patch Arrangement

no code implementations CVPR 2022 Jiahao Fan, Huabin Liu, Wenjie Yang, John See, Aixin Zhang, Weiyao Lin

With the emergence of super high-resolution (e.g., gigapixel-level) images, performing efficient object detection on such images has become an important issue.

Object Detection +1

TA2N: Two-Stage Action Alignment Network for Few-shot Action Recognition

1 code implementation • 10 Jul 2021 Shuyuan Li, Huabin Liu, Rui Qian, Yuxi Li, John See, Mengjuan Fei, Xiaoyuan Yu, Weiyao Lin

The first stage locates the action by learning a temporal affine transform, which warps each video feature to its action duration while suppressing action-irrelevant features (e.g., background).

Few-Shot Action Recognition +2
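The temporal affine transform described above can be illustrated with a minimal sketch: warp a (T, D) feature sequence by remapping normalized time as t' = scale·t + shift and resampling with linear interpolation. The fixed `scale`/`shift` values are hypothetical stand-ins for the per-video parameters that a learned first stage would predict.

```python
import numpy as np

def temporal_affine_warp(features, scale, shift):
    """Resample a (T, D) feature sequence along time via the affine map
    t' = scale * t + shift over normalized time in [0, 1], using linear
    interpolation between neighbouring frames."""
    T = features.shape[0]
    t = np.linspace(0.0, 1.0, T)                          # output time steps
    src = np.clip(scale * t + shift, 0.0, 1.0) * (T - 1)  # source positions
    lo = np.floor(src).astype(int)
    hi = np.minimum(lo + 1, T - 1)
    w = (src - lo)[:, None]
    return (1 - w) * features[lo] + w * features[hi]

# Toy example: an 8-step, 3-dim sequence; scale=0.5, shift=0.25 focuses
# the warped output on the middle half of the sequence (the "action").
feats = np.arange(24, dtype=float).reshape(8, 3)
warped = temporal_affine_warp(feats, scale=0.5, shift=0.25)
print(warped.shape)
```

Conceptually, this is how a learned affine transform can stretch the action segment to fill the whole temporal extent while background frames outside the mapped range are dropped from the resampled sequence.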

Human in Events: A Large-Scale Benchmark for Human-centric Video Analysis in Complex Events

no code implementations • 9 May 2020 Weiyao Lin, Huabin Liu, Shizhan Liu, Yuxi Li, Rui Qian, Tao Wang, Ning Xu, Hongkai Xiong, Guo-Jun Qi, Nicu Sebe

To this end, we present a new large-scale dataset with comprehensive annotations, named Human-in-Events or HiEve (Human-centric video analysis in complex Events), for the understanding of human motions, poses, and actions in a variety of realistic events, especially crowded and complex ones.

Action Recognition • Pose Estimation
