Search Results for author: Quoc-Huy Tran

Found 18 papers, 4 papers with code

Action Segmentation Using 2D Skeleton Heatmaps

no code implementations • 12 Sep 2023 • Syed Waleed Hyder, Muhammad Usama, Anas Zafar, Muhammad Naufil, Fawad Javed Fateh, Andrey Konin, M. Zeeshan Zia, Quoc-Huy Tran

This paper presents a 2D skeleton-based action segmentation method with applications in fine-grained human activity recognition.

Action Segmentation Human Activity Recognition +1

Paper
Add Code

Learning by Aligning 2D Skeleton Sequences in Time

no code implementations • 31 May 2023 • Quoc-Huy Tran, Muhammad Ahmed, Murad Popattia, M. Hassan Ahmed, Andrey Konin, M. Zeeshan Zia

This paper presents a self-supervised temporal video alignment framework which is useful for several fine-grained human activity understanding applications.

Retrieval Self-Supervised Learning +1

Paper
Add Code

Permutation-Aware Action Segmentation via Unsupervised Frame-to-Segment Alignment

no code implementations • 31 May 2023 • Quoc-Huy Tran, Ahmed Mehmood, Muhammad Ahmed, Muhammad Naufil, Anas Zafar, Andrey Konin, M. Zeeshan Zia

The frame-level prediction module is trained in an unsupervised manner via temporal optimal transport.

Action Segmentation Segmentation

Paper
Add Code

Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments

1 code implementation • 21 Jul 2022 • Khoi D. Nguyen, Quoc-Huy Tran, Khoi Nguyen, Binh-Son Hua, Rang Nguyen

To the best of our knowledge, our work is the first to explore transductive few-shot video classification.

General Classification Video Classification

Paper
Code

Timestamp-Supervised Action Segmentation with Graph Convolutional Networks

no code implementations • 30 Jun 2022 • Hamza Khan, Sanjay Haresh, Awais Ahmed, Shakeeb Siddiqui, Andrey Konin, M. Zeeshan Zia, Quoc-Huy Tran

We introduce a novel approach for temporal activity segmentation with timestamp supervision.

Action Segmentation Segmentation

Paper
Add Code

POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples

1 code implementation • NeurIPS 2021 • Duong H. Le, Khoi D. Nguyen, Khoi Nguyen, Quoc-Huy Tran, Rang Nguyen, Binh-Son Hua

In this work, we propose to use out-of-distribution samples, i. e., unlabeled samples coming from outside the target classes, to improve few-shot learning.

Few-Shot Learning

Paper
Code

POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples NeurIPS 2021

1 code implementation • Advances in Neural Information Processing Systems 2021 • Duong H. Le*, Khoi D. Nguyen*, Khoi Nguyen, Quoc-Huy Tran, Rang Nguyen, Binh-Son Hua

In this work, we propose to use out-of-distribution samples, i. e., unlabeled samples coming from outside the target classes, to improve few-shot learning.

Few-Shot Learning

Paper
Code

Unsupervised Action Segmentation by Joint Representation Learning and Online Clustering

no code implementations • CVPR 2022 • Sateesh Kumar, Sanjay Haresh, Awais Ahmed, Andrey Konin, M. Zeeshan Zia, Quoc-Huy Tran

The temporal optimal transport module enables our approach to learn effective representations for unsupervised activity segmentation.

Action Segmentation Clustering +3

Paper
Add Code

Learning by Aligning Videos in Time

no code implementations • CVPR 2021 • Sanjay Haresh, Sateesh Kumar, Huseyin Coskun, Shahram Najam Syed, Andrey Konin, Muhammad Zeeshan Zia, Quoc-Huy Tran

To overcome this problem, we propose a temporal regularization term (i. e., Contrastive-IDM) which encourages different frames to be mapped to different points in the embedding space.

Representation Learning Retrieval +1

Paper
Add Code

Image Stitching and Rectification for Hand-Held Cameras

no code implementations • ECCV 2020 • Bingbing Zhuang, Quoc-Huy Tran

In this paper, we derive a new differential homography that can account for the scanline-varying camera poses in Rolling Shutter (RS) cameras, and demonstrate its application to carry out RS-aware image stitching and rectification at one stroke.

Image Stitching

Paper
Add Code

Learning Monocular Visual Odometry via Self-Supervised Long-Term Modeling

no code implementations • ECCV 2020 • Yuliang Zou, Pan Ji, Quoc-Huy Tran, Jia-Bin Huang, Manmohan Chandraker

Monocular visual odometry (VO) suffers severely from error accumulation during frame-to-frame pose estimation.

Monocular Visual Odometry Pose Estimation +2

Paper
Add Code

Pseudo RGB-D for Self-Improving Monocular SLAM and Depth Prediction

1 code implementation • ECCV 2020 • Lokender Tiwari, Pan Ji, Quoc-Huy Tran, Bingbing Zhuang, Saket Anand, Manmohan Chandraker

Classical monocular Simultaneous Localization And Mapping (SLAM) and the recently emerging convolutional neural networks (CNNs) for monocular depth prediction represent two largely disjoint approaches towards building a 3D map of the surrounding environment.

Depth Estimation Depth Prediction +1

Paper
Code

Towards Anomaly Detection in Dashcam Videos

no code implementations • 11 Apr 2020 • Sanjay Haresh, Sateesh Kumar, M. Zeeshan Zia, Quoc-Huy Tran

We apply: (i) one-class classification loss and (ii) reconstruction-based loss, for anomaly detection on RetroTrucks as well as on existing static-camera datasets.

One-Class Classification

Paper
Add Code

Degeneracy in Self-Calibration Revisited and a Deep Learning Solution for Uncalibrated SLAM

no code implementations • 30 Jul 2019 • Bingbing Zhuang, Quoc-Huy Tran, Pan Ji, Gim Hee Lee, Loong Fah Cheong, Manmohan Chandraker

Self-calibration of camera intrinsics and radial distortion has a long history of research in the computer vision community.

Simultaneous Localization and Mapping

Paper
Add Code

Hierarchical Metric Learning and Matching for 2D and 3D Geometric Correspondences

no code implementations • ECCV 2018 • Mohammed E. Fathy, Quoc-Huy Tran, M. Zeeshan Zia, Paul Vernaza, Manmohan Chandraker

Further, we propose to use activation maps at different layers of a CNN, as an effective and principled replacement for the multi-resolution image pyramids often used for matching tasks.

Geometric Matching Metric Learning +1

Paper
Add Code

Deep Supervision with Intermediate Concepts

no code implementations • 8 Jan 2018 • Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D. Hager, Manmohan Chandraker

In this work, we explore an approach for injecting prior domain structure into neural network training by supervising hidden layers of a CNN with intermediate concepts that normally are not observed in practice.

Image Classification