Search Results for author: Haoxuan Qu

Found 10 papers, 1 papers with code

LLMs are Good Action Recognizers

no code implementations31 Mar 2024 Haoxuan Qu, Yujun Cai, Jun Liu

Motivated by this, we propose a novel LLM-AR framework, in which we investigate treating the Large Language Model as an Action Recognizer.

Action Recognition Language Modelling +3

GPT-Connect: Interaction between Text-Driven Human Motion Generator and 3D Scenes in a Training-free Manner

no code implementations22 Mar 2024 Haoxuan Qu, Ziyan Guo, Jun Liu

Recently, while text-driven human motion generation has received massive research attention, most existing text-driven motion generators are generally only designed to generate motion sequences in a blank background.

Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning

no code implementations15 Mar 2024 Hang Zhang, Wenxiao Zhang, Haoxuan Qu, Jun Liu

Human-centered dynamic scene understanding plays a pivotal role in enhancing the capability of robotic and autonomous systems, in which Video-based Human-Object Interaction (V-HOI) detection is a crucial task in semantic scene understanding, aimed at comprehensively understanding HOI relationships within a video to benefit the behavioral decisions of mobile robots and autonomous driving systems.

Autonomous Driving Human-Object Interaction Detection +2

6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation

no code implementations29 Dec 2023 Li Xu, Haoxuan Qu, Yujun Cai, Jun Liu

Estimating the 6D object pose from a single RGB image often involves noise and indeterminacy due to challenges such as occlusions and cluttered backgrounds.

6D Pose Estimation using RGB Denoising +1

A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation

no code implementations CVPR 2023 Haoxuan Qu, Yujun Cai, Lin Geng Foo, Ajay Kumar, Jun Liu

Therefore, via minimizing the distance between the two characteristic functions, we can optimize the model to provide a more accurate localization result for the body joints in different sub-regions of the predicted heatmap.

Pose Estimation

Improving the Reliability for Confidence Estimation

no code implementations13 Oct 2022 Haoxuan Qu, Yanchao Li, Lin Geng Foo, Jason Kuen, Jiuxiang Gu, Jun Liu

Confidence estimation, a task that aims to evaluate the trustworthiness of the model's prediction output during deployment, has received lots of research attention recently, due to its importance for the safe deployment of deep models.

Image Classification Meta-Learning +1

Heatmap Distribution Matching for Human Pose Estimation

no code implementations3 Oct 2022 Haoxuan Qu, Li Xu, Yujun Cai, Lin Geng Foo, Jun Liu

In this paper, we show that optimizing the heatmap prediction in such a way, the model performance of body joint localization, which is the intrinsic objective of this task, may not be consistently improved during the optimization process of the heatmap prediction.

2D Human Pose Estimation Pose Estimation

Meta Spatio-Temporal Debiasing for Video Scene Graph Generation

no code implementations23 Jul 2022 Li Xu, Haoxuan Qu, Jason Kuen, Jiuxiang Gu, Jun Liu

Video scene graph generation (VidSGG) aims to parse the video content into scene graphs, which involves modeling the spatio-temporal contextual information in the video.

Graph Generation Meta-Learning +2

Recent Advances of Continual Learning in Computer Vision: An Overview

no code implementations23 Sep 2021 Haoxuan Qu, Hossein Rahmani, Li Xu, Bryan Williams, Jun Liu

In contrast to batch learning where all training data is available at once, continual learning represents a family of methods that accumulate knowledge and learn continuously with data available in sequential order.

Continual Learning Knowledge Distillation

Cannot find the paper you are looking for? You can Submit a new open access paper.