1 code implementation • 8 Dec 2022 • Xiangyu Xu, Li Guan, Enrique Dunn, Haoxiang Li, Gang Hua
In this paper, we propose an end-to-end framework that jointly learns keypoint detection, descriptor representation and cross-frame matching for the task of image-based 3D localization.
1 code implementation • 30 Nov 2022 • Haichao Yu, Haoxiang Li, Gang Hua, Gao Huang, Humphrey Shi
To optimize the model, these prediction heads together with the network backbone are trained on every batch of training data.
1 code implementation • 3 Jan 2022 • Siming Yan, Zhenpei Yang, Haoxiang Li, Chen Song, Li Guan, Hao Kang, Gang Hua, QiXing Huang
The most popular and accessible 3D representation, i. e., point clouds, involves discrete samples of the underlying continuous 3D surface.
Ranked #3 on
3D Point Cloud Linear Classification
on ModelNet40
(using extra training data)
3D Point Cloud Classification
3D Point Cloud Linear Classification
+3
no code implementations • 1 Dec 2021 • Yanjie Zhu, Haoxiang Li, Yuanyuan Liu, Muzi Guo, Guanxun Cheng, Gang Yang, Haifeng Wang, Dong Liang
Methods: The proposed framework consists of a reconstruction module and a generative module.
1 code implementation • 26 Nov 2021 • Kumara Kahatapitiya, Zhou Ren, Haoxiang Li, Zhenyu Wu, Michael S. Ryoo, Gang Hua
However, such pretrained models are not ideal for downstream detection, due to the disparity between the pretraining and the downstream fine-tuning tasks.
Ranked #3 on
Action Detection
on Charades
no code implementations • CVPR 2021 • Yiding Yang, Zhou Ren, Haoxiang Li, Chunluan Zhou, Xinchao Wang, Gang Hua
In this paper, we propose a novel online approach to learning the pose dynamics, which are independent of pose detections in current fame, and hence may serve as a robust estimation even in challenging scenarios including occlusion.
Multi-Person Pose Estimation
Multi-Person Pose Estimation and Tracking
+1
no code implementations • 1 May 2021 • Bo Liu, Haoxiang Li, Hao Kang, Gang Hua, Nuno Vasconcelos
It is shown that, unlike class-balanced sampling, this is an adversarial augmentation strategy.
no code implementations • ICCV 2021 • Bo Liu, Haoxiang Li, Hao Kang, Gang Hua, Nuno Vasconcelos
A new learning algorithm is then proposed for GeometrIc Structure Transfer (GIST), with resort to a combination of loss functions that combine class-balanced and random sampling to guarantee that, while overfitting to the popular classes is restricted to geometric parameters, it is leveraged to transfer class geometry from popular to few-shot classes.
no code implementations • 1 May 2021 • Bo Liu, Haoxiang Li, Hao Kang, Nuno Vasconcelos, Gang Hua
A consistency loss has been introduced to limit the impact from unlabeled data while leveraging them to update the feature embedding.
no code implementations • 24 Mar 2021 • Wei Wei, Li Guan, Yue Liu, Hao Kang, Haoxiang Li, Ying Wu, Gang Hua
By the proposed physical regularization, our method can generate HDRs which are not only visually appealing but also physically plausible.
1 code implementation • CVPR 2020 • Bo Liu, Hao Kang, Haoxiang Li, Gang Hua, Nuno Vasconcelos
It is argued that the classic softmax classifier is a poor solution for open-set recognition, since it tends to overfit on the training classes.
2 code implementations • 17 Nov 2019 • Haichao Yu, Haoxiang Li, Honghui Shi, Thomas S. Huang, Gang Hua
When all layers are set to low-bits, we show that the model achieved accuracy comparable to dedicated models trained at the same precision.
no code implementations • 19 Jan 2019 • Yinan Zhang, Devin Balkcom, Haoxiang Li
A weighted average of the supervisor and learned policies is used during trials, with a heavier weight initially on the supervisor, in order to allow safe and useful physical trials while the learned policy is still ineffective.
no code implementations • ECCV 2018 • Rameswar Panda, Jianming Zhang, Haoxiang Li, Joon-Young Lee, Xin Lu, Amit K. Roy-Chowdhury
While machine learning approaches to visual emotion recognition offer great promise, current methods consider training and testing models on small scale datasets covering limited visual emotion concepts.
no code implementations • 30 Jul 2018 • Xin Ye, Zhe Lin, Haoxiang Li, Shibin Zheng, Yezhou Yang
We study the problem of learning a navigation policy for a robot to actively search for an object of interest in an indoor environment solely from its visual inputs.
no code implementations • CVPR 2018 • Muhammad Abdullah Jamal, Haoxiang Li, Boqing Gong
Arguably, no single face detector fits all real-life scenarios.
1 code implementation • ICCV 2019 • Bangjie Yin, Luan Tran, Haoxiang Li, Xiaohui Shen, Xiaoming Liu
Deep CNNs have been pushing the frontier of visual recognition over past years.
1 code implementation • 24 Feb 2018 • Kaichun Mo, Haoxiang Li, Zhe Lin, Joon-Young Lee
Synthetic data suffers from domain gap to the real-world scenes while visual inputs rendered from 3D reconstructed scenes have undesired holes and artifacts.
Robotics
no code implementations • ICCV 2017 • Ronald Yu, Shunsuke Saito, Haoxiang Li, Duygu Ceylan, Hao Li
To train such a network, we generate a massive dataset of synthetic faces with dense labels using renderings of a morphable face model with variations in pose, expressions, lighting, and occlusions.
1 code implementation • ICCV 2017 • Chuang Gan, Yandong Li, Haoxiang Li, Chen Sun, Boqing Gong
Many seemingly distant annotations (e. g., semantic segmentation and visual question answering (VQA)) are inherently connected in that they reveal different levels and perspectives of human understandings about the same visual scenes --- and even the same set of images (e. g., of COCO).
no code implementations • CVPR 2016 • Haoxiang Li, Jonathan Brandt, Zhe Lin, Xiaohui Shen, Gang Hua
Our new framework enables efficient use of these complementary multi-level contextual cues to improve overall recognition rates on the photo album person recognition task, as demonstrated through state-of-the-art results on a challenging public dataset.
no code implementations • CVPR 2015 • Haoxiang Li, Zhe Lin, Xiaohui Shen, Jonathan Brandt, Gang Hua
To improve localization effectiveness, and reduce the number of candidates at later stages, we introduce a CNN-based calibration stage after each of the detection stages in the cascade.
no code implementations • CVPR 2015 • Haoxiang Li, Gang Hua
We apply the PEP model hierarchically to decompose a face image into face parts at different levels of details to build pose-invariant part-based face representations.
no code implementations • CVPR 2014 • Haoxiang Li, Zhe Lin, Jonathan Brandt, Xiaohui Shen, Gang Hua
Despite the fact that face detection has been studied intensively over the past several decades, the problem is still not completely solved.
no code implementations • CVPR 2013 • Haoxiang Li, Gang Hua, Zhe Lin, Jonathan Brandt, Jianchao Yang
By augmenting each feature with its location, a Gaussian mixture model (GMM) is trained to capture the spatialappearance distribution of all face images in the training corpus.