Detecting Twenty-thousand Classes using Image-level Supervision

1 code implementation 7 Jan 2022 Xingyi Zhou, Rohit Girdhar, Armand Joulin, Philipp Krähenbühl, Ishan Misra

For the first time, we train a detector with all the twenty-one-thousand classes of the ImageNet dataset and show that it generalizes to new datasets without fine-tuning.

Image Classification

Multimodal Virtual Point 3D Detection

1 code implementation NeurIPS 2021 Tianwei Yin, Xingyi Zhou, Philipp Krähenbühl

For autonomous driving, this means that large objects close to the sensors are easily visible, but far-away or small objects comprise only one or two measurements.

3D Object Detection Autonomous Driving

Probabilistic two-stage detection

2 code implementations 12 Mar 2021 Xingyi Zhou, Vladlen Koltun, Philipp Krähenbühl

We develop a probabilistic interpretation of two-stage object detection.

Object Detection Region Proposal

Learning a unified label space

no code implementations 1 Jan 2021 Xingyi Zhou, Vladlen Koltun, Philipp Krähenbühl

These labels span many diverse datasets with potentially inconsistent semantic labels.

Instance Segmentation Object Detection +1

Tracking Objects as Points

5 code implementations ECCV 2020 Xingyi Zhou, Vladlen Koltun, Philipp Krähenbühl

Nowadays, tracking is dominated by pipelines that perform object detection followed by temporal association, also known as tracking-by-detection.

Multi-Object Tracking Multiple Object Tracking +1

Objects as Points

71 code implementations 16 Apr 2019 Xingyi Zhou, Dequan Wang, Philipp Krähenbühl

We model an object as a single point: the center point of its bounding box.

Keypoint Detection Real-Time Object Detection

StarMap for Category-Agnostic Keypoint and Viewpoint Estimation

1 code implementation ECCV 2018 Xingyi Zhou, Arjun Karpur, Linjie Luo, Qi-Xing Huang

Existing methods define semantic keypoints separately for each category with a fixed number of semantic labels in fixed indices.

Keypoint Detection Viewpoint Estimation

Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency

1 code implementation ECCV 2018 Xingyi Zhou, Arjun Karpur, Chuang Gan, Linjie Luo, Qi-Xing Huang

In this paper, we introduce a novel unsupervised domain adaptation technique for the task of 3D keypoint prediction from a single depth scan or image.

Unsupervised Domain Adaptation

Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach

5 code implementations ICCV 2017 Xingyi Zhou, Qi-Xing Huang, Xiao Sun, Xiangyang Xue, Yichen Wei

We propose a weakly-supervised transfer learning method that uses mixed 2D and 3D labels in a unified deep neural network with a two-stage cascaded structure.

Monocular 3D Human Pose Estimation Pose Prediction +1

Deep Kinematic Pose Regression

no code implementations 17 Sep 2016 Xingyi Zhou, Xiao Sun, Wei Zhang, Shuang Liang, Yichen Wei

In this work, we propose to directly embed a kinematic object model into deep neural network learning for general articulated object pose estimation.

3D Human Pose Estimation

Model-based Deep Hand Pose Estimation

1 code implementation 22 Jun 2016 Xingyi Zhou, Qingfu Wan, Wei Zhang, Xiangyang Xue, Yichen Wei

For the first time, we show that embedding such a non-linear generative process in deep learning is feasible for hand pose estimation.

Hand Pose Estimation

