no code implementations • 26 Feb 2023 • Linghao Chen, Yunzhou Song, Hujun Bao, Xiaowei Zhou
We present a novel approach to interactive 3D object perception for robots.
no code implementations • 23 Feb 2023 • Chen Geng, Sida Peng, Zhen Xu, Hujun Bao, Xiaowei Zhou
In this paper, we propose a novel method for learning neural volumetric videos of dynamic humans from sparse view videos in minutes with competitive visual quality.
no code implementations • 14 Feb 2023 • Shangzhan Zhang, Sida Peng, Tianrun Chen, Linzhan Mou, Haotong Lin, Kaicheng Yu, Yiyi Liao, Xiaowei Zhou
We introduce a novel approach that takes a single semantic mask as input to synthesize multi-view consistent color images of natural scenes, trained with a collection of single images from the Internet.
no code implementations • 18 Jan 2023 • Xingyi He, Jiaming Sun, Yuang Wang, Di Huang, Hujun Bao, Xiaowei Zhou
We propose a new method for object pose estimation without CAD models.
no code implementations • 31 Dec 2022 • Di Huang, Sida Peng, Tong He, Xiaowei Zhou, Wanli Ouyang
We propose a novel approach to self-supervised learning of point cloud representations by differentiable neural rendering.
no code implementations • 30 Nov 2022 • Di Huang, Xiaopeng Ji, Xingyi He, Jiaming Sun, Tong He, Qing Shuai, Wanli Ouyang, Xiaowei Zhou
The key idea is that the hand motion naturally provides multiple views of the object and the motion can be reliably estimated by a hand pose tracker.
no code implementations • SIGGRAPH 2022 • Zhize Zhou, Qing Shuai, Yize Wang, Qi Fang, Xiaopeng Ji, Fashuai Li, Hujun Bao, Xiaowei Zhou
The key challenge of this problem is to efficiently match 2D observations across multiple views.
Ranked #2 on
3D Multi-Person Pose Estimation
on Shelf
3D Multi-Person Pose Estimation
Multi-Person Pose Estimation
1 code implementation • CVPR 2022 • Yiming Xie, Matheus Gadelha, Fengting Yang, Xiaowei Zhou, Huaizu Jiang
We present PlanarRecon -- a novel framework for globally coherent detection and reconstruction of 3D planes from a posed monocular video.
1 code implementation • 8 Jun 2022 • Xiaowei Zhou, Ivor W. Tsang, Jie Yin
To achieve a better trade-off between standard accuracy and adversarial robustness, we propose a novel adversarial training framework called LAtent bounDary-guided aDvErsarial tRaining (LADDER) that adversarially trains DNN models on latent boundary-guided adversarial examples.
no code implementations • 25 May 2022 • Jiaming Sun, Xi Chen, Qianqian Wang, Zhengqi Li, Hadar Averbuch-Elor, Xiaowei Zhou, Noah Snavely
We are witnessing an explosion of neural implicit representations in computer vision and graphics.
1 code implementation • CVPR 2022 • Jiaming Sun, ZiHao Wang, Siyu Zhang, Xingyi He, Hongcheng Zhao, Guofeng Zhang, Xiaowei Zhou
We propose a new method named OnePose for object pose estimation.
no code implementations • CVPR 2022 • Jian Zhang, Yuanqing Zhang, Huan Fu, Xiaowei Zhou, Bowen Cai, Jinchi Huang, Rongfei Jia, Binqiang Zhao, Xing Tang
Neural Radiance Fields (NeRF) have emerged as a potent paradigm for representing scenes and synthesizing photo-realistic images.
1 code implementation • CVPR 2022 • Haoyu Guo, Sida Peng, Haotong Lin, Qianqian Wang, Guofeng Zhang, Hujun Bao, Xiaowei Zhou
Based on the Manhattan-world assumption, planar constraints are employed to regularize the geometry in floor and wall regions predicted by a 2D semantic segmentation network.
no code implementations • CVPR 2022 • Yuanqing Zhang, Jiaming Sun, Xingyi He, Huan Fu, Rongfei Jia, Xiaowei Zhou
The key insight is that indirect illumination can be conveniently derived from the neural radiance field learned from input images instead of being estimated jointly with direct illumination and materials.
1 code implementation • 12 Apr 2022 • Karl Schmeckpeper, Philip R. Osteen, Yufu Wang, Georgios Pavlakos, Kenneth Chaney, Wyatt Jordan, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis
Empirically, we show that our approach can accurately recover the 6-DoF object pose for both instance- and class-based scenarios even against a cluttered background.
no code implementations • 29 Mar 2022 • Xiao Fu, Shangzhan Zhang, Tianrun Chen, Yichong Lu, Lanyun Zhu, Xiaowei Zhou, Andreas Geiger, Yiyi Liao
In this work, we present a novel 3D-to-2D label transfer method, Panoptic NeRF, which aims for obtaining per-pixel 2D semantic and instance labels from easy-to-obtain coarse 3D bounding primitives.
1 code implementation • CVPR 2022 • Xian Liu, Qianyi Wu, Hang Zhou, Yinghao Xu, Rui Qian, Xinyi Lin, Xiaowei Zhou, Wayne Wu, Bo Dai, Bolei Zhou
To enhance the quality of synthesized gestures, we develop a contrastive learning strategy based on audio-text alignment for better audio representations.
Ranked #2 on
Gesture Generation
on TED Gesture Dataset
1 code implementation • 15 Mar 2022 • Sida Peng, Zhen Xu, Junting Dong, Qianqian Wang, Shangzhan Zhang, Qing Shuai, Hujun Bao, Xiaowei Zhou
Some recent works have proposed to decompose a non-rigidly deforming scene into a canonical neural radiance field and a set of deformation fields that map observation-space points to the canonical space, thereby enabling them to learn the dynamic scene from images.
1 code implementation • 13 Feb 2022 • Xian Liu, Rui Qian, Hang Zhou, Di Hu, Weiyao Lin, Ziwei Liu, Bolei Zhou, Xiaowei Zhou
Specifically, we observe that the previous practice of learning only a single audio representation is insufficient due to the additive nature of audio signals.
no code implementations • 2 Dec 2021 • Haotong Lin, Sida Peng, Zhen Xu, Yunzhi Yan, Qing Shuai, Hujun Bao, Xiaowei Zhou
We propose a novel scene representation, called ENeRF, for the fast creation of interactive free-viewpoint videos.
no code implementations • 24 Sep 2021 • Xiaowei Zhou, Jie Yin, Ivor W. Tsang
Graph neural networks have emerged as a powerful model for graph representation learning to undertake graph-level prediction tasks.
1 code implementation • CVPR 2022 • YuAn Liu, Sida Peng, Lingjie Liu, Qianqian Wang, Peng Wang, Christian Theobalt, Xiaowei Zhou, Wenping Wang
On such a 3D point, these generalization methods will include inconsistent image features from invisible views, which interfere with the radiance field construction.
1 code implementation • CVPR 2021 • Zhaoyang Huang, Han Zhou, Yijin Li, Bangbang Yang, Yan Xu, Xiaowei Zhou, Hujun Bao, Guofeng Zhang, Hongsheng Li
To address this problem, we propose a novel visual localization framework that establishes 2D-to-3D correspondences between the query image and the 3D map with a series of learnable scene-specific landmarks.
1 code implementation • ICCV 2021 • Sida Peng, Junting Dong, Qianqian Wang, Shangzhan Zhang, Qing Shuai, Xiaowei Zhou, Hujun Bao
Moreover, the learned blend weight fields can be combined with input skeletal motions to generate new deformation fields to animate the human model.
1 code implementation • CVPR 2021 • Qi Fang, Qing Shuai, Junting Dong, Hujun Bao, Xiaowei Zhou
In this paper, we introduce the new task of reconstructing 3D human pose from a single image in which we can see the person and the person's image through a mirror.
3 code implementations • CVPR 2021 • Jiaming Sun, Zehong Shen, Yuang Wang, Hujun Bao, Xiaowei Zhou
We present a novel method for local image feature matching.
Ranked #1 on
Image Matching
on IMC PhotoTourism
(using extra training data)
2 code implementations • CVPR 2021 • Jiaming Sun, Yiming Xie, Linghao Chen, Xiaowei Zhou, Hujun Bao
We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from a monocular video.
no code implementations • 9 Mar 2021 • Yaxin Shi, Xiaowei Zhou, Ping Liu, Ivor Tsang
To benefit the generalization ability of the translation model, we propose transition encoding to facilitate explicit regularization of these two {kinds} of consistencies on unseen transitions.
no code implementations • 5 Mar 2021 • Xiaowei Zhou, Jie Yin, Ivor Tsang, Chen Wang
The widespread use of deep neural networks has achieved substantial success in many tasks.
no code implementations • ICCV 2021 • Jiaming Sun, Yiming Xie, Siyu Zhang, Linghao Chen, Guofeng Zhang, Hujun Bao, Xiaowei Zhou
In this work, we propose a novel system for integrated 3D object detection and tracking, which uses a dynamic object occupancy map and previous object states as spatial-temporal memory to assist object detection in future frames.
3 code implementations • CVPR 2021 • Sida Peng, Yuanqing Zhang, Yinghao Xu, Qianqian Wang, Qing Shuai, Hujun Bao, Xiaowei Zhou
To this end, we propose Neural Body, a new human body representation which assumes that the learned neural representations at different frames share the same set of latent codes anchored to a deformable mesh, so that the observations across frames can be naturally integrated.
no code implementations • 14 Dec 2020 • Jiafa He, Chengwei Pan, Can Yang, Ming Zhang, Yang Wang, Xiaowei Zhou, Yizhou Yu
The main idea is to use CNNs to learn local appearances of vessels in image crops while using another point-cloud network to learn the global geometry of vessels in the entire image.
no code implementations • ECCV 2020 • Jianan Zhen, Qi Fang, Jiaming Sun, Wentao Liu, Wei Jiang, Hujun Bao, Xiaowei Zhou
Recovering multi-person 3D poses with absolute scales from a single RGB image is a challenging problem due to the inherent depth and scale ambiguity from a single view.
3D Depth Estimation
3D Multi-Person Pose Estimation (absolute)
+3
2 code implementations • ECCV 2020 • Junting Dong, Qing Shuai, Yuanqing Zhang, Xian Liu, Xiaowei Zhou, Hujun Bao
Therefore, we propose to capture human motion by jointly analyzing these Internet videos instead of using single videos separately.
1 code implementation • CVPR 2020 • Wen Jiang, Nikos Kolotouros, Georgios Pavlakos, Xiaowei Zhou, Kostas Daniilidis
Our goal is to train a single network that learns to avoid these problems and generate a coherent 3D reconstruction of all the humans in the scene.
Ranked #1 on
3D Human Reconstruction
on AGORA
1 code implementation • ECCV 2020 • Qianqian Wang, Xiaowei Zhou, Bharath Hariharan, Noah Snavely
Recent research on learned visual descriptors has shown promising improvements in correspondence estimation, a key component of many 3D vision tasks.
1 code implementation • CVPR 2020 • Jiaming Sun, Linghao Chen, Yiming Xie, Siyu Zhang, Qinhong Jiang, Xiaowei Zhou, Hujun Bao
In this paper, we propose a novel system named Disp R-CNN for 3D object detection from stereo images.
3D Object Detection From Stereo Images
Disparity Estimation
+2
no code implementations • 24 Mar 2020 • Min Wang, Feng Qiu, Wentao Liu, Chen Qian, Xiaowei Zhou, Lizhuang Ma
In this paper, we introduce body part segmentation as critical supervision.
Ranked #66 on
3D Human Pose Estimation
on Human3.6M
(PA-MPJPE metric)
1 code implementation • CVPR 2020 • Sida Peng, Wen Jiang, Huaijin Pi, Xiuli Li, Hujun Bao, Xiaowei Zhou
Based on deep snake, we develop a two-stage pipeline for instance segmentation: initial contour proposal and contour deformation, which can handle errors in object localization.
Ranked #2 on
Semantic Contour Prediction
on Sbd val
1 code implementation • NeurIPS 2019 • Yuan Liu, Zehong Shen, Zhixuan Lin, Sida Peng, Hujun Bao, Xiaowei Zhou
Instead of feature pooling, we use group convolutions to exploit underlying structures of the extracted features on the group, resulting in descriptors that are both discriminative and provably invariant to the group of transformations.
no code implementations • 16 Jul 2019 • Xiaowei Zhou, Ivor W. Tsang, Jie Yin
The proposed LAD method improves the robustness of a DNN model through adversarial training on generated adversarial examples.
1 code implementation • CVPR 2019 • Xiangru Huang, Zhenxiao Liang, Xiaowei Zhou, Yao Xie, Leonidas Guibas, Qi-Xing Huang
Our approach alternates between transformation synchronization using weighted relative transformations and predicting new weights of the input relative transformations using a neural network.
2 code implementations • CVPR 2019 • Junting Dong, Wen Jiang, Qi-Xing Huang, Hujun Bao, Xiaowei Zhou
This paper addresses the problem of 3D pose estimation for multiple people in a few calibrated camera views.
Ranked #11 on
3D Multi-Person Pose Estimation
on Campus
1 code implementation • CVPR 2019 • Zaiwei Zhang, Zhenxiao Liang, Lemeng Wu, Xiaowei Zhou, Qi-Xing Huang
Optimizing a network of maps among a collection of objects/domains (or map synchronization) is a central problem across computer vision and many other relevant fields.
1 code implementation • CVPR 2019 • Zhenpei Yang, Jeffrey Z. Pan, Linjie Luo, Xiaowei Zhou, Kristen Grauman, Qi-Xing Huang
In particular, instead of only performing scene completion from each individual scan, our approach alternates between relative pose estimation and scene completion.
2 code implementations • CVPR 2019 • Sida Peng, Yu-An Liu, Qi-Xing Huang, Hujun Bao, Xiaowei Zhou
We further create a Truncation LINEMOD dataset to validate the robustness of our approach against truncation.
Ranked #2 on
6D Pose Estimation using RGB
on YCB-Video
(Mean AUC metric)
no code implementations • CVPR 2018 • Georgios Pavlakos, Luyang Zhu, Xiaowei Zhou, Kostas Daniilidis
The proposed approach outperforms previous baselines on this task and offers an attractive solution for direct prediction of 3D shape from a single color image.
Ranked #88 on
3D Human Pose Estimation
on Human3.6M
(PA-MPJPE metric)
1 code implementation • CVPR 2018 • Georgios Pavlakos, Xiaowei Zhou, Kostas Daniilidis
This information can be acquired by human annotators for a wide range of images and poses.
Ranked #1 on
Monocular 3D Human Pose Estimation
on Human3.6M
(Use Video Sequence metric)
1 code implementation • 17 Apr 2018 • Xiaowei Zhou, Sikang Liu, Georgios Pavlakos, Vijay Kumar, Kostas Daniilidis
Current motion capture (MoCap) systems generally require markers and multiple calibrated cameras, which can be used only in constrained environments.
1 code implementation • CVPR 2018 • Qianqian Wang, Xiaowei Zhou, Kostas Daniilidis
This work proposes a multi-image matching method to estimate semantic correspondences across multiple images.
no code implementations • ICCV 2017 • Roberto Tron, Xiaowei Zhou, Carlos Esteves, Kostas Daniilidis
We consider the problem of finding consistent matches across multiple images.
1 code implementation • ICLR 2018 • Carlos Esteves, Christine Allen-Blanchette, Xiaowei Zhou, Kostas Daniilidis
The result is a network invariant to translation and equivariant to both rotation and scale.
no code implementations • CVPR 2017 • Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis
In this paper, we present a geometry-driven approach to automatically collect annotations for human pose prediction tasks.
Ranked #26 on
Weakly-supervised 3D Human Pose Estimation
on Human3.6M
1 code implementation • 14 Mar 2017 • Georgios Pavlakos, Xiaowei Zhou, Aaron Chan, Konstantinos G. Derpanis, Kostas Daniilidis
This paper presents a novel approach to estimating the continuous six degree of freedom (6-DoF) pose (3D translation and rotation) of an object from a single RGB image.
Ranked #1 on
Keypoint Detection
on Pascal3D+
1 code implementation • 9 Jan 2017 • Xiaowei Zhou, Menglong Zhu, Georgios Pavlakos, Spyridon Leonardos, Kostantinos G. Derpanis, Kostas Daniilidis
Recovering 3D full-body human pose is a challenging problem with many applications.
2 code implementations • CVPR 2017 • Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis
This paper addresses the challenge of 3D human pose estimation from a single color image.
Ranked #15 on
3D Human Pose Estimation
on HumanEva-I
no code implementations • ICCV 2015 • Menglong Zhu, Xiaowei Zhou, Kostas Daniilidis
We introduce a new approach for estimating a fine grained 3D shape and continuous pose of an object from a single image.
1 code implementation • CVPR 2016 • Xiaowei Zhou, Menglong Zhu, Spyridon Leonardos, Kosta Derpanis, Kostas Daniilidis
Here, two cases are considered: (i) the image locations of the human joints are provided and (ii) the image locations of joints are unknown.
Ranked #31 on
Monocular 3D Human Pose Estimation
on Human3.6M
no code implementations • 14 Sep 2015 • Xiaowei Zhou, Menglong Zhu, Spyridon Leonardos, Kostas Daniilidis
We investigate the problem of estimating the 3D shape of an object defined by a set of 3D landmarks, given their 2D correspondences in a single image.
Ranked #95 on
3D Human Pose Estimation
on Human3.6M
(PA-MPJPE metric)
no code implementations • ICCV 2015 • Xiaowei Zhou, Menglong Zhu, Kostas Daniilidis
In this paper we propose a global optimization-based approach to jointly matching a set of images.
no code implementations • 1 Feb 2015 • Menglong Zhu, Xiaowei Zhou, Kostas Daniilidis
We introduce a new approach for estimating the 3D pose and the 3D shape of an object from a single image.
no code implementations • CVPR 2015 • Xiaowei Zhou, Spyridon Leonardos, Xiaoyan Hu, Kostas Daniilidis
We investigate the problem of estimating the 3D shape of an object, given a set of 2D landmarks in a single image.
no code implementations • 15 Jan 2014 • Xiaowei Zhou, Can Yang, Hongyu Zhao, Weichuan Yu
In this paper, we review the recent advance of low-rank modeling, the state-of-the-art algorithms, and related applications in image analysis.
no code implementations • CVPR 2013 • Xiaowei Zhou, Xiaojie Huang, James S. Duncan, Weichuan Yu
In this paper, we propose to use the group similarity of object shapes in multiple images as a prior to aid segmentation, which can be interpreted as an unsupervised approach of shape prior modeling.
no code implementations • 5 Sep 2011 • Xiaowei Zhou, Can Yang, Weichuan Yu
To automate the analysis, object detection without a separate training phase becomes a critical task.