no code implementations • ECCV 2020 • Xin Xiong, Haipeng Xiong, Ke Xian, Chen Zhao, Zhiguo Cao, Xin Li
Depth completion is a widely studied problem of predicting a dense depth map from a sparse set of measurements and a single RGB image.
no code implementations • 2 Mar 2025 • Liao Shen, Tianqi Liu, Huiqiang Sun, Jiaqi Li, Zhiguo Cao, Wei Li, Chen Change Loy
We also introduce a synthetic dataset to assess refocusing capabilities and the model's ability to learn precise lens parameters.
no code implementations • 23 Jan 2025 • Xianrui Luo, Juewen Peng, Zhongang Cai, Lei Yang, Fan Yang, Zhiguo Cao, Guosheng Lin
Existing methods either (1) assume sharp image inputs, failing to address the detail loss introduced by motion blur, or (2) mainly consider blur by camera movements, neglecting the human motion blur which is more common in animatable avatars.
1 code implementation • 26 Sep 2024 • Jiaqi Li, Yiran Wang, Jinghong Zheng, Zihao Huang, Ke Xian, Zhiguo Cao, Jianming Zhang
Analyzing the fundamental reasons for these limitations, we model depth refinement as a noisy Poisson fusion problem with local inconsistency and edge deformation noises.
no code implementations • 15 Sep 2024 • Liao Shen, Tianqi Liu, Huiqiang Sun, Xinyi Ye, Baopu Li, Jianming Zhang, Zhiguo Cao
Due to the large motion, the intermediate semantic information may be absent in input images.
1 code implementation • 20 Aug 2024 • Wenze Liu, Zixuan Ye, Hao Lu, Zhiguo Cao, Xiangyu Yue
Inspired by the nonlocal principle in traditional image matting, we build a directional distance consistency loss (DDC loss) at each pixel neighborhood to constrain the alpha values conditioned on the input image.
no code implementations • 12 Aug 2024 • JunRui Zhang, Jiaqi Li, Yachuan Huang, Yiran Wang, Jinghong Zheng, Liao Shen, Zhiguo Cao
In the field of monocular depth estimation (MDE), many models with excellent zero-shot performance in general scenes have emerged recently.
1 code implementation • 3 Aug 2024 • Xingyi Li, Yizheng Wu, Jun Cen, Juewen Peng, Kewei Wang, Ke Xian, Zhe Wang, Zhiguo Cao, Guosheng Lin
To this end, a 3D creator interface has been developed to provide users with fine-grained control over the creation process.
1 code implementation • 18 Jul 2024 • Jiahao Cui, Wei Jiang, Zhan Peng, Zhiyu Pan, Zhiguo Cao
Combining the interpolated and given LDR frames, the complete set of exposure information is available at each time stamp.
1 code implementation • 18 Jul 2024 • Hao Lu, Wenze Liu, Hongtao Fu, Zhiguo Cao
The goal of this work is to develop a task-agnostic feature upsampling operator for dense prediction where the operator is required to facilitate not only region-sensitive tasks like semantic segmentation but also detail-sensitive tasks such as image matting.
no code implementations • 8 Jul 2024 • Xianrui Luo, Huiqiang Sun, Juewen Peng, Zhiguo Cao
We introduce layered Depth-of-Field (DoF) volume rendering to model the defocus blur and reconstruct a sharp NeRF supervised by defocused views.
no code implementations • 2 Jul 2024 • Zhiyu Pan, Kewei Wang, Yizheng Wu, Liwen Xiao, Jiahao Cui, Zhicheng Wang, Zhiguo Cao
This idea can be implemented in a pseudo-labeling way: producing pseudo labels for unlabeled data by a teacher model and training a student model with these pseudo labels.
1 code implementation • 2 Jul 2024 • Zihao Huang, Shoukang Hu, Guangcong Wang, Tianqi Liu, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei Liu
However, their annotation requirements are impractical for real-world images or videos, which makes it challenging to apply current avatar creation methods in real-world settings.
1 code implementation • 24 Jun 2024 • Yizheng Wu, Zhiyu Pan, Kewei Wang, Xingyi Li, Jiahao Cui, Liwen Xiao, Guosheng Lin, Zhiguo Cao
To leverage unlabeled data, previous semi-supervised 3D instance segmentation approaches have explored self-training frameworks, which rely on high-quality pseudo labels for consistency regularization.
1 code implementation • 20 May 2024 • Tianqi Liu, Guangcong Wang, Shoukang Hu, Liao Shen, Xinyi Ye, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei Liu
We present MVSGaussian, a new generalizable 3D Gaussian representation approach derived from Multi-View Stereo (MVS) that can efficiently reconstruct unseen scenes.
1 code implementation • CVPR 2024 • Tianqi Liu, Xinyi Ye, Min Shi, Zihao Huang, Zhiyu Pan, Zhan Peng, Zhiguo Cao
We incorporate the above ACA, SVA, and CAF into a coarse-to-fine framework, termed Geometry-aware Reconstruction and Fusion-refined Rendering (GeFu).
no code implementations • CVPR 2024 • Zhan Peng, Xinyi Ye, Weiyue Zhao, Tianqi Liu, Huiqiang Sun, Baopu Li, Zhiguo Cao
In this paper, we present RStab, a novel framework for video stabilization that integrates 3D multi-frame fusion through volume rendering.
1 code implementation • CVPR 2024 • He Guo, Zixuan Ye, Zhiguo Cao, Hao Lu
We introduce in-context matting, a novel task setting of image matting.
1 code implementation • CVPR 2024 • Kewei Wang, Yizheng Wu, Jun Cen, Zhiyu Pan, Xingyi Li, Zhe Wang, Zhiguo Cao, Guosheng Lin
To this end, we explore the feasibility of self-supervised motion prediction with only unlabeled LiDAR point clouds.
no code implementations • CVPR 2024 • Huiqiang Sun, Xingyi Li, Liao Shen, Xinyi Ye, Ke Xian, Zhiguo Cao
Experimental results on our dataset demonstrate that our method outperforms existing approaches in generating sharp novel views from motion-blurred inputs while maintaining spatial-temporal consistency of the scene.
no code implementations • 15 Mar 2024 • Tingbing Yan, Wenzheng Zeng, Yang Xiao, Xingyu Tong, Bo Tan, Zhiwen Fang, Zhiguo Cao, Joey Tianyi Zhou
Most existing one-shot skeleton-based action recognition methods focus on raw low-level information (e.g., joint location) and may suffer from local information loss and low generalization ability.
no code implementations • CVPR 2024 • Xingyi Li, Zhiguo Cao, Yizheng Wu, Kewei Wang, Ke Xian, Zhe Wang, Guosheng Lin
To address this limitation, we present S-DyRF, a reference-based spatio-temporal stylization method for dynamic neural radiance fields.
1 code implementation • CVPR 2024 • Zixuan Ye, Wenze Liu, He Guo, Yujia Liang, Chaoyi Hong, Hao Lu, Zhiguo Cao
Therefore we wonder whether we can alleviate the limitations of both settings while achieving unification to facilitate more convenient use.
1 code implementation • 13 Dec 2023 • Kewei Wang, Yizheng Wu, Zhiyu Pan, Xingyi Li, Ke Xian, Zhe Wang, Zhiguo Cao, Guosheng Lin
To improve the quality of pseudo labels, we propose a novel motion selection and re-generation module.
1 code implementation • 27 Oct 2023 • Yiran Guan, Zhuoguang Chen, Wenzheng Zeng, Zhiguo Cao, Yang Xiao
In this letter, we propose a new method, Multi-Clue Gaze (MCGaze), to facilitate video gaze estimation by capturing the spatial-temporal interaction context among head, face, and eye in an end-to-end learning manner, which has not yet been well studied.
Ranked #1 on Gaze Estimation on Gaze360
1 code implementation • ICCV 2023 • Tianqi Liu, Xinyi Ye, Weiyue Zhao, Zhiyu Pan, Min Shi, Zhiguo Cao
This constraint reduces the 2D search space into the epipolar line in stereo matching.
Ranked #4 on 3D Reconstruction on DTU
1 code implementation • ICCV 2023 • Wenze Liu, Hao Lu, Hongtao Fu, Zhiguo Cao
We present DySample, an ultra-lightweight and effective dynamic upsampler.
1 code implementation • ICCV 2023 • Chengxin Liu, Hao Lu, Zhiguo Cao, Tongliang Liu
Such a querying process yields an intuitive, universal modeling of crowd as both the input and output are interpretable and steerable.
no code implementations • 20 Aug 2023 • Liao Shen, Xingyi Li, Huiqiang Sun, Juewen Peng, Ke Xian, Zhiguo Cao, Guosheng Lin
To animate the visual content, the feature point cloud is displaced based on the scene flow derived from motion estimation and the corresponding camera pose.
no code implementations • 4 Aug 2023 • Jiaqi Li, Yiran Wang, Zihao Huang, Jinghong Zheng, Ke Xian, Zhiguo Cao, Jianming Zhang
We leverage the structural characteristics of diffusion models to enforce depth structures of depth models in a plug-and-play manner.
1 code implementation • 3 Aug 2023 • Weiyun Wang, Min Shi, Qingyun Li, Wenhai Wang, Zhenhang Huang, Linjie Xing, Zhe Chen, Hao Li, Xizhou Zhu, Zhiguo Cao, Yushi Chen, Tong Lu, Jifeng Dai, Yu Qiao
We present the All-Seeing (AS) project: a large-scale data and model for recognizing and understanding everything in the open world.
1 code implementation • ICCV 2023 • Weiyue Zhao, Xin Li, Zhan Peng, Xianrui Luo, Xinyi Ye, Hao Lu, Zhiguo Cao
Video stabilization refers to the problem of transforming a shaky video into a visually pleasing one.
1 code implementation • ICCV 2023 • Xinyi Ye, Weiyue Zhao, Tianqi Liu, Zihao Huang, Zhiguo Cao, Xin Li
Learning-based multi-view stereo (MVS) methods deal with predicting accurate depth maps to achieve an accurate and complete 3D representation.
1 code implementation • 17 Jul 2023 • Wenze Liu, Hao Lu, Yuliang Liu, Zhiguo Cao
In DAB-DETR, such queries are modulated by the so-called conditional linear projection at each decoder stage, aiming to search for positions of interest such as the four extremities of the box.
2 code implementations • 17 Jul 2023 • Wenze Liu, Hao Lu, Yuliang Liu, Zhiguo Cao
We introduce the notion of point affiliation into feature upsampling.
2 code implementations • ICCV 2023 • Yiran Wang, Min Shi, Jiaqi Li, Chaoyi Hong, Zihao Huang, Juewen Peng, Zhiguo Cao, Jianming Zhang, Ke Xian, Guosheng Lin
Our work serves as a solid baseline and data foundation for learning-based video depth estimation.
Ranked #21 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)
no code implementations • 7 Jun 2023 • Xianrui Luo, Juewen Peng, Ke Xian, Zijin Wu, Zhiguo Cao
To this end, we present a Defocus to Focus (D2F) framework to learn realistic bokeh rendering by fusing defocus priors with the all-in-focus image and by implementing radiance priors in layered fusion.
no code implementations • 7 Jun 2023 • Weiyue Zhao, Hao Lu, Xinyi Ye, Zhiguo Cao, Xin Li
We introduce Probabilistic Coordinate Fields (PCFs), a novel geometric-invariant coordinate representation for image correspondence problems.
no code implementations • 5 Jun 2023 • Weiyue Zhao, Hao Lu, Zhiguo Cao, Xin Li
This approach offers a new perspective to alleviate the problem of repeated patterns and emphasizes the importance of choosing coordinate representations for feature correspondences.
1 code implementation • CVPRW 2023 • Marcos V. Conde, Manuel Kolmet, Tim Seizinger, Tom E. Bishop, Radu Timofte, Xiangyu Kong, Dafeng Zhang, Jinlong Wu, Fan Wang, Juewen Peng, Zhiyu Pan, Chengxin Liu, Xianrui Luo, Huiqiang Sun, Liao Shen, Zhiguo Cao, Ke Xian, Chaowei Liu, Zigeng Chen, Xingyi Yang, Songhua Liu, Yongcheng Jing, Michael Bi Mi, Xinchao Wang, Zhihao Yang, Wenyi Lian, Siyuan Lai, Haichuan Zhang, Trung Hoang, Amirsaeed Yazdani, Vishal Monga, Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön, Yuxuan Zhao, Baoliang Chen, Yiqing Xu, JiXiang Niu
We present the new Bokeh Effect Transformation Dataset (BETD), and review the proposed solutions for this novel task at the NTIRE 2023 Bokeh Effect Transformation Challenge.
1 code implementation • 8 May 2023 • Zhicheng Wang, Liwen Xiao, Zhiguo Cao, Hao Lu
This task is typically addressed by extracting the features of query image and exemplars respectively and then matching their feature similarity, leading to an extract-then-match paradigm.
Ranked #4 on Object Counting on FSC147
no code implementations • 11 Apr 2023 • Xianrui Luo, Juewen Peng, Weiyue Zhao, Ke Xian, Hao Lu, Zhiguo Cao
Benefiting from the multi-camera module in modern smartphones, we introduce a new task of AIF synthesis from main (wide) and ultra-wide cameras.
1 code implementation • CVPR 2023 • Changlong Jiang, Yang Xiao, Cunlin Wu, Mingyang Zhang, Jinghong Zheng, Zhiguo Cao, Joey Tianyi Zhou
3D interacting hand pose estimation from a single RGB image is a challenging task due to serious self-occlusion and inter-occlusion of the hands, confusingly similar appearance patterns between the two hands, ill-posed joint position mapping from 2D to 3D, etc. To address these, we propose to extend A2J, the state-of-the-art depth-based 3D single hand pose estimation method, to the RGB domain under interacting hand conditions.
Ranked #7 on Hand Pose Estimation on NYU Hands
1 code implementation • CVPR 2023 • Wenzheng Zeng, Yang Xiao, Sicheng Wei, Jinfang Gan, Xintao Zhang, Zhiguo Cao, Zhiwen Fang, Joey Tianyi Zhou
Experiments on MPEblink verify the essential challenges of real-time multi-person eyeblink detection in the wild for untrimmed video.
no code implementations • 28 Mar 2023 • Xinyi Ye, Weiyue Zhao, Hao Lu, Zhiguo Cao
It is challenging because of the disorganized spatial distribution of numerous outliers, especially when putative correspondences are largely dominated by outliers.
no code implementations • CVPR 2023 • Xingyi Li, Zhiguo Cao, Huiqiang Sun, Jianming Zhang, Ke Xian, Guosheng Lin
To animate the scene, we perform motion estimation and lift the 2D motion into the 3D scene flow.
no code implementations • 17 Feb 2023 • Zhiyu Pan, Yinpeng Chen, Jiale Zhang, Hao Lu, Zhiguo Cao, Weicai Zhong
Observing that similar composition patterns tend to be shared by cropping boundaries annotated close to each other, we propose to find the beauty of composition from the rare samples by clustering samples with similar cropping boundary annotations, i.e., similar composition patterns.
1 code implementation • CVPR 2023 • Min Shi, Zihao Huang, Xianzheng Ma, Xiaowei Hu, Zhiguo Cao
To calibrate the inaccurate matching results, we introduce a two-stage framework, where matched keypoints from the first stage are viewed as similarity-aware position proposals.
Ranked #5 on 2D Pose Estimation on MP-100
1 code implementation • 27 Dec 2022 • Zixuan Ye, Yutong Dai, Chaoyi Hong, Zhiguo Cao, Hao Lu
Inspired by this, we introduce a novel composition style that binds the source and combined foregrounds in a definite triplet.
no code implementations • 7 Nov 2022 • Andrey Ignatov, Grigory Malivenko, Radu Timofte, Lukasz Treszczotko, Xin Chang, Piotr Ksiazek, Michal Lopuszynski, Maciej Pioro, Rafal Rudnicki, Maciej Smyl, Yujie Ma, Zhenyu Li, Zehui Chen, Jialei Xu, Xianming Liu, Junjun Jiang, XueChao Shi, Difan Xu, Yanan Li, Xiaotao Wang, Lei Lei, Ziyu Zhang, Yicheng Wang, Zilong Huang, Guozhong Luo, Gang Yu, Bin Fu, Jiaqi Li, Yiran Wang, Zihao Huang, Zhiguo Cao, Marcos V. Conde, Denis Sapozhnikov, Byeong Hyun Lee, Dongwon Park, Seongmin Hong, Joonhee Lee, Seunggyu Lee, Se Young Chun
Various depth estimation models are now widely used on many mobile and IoT devices for image segmentation, bokeh effect rendering, object tracking and many other mobile tasks.
1 code implementation • 29 Sep 2022 • Xingyi Li, Chaoyi Hong, Yiran Wang, Zhiguo Cao, Ke Xian, Guosheng Lin
We study the problem of novel view synthesis of objects from a single image.
2 code implementations • 26 Sep 2022 • Hao Lu, Wenze Liu, Zixuan Ye, Hongtao Fu, Yuliang Liu, Zhiguo Cao
We introduce point affiliation into feature upsampling, a notion that describes the affiliation of each upsampled point to a semantic cluster formed by local decoder feature points with semantic similarity.
Ranked #5 on Feature Upsampling on ImageNet
1 code implementation • 1 Aug 2022 • Zijin Wu, Xingyi Li, Juewen Peng, Hao Lu, Zhiguo Cao, Weicai Zhong
To mitigate this issue, we introduce DoF-NeRF, a novel neural rendering approach that can deal with shallow DoF inputs and can simulate DoF effect.
1 code implementation • 31 Jul 2022 • Yiran Wang, Zhiyu Pan, Xingyi Li, Zhiguo Cao, Ke Xian, Jianming Zhang
Temporal consistency is the key challenge of video depth estimation.
1 code implementation • 31 Jul 2022 • Yinpeng Chen, Zhiyu Pan, Min Shi, Hao Lu, Zhiguo Cao, Weicai Zhong
Generative adversarial networks (GANs) have been trained to be professional artists able to create stunning artworks such as face generation and image style transfer.
1 code implementation • 21 Jul 2022 • Hao Lu, Wenze Liu, Hongtao Fu, Zhiguo Cao
We consider the problem of task-agnostic feature upsampling in dense prediction where an upsampling operator is required to facilitate both region-sensitive tasks like semantic segmentation and detail-sensitive tasks such as image matting.
1 code implementation • 20 Jul 2022 • Chengxin Liu, Kewei Wang, Hao Lu, Zhiguo Cao, Ziming Zhang
Because the crowd-sourced labeling process and the ambiguities of objects may introduce noisy bounding box annotations, object detectors suffer from the degraded training data.
1 code implementation • 18 Jul 2022 • Juewen Peng, Jianming Zhang, Xianrui Luo, Hao Lu, Ke Xian, Zhiguo Cao
Partial occlusion effects are a phenomenon in which blurry objects near a camera appear semi-transparent, resulting in partial visibility of the occluded background.
1 code implementation • 15 Jul 2022 • Yizheng Wu, Min Shi, Shuaiyuan Du, Hao Lu, Zhiguo Cao, Weicai Zhong
The idea of instance kernel is inspired by recent success of dynamic convolutions in 2D/3D instance segmentation.
Ranked #2 on 3D Instance Segmentation on S3DIS (mCov metric)
1 code implementation • CVPR 2022 • Juewen Peng, Zhiguo Cao, Xianrui Luo, Hao Lu, Ke Xian, Jianming Zhang
Based on this formulation, we implement the classical renderer by a scattering-based method and propose a two-stage neural renderer to fix the erroneous areas from the classical renderer.
1 code implementation • IEEE Transactions on Geoscience and Remote Sensing 2022 • Kewei Wang, Shuaiyuan Du, Chengxin Liu, Zhiguo Cao
Motivated by the fact that pixels from targets or backgrounds are correlated to each other, we propose a coarse-to-fine interior attention-aware network (IAANet) for infrared small target detection.
1 code implementation • CVPR 2022 • Min Shi, Hao Lu, Chen Feng, Chengxin Liu, Zhiguo Cao
In this work, we propose a similarity-aware CAC framework that jointly learns representation and similarity metric.
Ranked #4 on Object Counting on CARPK
1 code implementation • CVPR 2021 • Chaoyi Hong, Shuaiyuan Du, Ke Xian, Hao Lu, Zhiguo Cao, Weicai Zhong
To this end, we introduce the concept of the key composition map (KCM) to encode the composition rules.
Ranked #1 on Image Cropping on FLMS
no code implementations • 17 May 2021 • Andrey Ignatov, Grigory Malivenko, David Plowman, Samarth Shukla, Radu Timofte, Ziyu Zhang, Yicheng Wang, Zilong Huang, Guozhong Luo, Gang Yu, Bin Fu, Yiran Wang, Xingyi Li, Min Shi, Ke Xian, Zhiguo Cao, Jin-Hua Du, Pei-Lin Wu, Chao Ge, Jiaoyang Yao, Fangwen Tu, Bo Li, Jung Eun Yoo, Kwanggyoon Seo, Jialei Xu, Zhenyu Li, Xianming Liu, Junjun Jiang, Wei-Chi Chen, Shayan Joya, Huanhuan Fan, Zhaobing Kang, Ang Li, Tianpeng Feng, Yang Liu, Chuannan Sheng, Jian Yin, Fausto T. Benavide
While many solutions have been proposed for this task, they are usually very computationally expensive and thus not applicable to on-device inference.
no code implementations • ICCV 2021 • Zhiyu Pan, Zhiguo Cao, Kewei Wang, Hao Lu, Weicai Zhong
We show that relation modeling between visual elements matters in cropping view recommendation.
no code implementations • 10 Nov 2020 • Jiaqi Yang, Zhiqiang Huang, Siwen Quan, Qian Zhang, Yanning Zhang, Zhiguo Cao
This paper focuses on developing efficient and robust evaluation metrics for RANSAC hypotheses to achieve accurate 3D rigid registration.
no code implementations • 10 Nov 2020 • Andrey Ignatov, Radu Timofte, Ming Qian, Congyu Qiao, Jiamin Lin, Zhenyu Guo, Chenghua Li, Cong Leng, Jian Cheng, Juewen Peng, Xianrui Luo, Ke Xian, Zijin Wu, Zhiguo Cao, Densen Puthussery, Jiji C V, Hrishikesh P S, Melvin Kuriakose, Saikat Dutta, Sourya Dipta Das, Nisarg A. Shah, Kuldeep Purohit, Praveen Kandula, Maitreya Suin, A. N. Rajagopalan, Saagara M B, Minnu A L, Sanjana A R, Praseeda S, Ge Wu, Xueqin Chen, Tengyao Wang, Max Zheng, Hulk Wong, Jay Zou
This paper reviews the second AIM realistic bokeh effect rendering challenge and provides the description of the proposed solutions and results.
no code implementations • 21 Jul 2020 • Jiaqi Yang, Jiahao Chen, Zhiqiang Huang, Siwen Quan, Yanning Zhang, Zhiguo Cao
We present a simple yet effective method for 3D correspondence grouping.
1 code implementation • ECCV 2020 • Liang Liu, Hao Lu, Hongwei Zou, Haipeng Xiong, Zhiguo Cao, Chunhua Shen
Inspired by scale weighing, we propose a novel 'counting scale' termed LibraNet where the count value is analogized by weight.
1 code implementation • 11 Jul 2020 • Fu Xiong, Yang Xiao, Zhiguo Cao, Yancheng Wang, Joey Tianyi Zhou, Jianxi Wu
By embedding RMML into the proposed ECML mechanism, our metric learning paradigm (EC-RMML) can run in a one-pass learning manner.
2 code implementations • CVPR 2020 • Haozhe Qi, Chen Feng, Zhiguo Cao, Feng Zhao, Yang Xiao
Specifically, we first sample seeds from the point clouds in template and search area respectively.
1 code implementation • CVPR 2020 • Yancheng Wang, Yang Xiao, Fu Xiong, Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan
Each available 3DV voxel intrinsically involves 3D spatial and motion features jointly.
no code implementations • ECCV 2020 • Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, Mingxiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou, Sijia Mei, Yun-hui Liu, Adrian Spurr, Umar Iqbal, Pavlo Molchanov, Philippe Weinzaepfel, Romain Brégier, Grégory Rogez, Vincent Lepetit, Tae-Kyun Kim
To address these issues, we designed a public challenge (HANDS'19) to evaluate the abilities of current 3D hand pose estimators (HPEs) to interpolate and extrapolate the poses of a training set.
no code implementations • 22 Jan 2020 • Angfan Zhu, Jiaqi Yang, Weiyue Zhao, Zhiguo Cao
The local reference frame (LRF) plays a critical role in 3D local shape description and matching.
3 code implementations • 7 Jan 2020 • Haipeng Xiong, Hao Lu, Chengxin Liu, Liang Liu, Chunhua Shen, Zhiguo Cao
Visual counting, a task that aims to estimate the number of objects from an image/video, is an open-set problem by nature, i.e., the count can in theory vary in [0, inf).
1 code implementation • 1 Nov 2019 • Chen Zhao, Jiaqi Yang, Xin Xiong, Angfan Zhu, Zhiguo Cao, Xin Li
To the best of our knowledge, this work is the first principled approach toward adaptively combining global and local information under the context of RI point cloud analysis.
no code implementations • 3 Sep 2019 • Chen Zhao, Jiaqi Yang, Ke Xian, Zhiguo Cao, Xin Li
Matching corresponding features between two images is a fundamental task in computer vision with numerous applications in object recognition, robotics, and 3D reconstruction.
2 code implementations • ICCV 2019 • Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan
For the task of 3D hand and body pose estimation from a depth image, a novel anchor-based approach termed Anchor-to-Joint regression network (A2J), with end-to-end learning ability, is proposed.
Ranked #1 on Hand Pose Estimation on K2HPD
5 code implementations • ICCV 2019 • Haipeng Xiong, Hao Lu, Chengxin Liu, Liang Liu, Zhiguo Cao, Chunhua Shen
A dense region can always be divided until sub-region counts are within the previously observed closed set.
Ranked #3 on Crowd Counting on TRANCOS
1 code implementation • 30 Apr 2019 • Chen Zhao, Jiaqi Yang, Yang Xiao, Zhiguo Cao
Correspondence selection, which aims to find correct feature correspondences among raw feature matches, is pivotal for a number of feature-matching-based tasks.
no code implementations • 27 Apr 2019 • Jiaqi Yang, Chen Zhao, Ke Xian, Angfan Zhu, Zhiguo Cao
This paper presents a simple yet very effective data-driven approach to fuse both low-level and high-level local geometric features for 3D rigid data matching.
1 code implementation • CVPR 2019 • Chen Zhao, Zhiguo Cao, Chi Li, Xin Li, Jiaqi Yang
Feature correspondence selection is pivotal to many feature-matching based tasks in computer vision.
no code implementations • 21 Feb 2019 • Guilei Hu, Yang Xiao, Zhiguo Cao, Lubin Meng, Zhiwen Fang, Joey Tianyi Zhou, Junsong Yuan
Effective, real-time eyeblink detection has a wide range of applications, such as deception detection, driver fatigue detection, and face anti-spoofing.
1 code implementation • 29 Jul 2018 • Fu Xiong, Yang Xiao, Zhiguo Cao, Kaicheng Gong, Zhiwen Fang, Joey Tianyi Zhou
Person re-identification is indeed a challenging visual recognition task due to the critical issues of human pose variation, human body occlusion, camera view variation, etc.
1 code implementation • 11 Jul 2018 • Ruibo Li, Ke Xian, Chunhua Shen, Zhiguo Cao, Hao Lu, Lingxiao Hang
However, robust depth prediction suffers from two challenging problems: a) How to extract more discriminative features for different scenes (compared to a single scene)?
1 code implementation • 29 Jun 2018 • Yang Xiao, Jun Chen, Yancheng Wang, Zhiguo Cao, Joey Tianyi Zhou, Xiang Bai
To better exploit three-dimensional (3D) characteristics, multi-view dynamic images are proposed.
no code implementations • 2 Jun 2018 • Yuanzhouhan Cao, Tianqi Zhao, Ke Xian, Chunhua Shen, Zhiguo Cao, Shugong Xu
In this paper, we propose to improve the performance of metric depth estimation with relative depths collected from stereo movie videos using an existing stereo matching algorithm.
no code implementations • CVPR 2018 • Ke Xian, Chunhua Shen, Zhiguo Cao, Hao Lu, Yang Xiao, Ruibo Li, Zhenbo Luo
In this paper we study the problem of monocular relative depth perception in the wild.
no code implementations • 6 Apr 2018 • Jiaqi Yang, Ke Xian, Yang Xiao, Zhiguo Cao
This paper presents a thorough evaluation of several widely used 3D correspondence grouping algorithms, motivated by their significance in vision tasks relying on correct feature correspondences.
1 code implementation • ICCV 2017 • Hao Lu, Lei Zhang, Zhiguo Cao, Wei Wei, Ke Xian, Chunhua Shen, Anton Van Den Hengel
Domain adaptation (DA) allows machine learning methods trained on data sampled from one distribution to be applied to data sampled from another.
no code implementations • 7 Jul 2017 • Hao Lu, Zhiguo Cao, Yang Xiao, Bohan Zhuang, Chunhua Shen
To our knowledge, this is the first time that a plant-related counting problem has been considered using computer vision technologies in an unconstrained field-based environment.