Search Results for author: Zhiguo Cao

Found 75 papers, 44 papers with code

Sparse-to-Dense Depth Completion Revisited: Sampling Strategy and Graph Construction

no code implementations • ECCV 2020 • Xin Xiong, Haipeng Xiong, Ke Xian, Chen Zhao, Zhiguo Cao, Xin Li

Depth completion is a widely studied problem of predicting a dense depth map from a sparse set of measurements and a single RGB image.

Depth Completion graph construction

Paper
Add Code

3D Multi-frame Fusion for Video Stabilization

no code implementations • 19 Apr 2024 • Zhan Peng, Xinyi Ye, Weiyue Zhao, Tianqi Liu, Huiqiang Sun, Baopu Li, Zhiguo Cao

In this paper, we present RStab, a novel framework for video stabilization that integrates 3D multi-frame fusion through volume rendering.

Paper
Add Code

In-Context Matting

1 code implementation • 23 Mar 2024 • He guo, Zixuan Ye, Zhiguo Cao, Hao Lu

We introduce in-context matting, a novel task setting of image matting.

Image Matting

Paper
Code

Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations

1 code implementation • 20 Mar 2024 • Kewei Wang, Yizheng Wu, Jun Cen, Zhiyu Pan, Xingyi Li, Zhe Wang, Zhiguo Cao, Guosheng Lin

To this end, we explore the feasibility of self-supervised motion prediction with only unlabeled LiDAR point clouds.

Autonomous Driving motion prediction

Paper
Code

CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner

no code implementations • 15 Mar 2024 • Tingbing Yan, Wenzheng Zeng, Yang Xiao, Xingyu Tong, Bo Tan, Zhiwen Fang, Zhiguo Cao, Joey Tianyi Zhou

Most existing one-shot skeleton-based action recognition focuses on raw low-level information (e. g., joint location), and may suffer from local information loss and low generalization ability.

Skeleton Based Action Recognition

Paper
Add Code

DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video

no code implementations • 15 Mar 2024 • Huiqiang Sun, Xingyi Li, Liao Shen, Xinyi Ye, Ke Xian, Zhiguo Cao

Experimental results on our dataset demonstrate that our method outperforms existing approaches in generating sharp novel views from motion-blurred inputs while maintaining spatial-temporal consistency of the scene.

Paper
Add Code

S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes

no code implementations • 10 Mar 2024 • Xingyi Li, Zhiguo Cao, Yizheng Wu, Kewei Wang, Ke Xian, Zhe Wang, Guosheng Lin

To address this limitation, we present S-DyRF, a reference-based spatio-temporal stylization method for dynamic neural radiance fields.

Style Transfer

Paper
Add Code

Semi-Supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix

1 code implementation • 13 Dec 2023 • Kewei Wang, Yizheng Wu, Zhiyu Pan, Xingyi Li, Ke Xian, Zhe Wang, Zhiguo Cao, Guosheng Lin

To improve the quality of pseudo labels, we propose a novel motion selection and re-generation module.

Autonomous Driving Data Augmentation +2

Paper
Code

End-to-end Video Gaze Estimation via Capturing Head-face-eye Spatial-temporal Interaction Context

1 code implementation • 27 Oct 2023 • Yiran Guan, Zhuoguang Chen, Wenzheng Zeng, Zhiguo Cao, Yang Xiao

In this letter, we propose a new method, Multi-Clue Gaze (MCGaze), to facilitate video gaze estimation via capturing spatial-temporal interaction context among head, face, and eye in an end-to-end learning way, which has not been well concerned yet.

Ranked #1 on Gaze Estimation on Gaze360

Gaze Estimation

Paper
Code

When Epipolar Constraint Meets Non-local Operators in Multi-View Stereo

1 code implementation • ICCV 2023 • Tianqi Liu, Xinyi Ye, Weiyue Zhao, Zhiyu Pan, Min Shi, Zhiguo Cao

This constraint reduces the 2D search space into the epipolar line in stereo matching.

Ranked #3 on 3D Reconstruction on DTU

3D Reconstruction Descriptive +2

Paper
Code

Learning to Upsample by Learning to Sample

1 code implementation • ICCV 2023 • Wenze Liu, Hao Lu, Hongtao Fu, Zhiguo Cao

We present DySample, an ultra-lightweight and effective dynamic upsampler.

Instance Segmentation Monocular Depth Estimation +4

Paper
Code

Point-Query Quadtree for Crowd Counting, Localization, and More

1 code implementation • ICCV 2023 • Chengxin Liu, Hao Lu, Zhiguo Cao, Tongliang Liu

Such a querying process yields an intuitive, universal modeling of crowd as both the input and output are interpretable and steerable.

Crowd Counting

Paper
Code

Make-It-4D: Synthesizing a Consistent Long-Term Dynamic Scene Video from a Single Image

no code implementations • 20 Aug 2023 • Liao Shen, Xingyi Li, Huiqiang Sun, Juewen Peng, Ke Xian, Zhiguo Cao, Guosheng Lin

To animate the visual content, the feature point cloud is displaced based on the scene flow derived from motion estimation and the corresponding camera pose.

Motion Estimation

Paper
Add Code

Diffusion-Augmented Depth Prediction with Sparse Annotations

no code implementations • 4 Aug 2023 • Jiaqi Li, Yiran Wang, Zihao Huang, Jinghong Zheng, Ke Xian, Zhiguo Cao, Jianming Zhang

We leverage the structural characteristics of diffusion model to enforce depth structures of depth models in a plug-and-play manner.

Autonomous Driving Depth Estimation +3

Paper
Add Code

The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World

1 code implementation • 3 Aug 2023 • Weiyun Wang, Min Shi, Qingyun Li, Wenhai Wang, Zhenhang Huang, Linjie Xing, Zhe Chen, Hao Li, Xizhou Zhu, Zhiguo Cao, Yushi Chen, Tong Lu, Jifeng Dai, Yu Qiao

We present the All-Seeing (AS) project: a large-scale data and model for recognizing and understanding everything in the open world.

Question Answering Retrieval +1

369

Paper
Code

Fast Full-frame Video Stabilization with Iterative Optimization

1 code implementation • ICCV 2023 • Weiyue Zhao, Xin Li, Zhan Peng, Xianrui Luo, Xinyi Ye, Hao Lu, Zhiguo Cao

Video stabilization refers to the problem of transforming a shaky video into a visually pleasing one.

Video Stabilization

Paper
Code

Constraining Depth Map Geometry for Multi-View Stereo: A Dual-Depth Approach with Saddle-shaped Depth Cells

1 code implementation • ICCV 2023 • Xinyi Ye, Weiyue Zhao, Tianqi Liu, Zihao Huang, Zhiguo Cao, Xin Li

Learning-based multi-view stereo (MVS) methods deal with predicting accurate depth maps to achieve an accurate and complete 3D representation.

Depth Estimation Depth Prediction

Paper
Code

Box-DETR: Understanding and Boxing Conditional Spatial Queries

1 code implementation • 17 Jul 2023 • Wenze Liu, Hao Lu, Yuliang Liu, Zhiguo Cao

In DAB-DETR, such queries are modulated by the so-called conditional linear projection at each decoder stage, aiming to search for positions of interest such as the four extremities of the box.

Paper
Code

Neural Video Depth Stabilizer

3 code implementations • ICCV 2023 • Yiran Wang, Min Shi, Jiaqi Li, Zihao Huang, Zhiguo Cao, Jianming Zhang, Ke Xian, Guosheng Lin

Video depth estimation aims to infer temporally consistent depth.

Ranked #15 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)

Monocular Depth Estimation

581

Paper
Code

On Point Affiliation in Feature Upsampling

2 code implementations • 17 Jul 2023 • Wenze Liu, Hao Lu, Yuliang Liu, Zhiguo Cao

We introduce the notion of point affiliation into feature upsampling.

Depth Estimation Feature Upsampling +6

Paper
Code

Defocus to focus: Photo-realistic bokeh rendering by fusing defocus and radiance priors

no code implementations • 7 Jun 2023 • Xianrui Luo, Juewen Peng, Ke Xian, Zijin Wu, Zhiguo Cao

To this end, we present a Defocus to Focus (D2F) framework to learn realistic bokeh rendering by fusing defocus priors with the all-in-focus image and by implementing radiance priors in layered fusion.

Hallucination

Paper
Add Code

Learning Probabilistic Coordinate Fields for Robust Correspondences

no code implementations • 7 Jun 2023 • Weiyue Zhao, Hao Lu, Xinyi Ye, Zhiguo Cao, Xin Li

We introduce Probabilistic Coordinate Fields (PCFs), a novel geometric-invariant coordinate representation for image correspondence problems.

Image Registration Pose Estimation

Paper
Add Code

A2B: Anchor to Barycentric Coordinate for Robust Correspondence

no code implementations • 5 Jun 2023 • Weiyue Zhao, Hao Lu, Zhiguo Cao, Xin Li

This approach offers a new perspective to alleviate the problem of repeated patterns and emphasizes the importance of choosing coordinate representations for feature correspondences.

Paper
Add Code

Lens-to-lens bokeh effect transformation. NTIRE 2023 challenge report

1 code implementation • CVPRW 2023 • Marcos V. Conde, Manuel Kolmet, Tim Seizinger, Tom E. Bishop, Radu Timofte, Xiangyu Kong, Dafeng Zhang, Jinlong Wu, Fan Wang, Juewen Peng, Zhiyu Pan, Chengxin Liu, Xianrui Luo, Huiqiang Sun, Liao Shen, Zhiguo Cao, Ke Xian, Chaowei Liu, Zigeng Chen, Xingyi Yang, Songhua Liu, Yongcheng Jing, Michael Bi Mi, Xinchao Wang, Zhihao Yang, Wenyi Lian, Siyuan Lai, Haichuan Zhang, Trung Hoang, Amirsaeed Yazdani, Vishal Monga, Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön, Yuxuan Zhao, Baoliang Chen, Yiqing Xu, JiXiang Niu

We present the new Bokeh Effect Transformation Dataset (BETD), and review the proposed solutions for this novel task at the NTIRE 2023 Bokeh Effect Transformation Challenge.

Bokeh Effect Rendering

Paper
Code

Vision Transformer Off-the-Shelf: A Surprising Baseline for Few-Shot Class-Agnostic Counting

no code implementations • 8 May 2023 • Zhicheng Wang, Liwen Xiao, Zhiguo Cao, Hao Lu

This task is typically addressed by extracting the features of query image and exemplars respectively and then matching their feature similarity, leading to an extract-then-match paradigm.

Paper
Add Code

Point-and-Shoot All-in-Focus Photo Synthesis from Smartphone Camera Pair

no code implementations • 11 Apr 2023 • Xianrui Luo, Juewen Peng, Weiyue Zhao, Ke Xian, Hao Lu, Zhiguo Cao

Benefiting from the multi-camera module in modern smartphones, we introduce a new task of AIF synthesis from main (wide) and ultra-wide cameras.

Paper
Add Code

A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation from a Single RGB Image

1 code implementation • CVPR 2023 • Changlong Jiang, Yang Xiao, Cunlin Wu, Mingyang Zhang, Jinghong Zheng, Zhiguo Cao, Joey Tianyi Zhou

3D interacting hand pose estimation from a single RGB image is a challenging task, due to serious self-occlusion and inter-occlusion towards hands, confusing similar appearance patterns between 2 hands, ill-posed joint position mapping from 2D to 3D, etc.. To address these, we propose to extend A2J-the state-of-the-art depth-based 3D single hand pose estimation method-to RGB domain under interacting hand condition.

Ranked #7 on Hand Pose Estimation on NYU Hands

3D Interacting Hand Pose Estimation Hand Pose Estimation +1

Paper
Code

Real-time Multi-person Eyeblink Detection in the Wild for Untrimmed Video

1 code implementation • CVPR 2023 • Wenzheng Zeng, Yang Xiao, Sicheng Wei, Jinfang Gan, Xintao Zhang, Zhiguo Cao, Zhiwen Fang, Joey Tianyi Zhou

Experiments on MPEblink verify the essential challenges of real-time multi-person eyeblink detection in the wild for untrimmed video.

Emotion Recognition Face Anti-Spoofing +1

Paper
Code

Learning Second-Order Attentive Context for Efficient Correspondence Pruning

no code implementations • 28 Mar 2023 • Xinyi Ye, Weiyue Zhao, Hao Lu, Zhiguo Cao

It is challenging because of the disorganized spatial distribution of numerous outliers, especially when putative correspondences are largely dominated by outliers.

Paper
Add Code

3D Cinemagraphy from a Single Image

no code implementations • CVPR 2023 • Xingyi Li, Zhiguo Cao, Huiqiang Sun, Jianming Zhang, Ke Xian, Guosheng Lin

To animate the scene, we perform motion estimation and lift the 2D motion into the 3D scene flow.

Image Animation Motion Estimation

Paper
Add Code

Find Beauty in the Rare: Contrastive Composition Feature Clustering for Nontrivial Cropping Box Regression

no code implementations • 17 Feb 2023 • Zhiyu Pan, Yinpeng Chen, Jiale Zhang, Hao Lu, Zhiguo Cao, Weicai Zhong

Observing that similar composition patterns tend to be shared by the cropping boundaries annotated nearly, we argue to find the beauty of composition from the rare samples by clustering the samples with similar cropping boundary annotations, ie, similar composition patterns.

Clustering Image Cropping +2

Paper
Add Code

Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation

1 code implementation • CVPR 2023 • Min Shi, Zihao Huang, Xianzheng Ma, Xiaowei Hu, Zhiguo Cao

To calibrate the inaccurate matching results, we introduce a two-stage framework, where matched keypoints from the first stage are viewed as similarity-aware position proposals.

Ranked #3 on 2D Pose Estimation on MP-100

Category-Agnostic Pose Estimation Pose Estimation

Paper
Code

Infusing Definiteness into Randomness: Rethinking Composition Styles for Deep Image Matting

1 code implementation • 27 Dec 2022 • Zixuan Ye, Yutong Dai, Chaoyi Hong, Zhiguo Cao, Hao Lu

Inspired by this, we introduce a novel composition style that binds the source and combined foregrounds in a definite triplet.

Image Matting

Paper
Code

Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report

no code implementations • 7 Nov 2022 • Andrey Ignatov, Grigory Malivenko, Radu Timofte, Lukasz Treszczotko, Xin Chang, Piotr Ksiazek, Michal Lopuszynski, Maciej Pioro, Rafal Rudnicki, Maciej Smyl, Yujie Ma, Zhenyu Li, Zehui Chen, Jialei Xu, Xianming Liu, Junjun Jiang, XueChao Shi, Difan Xu, Yanan Li, Xiaotao Wang, Lei Lei, Ziyu Zhang, Yicheng Wang, Zilong Huang, Guozhong Luo, Gang Yu, Bin Fu, Jiaqi Li, Yiran Wang, Zihao Huang, Zhiguo Cao, Marcos V. Conde, Denis Sapozhnikov, Byeong Hyun Lee, Dongwon Park, Seongmin Hong, Joonhee Lee, Seunggyu Lee, Se Young Chun

Various depth estimation models are now widely used on many mobile and IoT devices for image segmentation, bokeh effect rendering, object tracking and many other mobile tasks.

Bokeh Effect Rendering Depth Estimation +3

Paper
Add Code

SymmNeRF: Learning to Explore Symmetry Prior for Single-View View Synthesis

1 code implementation • 29 Sep 2022 • Xingyi Li, Chaoyi Hong, Yiran Wang, Zhiguo Cao, Ke Xian, Guosheng Lin

We study the problem of novel view synthesis of objects from a single image.

Novel View Synthesis

Paper
Code

SAPA: Similarity-Aware Point Affiliation for Feature Upsampling

2 code implementations • 26 Sep 2022 • Hao Lu, Wenze Liu, Zixuan Ye, Hongtao Fu, Yuliang Liu, Zhiguo Cao

We introduce point affiliation into feature upsampling, a notion that describes the affiliation of each upsampled point to a semantic cluster formed by local decoder feature points with semantic similarity.

Ranked #5 on Feature Upsampling on ImageNet

Depth Estimation Feature Upsampling +6

Paper
Code

DoF-NeRF: Depth-of-Field Meets Neural Radiance Fields

1 code implementation • 1 Aug 2022 • Zijin Wu, Xingyi Li, Juewen Peng, Hao Lu, Zhiguo Cao, Weicai Zhong

To mitigate this issue, we introduce DoF-NeRF, a novel neural rendering approach that can deal with shallow DoF inputs and can simulate DoF effect.

Neural Rendering

Paper
Code

Design What You Desire: Icon Generation from Orthogonal Application and Theme Labels

1 code implementation • 31 Jul 2022 • Yinpeng Chen, Zhiyu Pan, Min Shi, Hao Lu, Zhiguo Cao, Weicai Zhong

Generative adversarial networks (GANs) have been trained to be professional artists able to create stunning artworks such as face generation and image style transfer.

Disentanglement Face Generation +1

Paper
Code

Less is More: Consistent Video Depth Estimation with Masked Frames Modeling

1 code implementation • 31 Jul 2022 • Yiran Wang, Zhiyu Pan, Xingyi Li, Zhiguo Cao, Ke Xian, Jianming Zhang

Temporal consistency is the key challenge of video depth estimation.

Depth Estimation Optical Flow Estimation

Paper
Code

FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling

no code implementations • 21 Jul 2022 • Hao Lu, Wenze Liu, Hongtao Fu, Zhiguo Cao

We consider the problem of task-agnostic feature upsampling in dense prediction where an upsampling operator is required to facilitate both region-sensitive tasks like semantic segmentation and detail-sensitive tasks such as image matting.

Feature Upsampling Image Matting +1

Paper
Add Code

Robust Object Detection With Inaccurate Bounding Boxes

1 code implementation • 20 Jul 2022 • Chengxin Liu, Kewei Wang, Hao Lu, Zhiguo Cao, Ziming Zhang

As the crowd-sourcing labeling process and the ambiguities of the objects may raise noisy bounding box annotations, the object detectors will suffer from the degenerated training data.

Multiple Instance Learning Object +2

Paper
Code

MPIB: An MPI-Based Bokeh Rendering Framework for Realistic Partial Occlusion Effects

1 code implementation • 18 Jul 2022 • Juewen Peng, Jianming Zhang, Xianrui Luo, Hao Lu, Ke Xian, Zhiguo Cao

Partial occlusion effects are a phenomenon that blurry objects near a camera are semi-transparent, resulting in partial appearance of occluded background.

Paper
Code

3D Instances as 1D Kernels

1 code implementation • 15 Jul 2022 • Yizheng Wu, Min Shi, Shuaiyuan Du, Hao Lu, Zhiguo Cao, Weicai Zhong

The idea of instance kernel is inspired by recent success of dynamic convolutions in 2D/3D instance segmentation.

Ranked #2 on 3D Instance Segmentation on S3DIS (mCov metric)

3D Instance Segmentation Semantic Segmentation

Paper
Code

BokehMe: When Neural Rendering Meets Classical Rendering

1 code implementation • CVPR 2022 • Juewen Peng, Zhiguo Cao, Xianrui Luo, Hao Lu, Ke Xian, Jianming Zhang

Based on this formulation, we implement the classical renderer by a scattering-based method and propose a two-stage neural renderer to fix the erroneous areas from the classical renderer.

Neural Rendering

170

Paper
Code

Interior Attention-Aware Network for Infrared Small Target Detection

1 code implementation • IEEE Transactions on Geoscience and Remote Sensing 2022 • Kewei Wang, Shuaiyuan Du, Chengxin Liu, Zhiguo Cao

Motivated by the fact that pixels from targets or backgrounds are correlated to each other, we propose a coarse-to-fine interior attention-aware network (IAANet) for infrared small target detection.

2D Object Detection 2D Semantic Segmentation

Paper
Code

Represent, Compare, and Learn: A Similarity-Aware Framework for Class-Agnostic Counting

1 code implementation • CVPR 2022 • Min Shi, Hao Lu, Chen Feng, Chengxin Liu, Zhiguo Cao

In this work, we propose a similarity-aware CAC framework that jointly learns representation and similarity metric.

Ranked #4 on Object Counting on CARPK

Object Counting

Paper
Code

Composing Photos Like a Photographer

1 code implementation • CVPR 2021 • Chaoyi Hong, Shuaiyuan Du, Ke Xian, Hao Lu, Zhiguo Cao, Weicai Zhong

To this end, we introduce the concept of the key composition map (KCM) to encode the composition rules.

Ranked #1 on Image Cropping on FLMS

Image Cropping

Paper
Code

Fast and Accurate Single-Image Depth Estimation on Mobile Devices, Mobile AI 2021 Challenge: Report

no code implementations • 17 May 2021 • Andrey Ignatov, Grigory Malivenko, David Plowman, Samarth Shukla, Radu Timofte, Ziyu Zhang, Yicheng Wang, Zilong Huang, Guozhong Luo, Gang Yu, Bin Fu, Yiran Wang, Xingyi Li, Min Shi, Ke Xian, Zhiguo Cao, Jin-Hua Du, Pei-Lin Wu, Chao Ge, Jiaoyang Yao, Fangwen Tu, Bo Li, Jung Eun Yoo, Kwanggyoon Seo, Jialei Xu, Zhenyu Li, Xianming Liu, Junjun Jiang, Wei-Chi Chen, Shayan Joya, Huanhuan Fan, Zhaobing Kang, Ang Li, Tianpeng Feng, Yang Liu, Chuannan Sheng, Jian Yin, Fausto T. Benavide

While many solutions have been proposed for this task, they are usually very computationally expensive and thus are not applicable for on-device inference.

Depth Estimation

Paper
Add Code

TransView: Inside, Outside, and Across the Cropping View Boundaries

no code implementations • ICCV 2021 • Zhiyu Pan, Zhiguo Cao, Kewei Wang, Hao Lu, Weicai Zhong

We show that relation modeling between visual elements matters in cropping view recommendation.

Relation

Paper
Add Code

AIM 2020 Challenge on Rendering Realistic Bokeh

no code implementations • 10 Nov 2020 • Andrey Ignatov, Radu Timofte, Ming Qian, Congyu Qiao, Jiamin Lin, Zhenyu Guo, Chenghua Li, Cong Leng, Jian Cheng, Juewen Peng, Xianrui Luo, Ke Xian, Zijin Wu, Zhiguo Cao, Densen Puthussery, Jiji C V, Hrishikesh P S, Melvin Kuriakose, Saikat Dutta, Sourya Dipta Das, Nisarg A. Shah, Kuldeep Purohit, Praveen Kandula, Maitreya Suin, A. N. Rajagopalan, Saagara M B, Minnu A L, Sanjana A R, Praseeda S, Ge Wu, Xueqin Chen, Tengyao Wang, Max Zheng, Hulk Wong, Jay Zou

This paper reviews the second AIM realistic bokeh effect rendering challenge and provides the description of the proposed solutions and results.

Bokeh Effect Rendering

Paper
Add Code

On Efficient and Robust Metrics for RANSAC Hypotheses and 3D Rigid Registration

no code implementations • 10 Nov 2020 • Jiaqi Yang, Zhiqiang Huang, Siwen Quan, Qian Zhang, Yanning Zhang, Zhiguo Cao

This paper focuses on developing efficient and robust evaluation metrics for RANSAC hypotheses to achieve accurate 3D rigid registration.

Paper
Add Code

3D Correspondence Grouping with Compatibility Features

no code implementations • 21 Jul 2020 • Jiaqi Yang, Jiahao Chen, Zhiqiang Huang, Siwen Quan, Yanning Zhang, Zhiguo Cao

We present a simple yet effective method for 3D correspondence grouping.

Paper
Add Code

Weighing Counts: Sequential Crowd Counting by Reinforcement Learning

1 code implementation • ECCV 2020 • Liang Liu, Hao Lu, Hongwei Zou, Haipeng Xiong, Zhiguo Cao, Chunhua Shen

Inspired by scale weighing, we propose a novel 'counting scale' termed LibraNet where the count value is analogized by weight.

Crowd Counting reinforcement-learning +1

Paper
Code

ECML: An Ensemble Cascade Metric Learning Mechanism towards Face Verification

1 code implementation • 11 Jul 2020 • Fu Xiong, Yang Xiao, Zhiguo Cao, Yancheng Wang, Joey Tianyi Zhou, Jianxi Wu

Embedding RMML into the proposed ECML mechanism, our metric learning paradigm (EC-RMML) can run in the one-pass learning manner.

Face Verification Fine-Grained Visual Recognition +1

Paper
Code

P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds

2 code implementations • CVPR 2020 • Haozhe Qi, Chen Feng, Zhiguo Cao, Feng Zhao, Yang Xiao

Specifically, we first sample seeds from the point clouds in template and search area respectively.

3D Object Tracking Object Tracking

232

Paper
Code

3DV: 3D Dynamic Voxel for Action Recognition in Depth Video

1 code implementation • CVPR 2020 • Yancheng Wang, Yang Xiao, Fu Xiong, Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan

Each available 3DV voxel intrinsically involves 3D spatial and motion feature jointly.

3D Action Recognition

Paper
Code

Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction

no code implementations • ECCV 2020 • Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, Mingxiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou, Sijia Mei, Yun-hui Liu, Adrian Spurr, Umar Iqbal, Pavlo Molchanov, Philippe Weinzaepfel, Romain Brégier, Grégory Rogez, Vincent Lepetit, Tae-Kyun Kim

To address these issues, we designed a public challenge (HANDS'19) to evaluate the abilities of current 3D hand pose estimators (HPEs) to interpolate and extrapolate the poses of a training set.

3D Hand Pose Estimation

Paper
Add Code

LRF-Net: Learning Local Reference Frames for 3D Local Shape Description and Matching

no code implementations • 22 Jan 2020 • Angfan Zhu, Jiaqi Yang, Weiyue Zhao, Zhiguo Cao

The local reference frame (LRF) acts as a critical role in 3D local shape description and matching.

Pose Estimation

Paper
Add Code

From Open Set to Closed Set: Supervised Spatial Divide-and-Conquer for Object Counting

3 code implementations • 7 Jan 2020 • Haipeng Xiong, Hao Lu, Chengxin Liu, Liang Liu, Chunhua Shen, Zhiguo Cao

Visual counting, a task that aims to estimate the number of objects from an image/video, is an open-set problem by nature, i. e., the number of population can vary in [0, inf) in theory.

Object Counting

132

Paper
Code

Rotation Invariant Point Cloud Classification: Where Local Geometry Meets Global Topology

1 code implementation • 1 Nov 2019 • Chen Zhao, Jiaqi Yang, Xin Xiong, Angfan Zhu, Zhiguo Cao, Xin Li

To the best of our knowledge, this work is the first principled approach toward adaptively combining global and local information under the context of RI point cloud analysis.

General Classification Point Cloud Classification

Paper
Code

Iterative Clustering with Game-Theoretic Matching for Robust Multi-consistency Correspondence

no code implementations • 3 Sep 2019 • Chen Zhao, Jiaqi Yang, Ke Xian, Zhiguo Cao, Xin Li

Matching corresponding features between two images is a fundamental task to computer vision with numerous applications in object recognition, robotics, and 3D reconstruction.

3D Reconstruction Clustering +2

Paper
Add Code

A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image

2 code implementations • ICCV 2019 • Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan

For 3D hand and body pose estimation task in depth image, a novel anchor-based approach termed Anchor-to-Joint regression network (A2J) with the end-to-end learning ability is proposed.

Ranked #1 on Hand Pose Estimation on K2HPD

3D Pose Estimation Depth Estimation +1

285

Paper
Code

From Open Set to Closed Set: Counting Objects by Spatial Divide-and-Conquer

5 code implementations • ICCV 2019 • Haipeng Xiong, Hao Lu, Chengxin Liu, Liang Liu, Zhiguo Cao, Chunhua Shen

A dense region can always be divided until sub-region counts are within the previously observed closed set.

Ranked #3 on Crowd Counting on TRANCOS

Crowd Counting

132

Paper
Code

Comparative evaluation of 2D feature correspondence selection algorithms

1 code implementation • 30 Apr 2019 • Chen Zhao, Jiaqi Yang, Yang Xiao, Zhiguo Cao

Correspondence selection aiming at seeking correct feature correspondences from raw feature matches is pivotal for a number of feature-matching-based tasks.

Paper
Code

Learning to Fuse Local Geometric Features for 3D Rigid Data Matching

no code implementations • 27 Apr 2019 • Jiaqi Yang, Chen Zhao, Ke Xian, Angfan Zhu, Zhiguo Cao

This paper presents a simple yet very effective data-driven approach to fuse both low-level and high-level local geometric features for 3D rigid data matching.

Paper
Add Code

NM-Net: Mining Reliable Neighbors for Robust Feature Correspondences

1 code implementation • CVPR 2019 • Chen Zhao, Zhiguo Cao, Chi Li, Xin Li, Jiaqi Yang

Feature correspondence selection is pivotal to many feature-matching based tasks in computer vision.

Paper
Code

Towards Real-time Eyeblink Detection in The Wild:Dataset,Theory and Practices

no code implementations • 21 Feb 2019 • Guilei Hu, Yang Xiao, Zhiguo Cao, Lubin Meng, Zhiwen Fang, Joey Tianyi Zhou, Junsong Yuan

Effective and real-time eyeblink detection is of wide-range applications, such as deception detection, drive fatigue detection, face anti-spoofing, etc.

Attribute Deception Detection +1

Paper
Add Code

Towards Good Practices on Building Effective CNN Baseline Model for Person Re-identification

1 code implementation • 29 Jul 2018 • Fu Xiong, Yang Xiao, Zhiguo Cao, Kaicheng Gong, Zhiwen Fang, Joey Tianyi Zhou

Person re-identification is indeed a challenging visual recognition task due to the critical issues of human pose variation, human body occlusion, camera view variation, etc.

Open-Ended Question Answering Person Re-Identification

Paper
Code

Deep attention-based classification network for robust depth prediction

1 code implementation • 11 Jul 2018 • Ruibo Li, Ke Xian, Chunhua Shen, Zhiguo Cao, Hao Lu, Lingxiao Hang

However, robust depth prediction suffers from two challenging problems: a) How to extract more discriminative features for different scenes (compared to a single scene)?

Classification Deep Attention +5

Paper
Code

Action Recognition for Depth Video using Multi-view Dynamic Images

1 code implementation • 29 Jun 2018 • Yang Xiao, Jun Chen, Yancheng Wang, Zhiguo Cao, Joey Tianyi Zhou, Xiang Bai

To better exploit three-dimensional (3D) characteristics, multi-view dynamic images are proposed.

Action Recognition Optical Flow Estimation +1

Paper
Code

Monocular Depth Estimation with Augmented Ordinal Depth Relationships

no code implementations • 2 Jun 2018 • Yuanzhouhan Cao, Tianqi Zhao, Ke Xian, Chunhua Shen, Zhiguo Cao, Shugong Xu

In this paper, we propose to improve the performance of metric depth estimation with relative depths collected from stereo movie videos using existing stereo matching algorithm.

Depth Prediction Monocular Depth Estimation +2

Paper
Add Code

Monocular Relative Depth Perception With Web Stereo Data Supervision

no code implementations • CVPR 2018 • Ke Xian, Chunhua Shen, Zhiguo Cao, Hao Lu, Yang Xiao, Ruibo Li, Zhenbo Luo

In this paper we study the problem of monocular relative depth perception in the wild.

Depth Estimation Semantic Segmentation

Paper
Add Code

Performance Evaluation of 3D Correspondence Grouping Algorithms

no code implementations • 6 Apr 2018 • Jiaqi Yang, Ke Xian, Yang Xiao, Zhiguo Cao

This paper presents a thorough evaluation of several widely-used 3D correspondence grouping algorithms, motived by their significance in vision tasks relying on correct feature correspondences.

3D Object Recognition Point Cloud Registration +1

Paper
Add Code

When Unsupervised Domain Adaptation Meets Tensor Representations

1 code implementation • ICCV 2017 • Hao Lu, Lei Zhang, Zhiguo Cao, Wei Wei, Ke Xian, Chunhua Shen, Anton Van Den Hengel

Domain adaption (DA) allows machine learning methods trained on data sampled from one distribution to be applied to data sampled from another.

Unsupervised Domain Adaptation

Paper
Code

TasselNet: Counting maize tassels in the wild via local counts regression network

no code implementations • 7 Jul 2017 • Hao Lu, Zhiguo Cao, Yang Xiao, Bohan Zhuang, Chunhua Shen

To our knowledge, this is the first time that a plant-related counting problem is considered using computer vision technologies under unconstrained field-based environment.

Plant Phenotyping regression

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.