Search Results for author: Xibin Song

Found 17 papers, 6 papers with code

RGB-based Category-level Object Pose Estimation via Decoupled Metric Scale Recovery

1 code implementation19 Sep 2023 Jiaxin Wei, Xibin Song, Weizhe Liu, Laurent Kneip, Hongdong Li, Pan Ji

While showing promising results, recent RGB-D camera-based category-level object pose estimation methods have restricted applications due to the heavy reliance on depth sensors.

Object Pose Estimation

Digging Into Uncertainty-based Pseudo-label for Robust Stereo Matching

1 code implementation31 Jul 2023 Zhelun Shen, Xibin Song, Yuchao Dai, Dingfu Zhou, Zhibo Rao, Liangjun Zhang

Due to the domain differences and unbalanced disparity distribution across multiple datasets, current stereo matching approaches are commonly limited to a specific dataset and generalize poorly to others.

Monocular Depth Estimation Pseudo Label +1

A Representation Separation Perspective to Correspondences-free Unsupervised 3D Point Cloud Registration

no code implementations24 Mar 2022 Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Dingfu Zhou, Xibin Song, Mingyi He

Existing correspondences-free methods generally learn the holistic representation of the entire point cloud, which is fragile for partial and noisy point clouds.

Point Cloud Registration

End-to-end Learning the Partial Permutation Matrix for Robust 3D Point Cloud Registration

no code implementations28 Oct 2021 Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Dingfu Zhou, Xibin Song, Mingyi He

Even though considerable progress has been made in deep learning-based 3D point cloud processing, how to obtain accurate correspondences for robust registration remains a major challenge because existing hard assignment methods cannot deal with outliers naturally.

Point Cloud Registration

Self-supervised Monocular Depth Estimation for All Day Images using Domain Separation

2 code implementations ICCV 2021 Lina Liu, Xibin Song, Mengmeng Wang, Yong liu, Liangjun Zhang

Meanwhile, to guarantee that the day and night images contain the same information, the domain-separated network takes the day-time images and corresponding night-time images (generated by GAN) as input, and the private and invariant feature extractors are learned by orthogonality and similarity loss, where the domain gap can be alleviated, thus better depth maps can be expected.

Monocular Depth Estimation

MapFusion: A General Framework for 3D Object Detection with HDMaps

no code implementations10 Mar 2021 Jin Fang, Dingfu Zhou, Xibin Song, Liangjun Zhang

In this paper, we propose a simple but effective framework - MapFusion to integrate the map information into modern 3D object detector pipelines.

3D Object Detection Autonomous Driving +2

FCFR-Net: Feature Fusion based Coarse-to-Fine Residual Learning for Depth Completion

no code implementations15 Dec 2020 Lina Liu, Xibin Song, Xiaoyang Lyu, Junwei Diao, Mengmeng Wang, Yong liu, Liangjun Zhang

Then, a refined depth map is further obtained using a residual learning strategy in the coarse-to-fine stage with a coarse depth map and color image as input.

Depth Completion

PerMO: Perceiving More at Once from a Single Image for Autonomous Driving

no code implementations16 Jul 2020 Feixiang Lu, Zongdai Liu, Xibin Song, Dingfu Zhou, Wei Li, Hui Miao, Miao Liao, Liangjun Zhang, Bin Zhou, Ruigang Yang, Dinesh Manocha

We present a novel approach to detect, segment, and reconstruct complete textured 3D models of vehicles from a single image for autonomous driving.

3D Reconstruction Autonomous Driving +3

PCW-Net: Pyramid Combination and Warping Cost Volume for Stereo Matching

2 code implementations23 Jun 2020 Zhelun Shen, Yuchao Dai, Xibin Song, Zhibo Rao, Dingfu Zhou, Liangjun Zhang

First, we construct combination volumes on the upper levels of the pyramid and develop a cost volume fusion module to integrate them for initial disparity estimation.

Disparity Estimation Domain Generalization +1

Channel Attention based Iterative Residual Learning for Depth Map Super-Resolution

no code implementations CVPR 2020 Xibin Song, Yuchao Dai, Dingfu Zhou, Liu Liu, Wei Li, Hongdng Li, Ruigang Yang

Second, we propose a new framework for real-world DSR, which consists of four modules : 1) An iterative residual learning module with deep supervision to learn effective high-frequency components of depth maps in a coarse-to-fine manner; 2) A channel attention strategy to enhance channels with abundant high-frequency components; 3) A multi-stage fusion module to effectively re-exploit the results in the coarse-to-fine process; and 4) A depth refinement module to improve the depth map by TGV regularization and input loss.

Benchmarking Depth Map Super-Resolution

Learning 2D-3D Correspondences To Solve The Blind Perspective-n-Point Problem

1 code implementation15 Mar 2020 Liu Liu, Dylan Campbell, Hongdong Li, Dingfu Zhou, Xibin Song, Ruigang Yang

Conventional absolute camera pose via a Perspective-n-Point (PnP) solver often assumes that the correspondences between 2D image pixels and 3D points are given.

IoU Loss for 2D/3D Object Detection

1 code implementation11 Aug 2019 Dingfu Zhou, Jin Fang, Xibin Song, Chenye Guan, Junbo Yin, Yuchao Dai, Ruigang Yang

In 2D/3D object detection task, Intersection-over-Union (IoU) has been widely employed as an evaluation metric to evaluate the performance of different detectors in the testing stage.

3D Object Detection Object +1

ApolloCar3D: A Large 3D Car Instance Understanding Benchmark for Autonomous Driving

no code implementations CVPR 2019 Xibin Song, Peng Wang, Dingfu Zhou, Rui Zhu, Chenye Guan, Yuchao Dai, Hao Su, Hongdong Li, Ruigang Yang

Specifically, we first segment each car with a pre-trained Mask R-CNN, and then regress towards its 3D pose and shape based on a deformable 3D car model with or without using semantic keypoints.

3D Car Instance Understanding Autonomous Driving

Deeply Supervised Depth Map Super-Resolution as Novel View Synthesis

no code implementations27 Aug 2018 Xibin Song, Yuchao Dai, Xueying Qin

However, there still exist two major issues with these DCNN based depth map super-resolution methods that hinder the performance: i) The low-resolution depth maps either need to be up-sampled before feeding into the network or substantial deconvolution has to be used; and ii) The supervision (high-resolution depth maps) is only applied at the end of the network, thus it is difficult to handle large up-sampling factors, such as $\times 8, \times 16$.

Benchmarking Blocking +2

Deep Depth Super-Resolution : Learning Depth Super-Resolution using Deep Convolutional Neural Network

no code implementations7 Jul 2016 Xibin Song, Yuchao Dai, Xueying Qin

In this paper, we bridge up the gap and extend the success of deep convolutional neural network to depth super-resolution.

Image Super-Resolution

Cannot find the paper you are looking for? You can Submit a new open access paper.