Search Results for author: Guangkai Xu

Found 7 papers, 1 paper with code

Diffusion Models Trained with Large Data Are Transferable Visual Models

no code implementations • 10 Mar 2024 • Guangkai Xu, Yongtao Ge, MingYu Liu, Chengxiang Fan, Kangyang Xie, Zhiyue Zhao, Hao Chen, Chunhua Shen

We show that, by simply initializing image understanding models with a pre-trained UNet (or transformer) from diffusion models, it is possible to achieve remarkable transferable performance on fundamental vision perception tasks using a moderate amount of target data (even synthetic data only), including monocular depth, surface normal, image segmentation, matting, and human pose estimation, among many others.

Image Matting, Image Segmentation, +2

Towards Domain-agnostic Depth Completion

1 code implementation • 29 Jul 2022 • Guangkai Xu, Wei Yin, Jianming Zhang, Oliver Wang, Simon Niklaus, Simon Chen, Jia-Wang Bian

Our method leverages a data-driven prior in the form of a single image depth prediction network trained on large-scale datasets, the output of which is used as an input to our model.

Depth Completion, Depth Estimation, +2

Exploiting Correspondences with All-pairs Correlations for Multi-view Depth Estimation

no code implementations • 5 May 2022 • Kai Cheng, Hao Chen, Wei Yin, Guangkai Xu, Xuejin Chen

Multi-view depth estimation is fundamentally a correspondence-based optimization problem, but previous learning-based methods mainly rely on predefined depth hypotheses to build correspondence as the cost volume and implicitly regularize it to fit depth prediction, deviating from the essence of iterative optimization based on stereo correspondence.

Depth Estimation, Depth Prediction, +1

Towards 3D Scene Reconstruction from Locally Scale-Aligned Monocular Video Depth

no code implementations • 3 Feb 2022 • Guangkai Xu, Wei Yin, Hao Chen, Chunhua Shen, Kai Cheng, Feng Wu, Feng Zhao

However, in some video-based scenarios such as video depth estimation and 3D scene reconstruction from a video, the unknown scale and shift in each per-frame prediction may cause depth inconsistency across frames.

3D Scene Reconstruction, Depth Completion, +1
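
The scale and shift ambiguity mentioned in the excerpt above can be illustrated with a small sketch. It fits one global scale and shift by least squares so that an affine-invariant depth prediction matches a reference. This is a generic alignment step, not the locally scale-aligned method of the paper; the function name `align_scale_shift` and the sample depth values are hypothetical and chosen only for illustration.

```python
from statistics import fmean


def align_scale_shift(pred, target):
    """Return (s, t) minimizing sum((s * p + t - q) ** 2) over paired depths."""
    if len(pred) != len(target) or len(pred) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_p, mean_q = fmean(pred), fmean(target)
    # Closed-form 1D linear regression: s = cov(p, q) / var(p).
    var_p = sum((p - mean_p) ** 2 for p in pred)
    if var_p == 0:
        raise ValueError("prediction is constant; scale is undetermined")
    cov_pq = sum((p - mean_p) * (q - mean_q) for p, q in zip(pred, target))
    s = cov_pq / var_p
    t = mean_q - s * mean_p
    return s, t


# Hypothetical depths at the same four pixels, predicted in two frames.
# frame_b differs from frame_a only by an unknown scale and shift.
frame_a = [1.0, 2.0, 3.0, 4.0]
frame_b = [2.5, 4.5, 6.5, 8.5]

scale, shift = align_scale_shift(frame_b, frame_a)
aligned = [scale * d + shift for d in frame_b]
print(scale, shift, aligned)
```

After alignment the two frames agree at the sampled pixels. A single global fit like this cannot correct distortions that vary across the image, which is the kind of case a local alignment would have to handle.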
