4 code implementations • CVPR 2021 • Xiaotian Chen, Yuwang Wang, Xuejin Chen, Wenjun Zeng
S2R-DepthNet consists of: a) a Structure Extraction (STE) module which extracts a domaininvariant structural representation from an image by disentangling the image into domain-invariant structure and domain-specific style components, b) a Depth-specific Attention (DSA) module, which learns task-specific knowledge to suppress depth-irrelevant structures for better depth estimation and generalization, and c) a depth prediction module (DP) to predict depth from the depth-specific representation.
1 code implementation • 13 Jul 2019 • Xiaotian Chen, Xuejin Chen, Zheng-Jun Zha
We propose a Residual Pyramid Decoder (RPD) which expresses global scene structure in upper levels to represent layouts, and local structure in lower levels to present shape details.
Ranked #56 on Monocular Depth Estimation on NYU-Depth V2