Excavating the Potential Capacity of Self-Supervised Monocular Depth Estimation

Self-supervised methods play an increasingly important role in monocular depth estimation due to their great potential and low annotation cost. To close the gap with supervised methods, recent works take advantage of extra constraints, e.g., semantic segmentation. However, these methods will inevitably increase the burden on the model. In this paper, we show theoretical and empirical evidence that the potential capacity of self-supervised monocular depth estimation can be excavated without increasing this cost. In particular, we propose (1) a novel data augmentation approach called data grafting, which forces the model to explore more cues to infer depth besides the vertical image position, (2) an exploratory self-distillation loss, which is supervised by the self-distillation label generated by our new post-processing method - selective post-processing, and (3) the full-scale network, designed to endow the encoder with the specialization of depth estimation task and enhance the representational power of the model. Extensive experiments show that our contributions can bring significant performance improvement to the baseline with even less computational overhead, and our model, named EPCDepth, surpasses the previous state-of-the-art methods even those supervised by additional constraints.

PDF Abstract ICCV 2021 PDF ICCV 2021 Abstract

Datasets


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Monocular Depth Estimation KITTI Eigen split unsupervised EPCDepth(S+1024x320) absolute relative error 0.091 # 4
RMSE 4.207 # 7
Sq Rel 0.646 # 6
RMSE log 0.176 # 11
Delta < 1.25 0.901 # 10
Delta < 1.25^2 0.966 # 8
Delta < 1.25^3 0.983 # 10
Resolution 1024x320 # 1
Mono X # 1
Monocular Depth Estimation KITTI Eigen split unsupervised EPCDepth(S+640x192) absolute relative error 0.099 # 11
RMSE 4.490 # 17
Sq Rel 0.183 # 1
RMSE log 0.183 # 16
Delta < 1.25 0.888 # 15
Delta < 1.25^2 0.963 # 14
Delta < 1.25^3 0.982 # 14

Methods


No methods listed for this paper. Add relevant methods here