We present an approach which takes advantage of both structure and semantics for unsupervised monocular learning of depth and ego-motion.
Models and examples built with TensorFlow
#15 best model for Monocular Depth Estimation on KITTI Eigen split
Per-pixel ground-truth depth data is challenging to acquire at scale.
#7 best model for Monocular Depth Estimation on KITTI Eigen split
We propose GeoNet, a jointly unsupervised learning framework for monocular depth, optical flow and ego-motion estimation from videos.
Human motion modelling is a classical problem at the intersection of graphics and computer vision, with applications spanning human-computer interaction, motion synthesis, and motion prediction for virtual and augmented reality.
At each time step, the system receives as input a video frame, predicts the optical flow based on the current observation and the LSTM memory state as a dense transformation map, and applies it to the current frame to generate the next frame.
We address the unsupervised learning of several interconnected problems in low-level vision: single view depth prediction, camera motion estimation, optical flow, and segmentation of a video into the static scene and moving regions.
#17 best model for Monocular Depth Estimation on KITTI Eigen split
Many video enhancement algorithms rely on optical flow to register frames in a video sequence.
#4 best model for Video Frame Interpolation on Vimeo90k