This work focuses on low-bitrate video streaming scenarios (e.g., 50-200 Kbps) where the video quality is severely compromised.
We demonstrate our ability to learn MVS without 3D supervision using a real dataset, and show that each component of our proposed robust loss results in a significant improvement.
Shape completion, the problem of estimating the complete geometry of objects from partial observations, lies at the core of many vision and robotics applications.
Ranked #4 on Point Cloud Completion on ShapeNet
We propose to counter these language priors for the task of Visual Question Answering (VQA) and make vision (the V in VQA) matter!