1 code implementation • 14 Jul 2023 • Kaylene C. Stocking, Zak Murez, Vijay Badrinarayanan, Jamie Shotton, Alex Kendall, Claire Tomlin, Christopher P. Burgess
Object-centric representations enable autonomous driving algorithms to reason about interactions between many independent agents and scene features.
no code implementations • 12 Aug 2021 • Jeffrey Hawke, Haibo E, Vijay Badrinarayanan, Alex Kendall
The self driving challenge in 2021 is this century's technological equivalent of the space race, and is now entering the second major decade of development.
1 code implementation • ICCV 2021 • Anthony Hu, Zak Murez, Nikhil Mohan, Sofía Dudas, Jeffrey Hawke, Vijay Badrinarayanan, Roberto Cipolla, Alex Kendall
We present FIERY: a probabilistic future prediction model in bird's-eye view from monocular cameras.
Ranked #1 on
Bird's-Eye View Semantic Segmentation
on nuScenes
1 code implementation • ECCV 2020 • Zak Murez, Tarrence van As, James Bartolozzi, Ayan Sinha, Vijay Badrinarayanan, Andrew Rabinovich
Traditional approaches to 3D reconstruction rely on an intermediate representation of depth maps prior to estimating a full 3D model of a scene.
Ranked #1 on
3D Reconstruction
on ScanNet
1 code implementation • ECCV 2020 • Ayan Sinha, Zak Murez, James Bartolozzi, Vijay Badrinarayanan, Andrew Rabinovich
Cost volume based approaches employing 3D convolutional neural networks (CNNs) have considerably improved the accuracy of MVS systems.
Ranked #1 on
Depth Estimation
on ScanNetV2
no code implementations • 18 Mar 2020 • Zhengyang Wu, Srivignesh Rajendran, Tarrence van As, Joelle Zimmermann, Vijay Badrinarayanan, Andrew Rabinovich
With the emergence of Virtual and Mixed Reality (XR) devices, eye tracking has received significant attention in the computer vision community.
no code implementations • 16 Mar 2020 • Ameya Phalak, Vijay Badrinarayanan, Andrew Rabinovich
We introduce Scan2Plan, a novel approach for accurate estimation of a floorplan from a 3D scan of the structural elements of indoor environments.
no code implementations • 24 Aug 2019 • Zhengyang Wu, Srivignesh Rajendran, Tarrence van As, Joelle Zimmermann, Vijay Badrinarayanan, Andrew Rabinovich
Eye gaze estimation and simultaneous semantic understanding of a user through eye images is a crucial component in Virtual and Mixed Reality; enabling energy efficient rendering, multi-focal displays and effective interaction with 3D content.
no code implementations • 25 Apr 2019 • Ameya Phalak, Zhao Chen, Darvin Yi, Khushi Gupta, Vijay Badrinarayanan, Andrew Rabinovich
We present DeepPerimeter, a deep learning based pipeline for inferring a full indoor perimeter (i. e. exterior boundary map) from a sequence of posed RGB images.
no code implementations • 21 Jun 2018 • Ayan Sinha, Zhao Chen, Vijay Badrinarayanan, Andrew Rabinovich
We demonstrate gradient adversarial training for three different scenarios: (1) as a defense to adversarial examples we classify gradient tensors and tune them to be agnostic to the class of their corresponding example, (2) for knowledge distillation, we do binary classification of gradient tensors derived from the student or teacher network and tune the student gradient tensor to mimic the teacher's gradient tensor; and (3) for multi-task learning we classify the gradient tensors derived from different task loss functions and tune them to be statistically indistinguishable.
2 code implementations • ECCV 2018 • Zhao Chen, Vijay Badrinarayanan, Gilad Drozdov, Andrew Rabinovich
We present a deep model that can accurately produce dense depth maps given an RGB image with known depth at a very sparse set of pixels.
4 code implementations • ICML 2018 • Zhao Chen, Vijay Badrinarayanan, Chen-Yu Lee, Andrew Rabinovich
Deep multitask networks, in which one neural network produces multiple predictive outputs, can offer better speed and performance than their single-task counterparts but are challenging to train properly.
1 code implementation • ICCV 2017 • Chen-Yu Lee, Vijay Badrinarayanan, Tomasz Malisiewicz, Andrew Rabinovich
This paper focuses on the task of room layout estimation from a monocular RGB image.
1 code implementation • 30 Nov 2016 • Debidatta Dwibedi, Tomasz Malisiewicz, Vijay Badrinarayanan, Andrew Rabinovich
We present a Deep Cuboid Detector which takes a consumer-quality RGB image of a cluttered scene and localizes all 3D cuboids (box-like objects).
no code implementations • CVPR 2016 • Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla
Scene understanding is a prerequisite to many high level tasks for any automated intelligent machine operating in real world environments.
1 code implementation • 22 Nov 2015 • Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla
Scene understanding is a prerequisite to many high level tasks for any automated intelligent machine operating in real world environments.
no code implementations • 10 Nov 2015 • Ujwal Bonde, Vijay Badrinarayanan, Roberto Cipolla, Minh-Tri Pham
We present a novel deep architecture termed templateNet for depth based object instance recognition.
24 code implementations • 9 Nov 2015 • Alex Kendall, Vijay Badrinarayanan, Roberto Cipolla
Semantic segmentation is an important tool for visual scene understanding and a meaningful measure of uncertainty is essential for decision making.
no code implementations • 5 Nov 2015 • Vijay Badrinarayanan, Bamdev Mishra, Roberto Cipolla
Recent works have highlighted scale invariance or symmetry that is present in the weight space of a typical deep network and the adverse effect that it has on the Euclidean gradient based stochastic gradient descent optimization.
no code implementations • 3 Nov 2015 • Vijay Badrinarayanan, Bamdev Mishra, Roberto Cipolla
Consequently, training the network boils down to using stochastic gradient descent updates on the unit-norm manifold.
75 code implementations • 2 Nov 2015 • Vijay Badrinarayanan, Alex Kendall, Roberto Cipolla
We show that SegNet provides good performance with competitive inference time and more efficient inference memory-wise as compared to other architectures.
Ranked #3 on
Medical Image Segmentation
on RITE
5 code implementations • 27 May 2015 • Vijay Badrinarayanan, Ankur Handa, Roberto Cipolla
These methods lack a mechanism to map deep layer feature maps to input dimensions.
no code implementations • 1 May 2015 • Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla
We are interested in automatic scene understanding from geometric cues.