1 code implementation • CVPR 2023 • Akash Sengupta, Ignas Budvytis, Roberto Cipolla
Monocular 3D human pose and shape estimation is an ill-posed problem since multiple 3D solutions can explain a 2D image of a subject.
Ranked #51 on
3D Human Pose Estimation
on 3DPW
(MPJPE metric)
1 code implementation • CVPR 2023 • Fei Xue, Ignas Budvytis, Roberto Cipolla
Previous methods solve feature matching and pose estimation using a two-stage process by first finding matches and then estimating the pose.
1 code implementation • CVPR 2023 • Fei Xue, Ignas Budvytis, Roberto Cipolla
Visual localization is a fundamental task for various applications including autonomous driving and robotics.
1 code implementation • 21 Oct 2022 • Oliver Boyne, James Charles, Roberto Cipolla
In this paper we present a high fidelity and articulated 3D human foot model.
1 code implementation • 14 Oct 2022 • Anthony Hu, Gianluca Corrado, Nicolas Griffiths, Zak Murez, Corina Gurau, Hudson Yeo, Alex Kendall, Roberto Cipolla, Jamie Shotton
Our approach is the first camera-only method that models static scene, dynamic scene, and ego-behaviour in an urban driving environment.
no code implementations • 10 Oct 2022 • Fotios Logothetis, Roberto Mecca, Ignas Budvytis, Roberto Cipolla
Reconstructing the 3D shape of an object using several images under different light sources is a very challenging task, especially when realistic assumptions such as light propagation and attenuation, perspective viewing geometry and specular light reflection are considered.
1 code implementation • 7 Oct 2022 • Gwangbin Bae, Ignas Budvytis, Roberto Cipolla
The depth of each pixel can be propagated to a query pixel, using the predicted surface normal as guidance.
Ranked #23 on
Monocular Depth Estimation
on NYU-Depth V2
1 code implementation • 5 Oct 2022 • Gwangbin Bae, Martin de La Gorce, Tadas Baltrusaitis, Charlie Hewitt, Dong Chen, Julien Valentin, Roberto Cipolla, Jingjing Shen
Such models are trained on large-scale datasets that contain millions of real human face images collected from the internet.
Ranked #1 on
Face Recognition
on AgeDB
1 code implementation • 3 Oct 2022 • Florian Langer, Gwangbin Bae, Ignas Budvytis, Roberto Cipolla
This combined information is the input to a pose prediction network, SPARC-Net which we train to predict a 9 DoF CAD model pose update.
no code implementations • 29 Sep 2022 • Rudra P. K. Poudel, Harit Pandya, Roberto Cipolla
In particular, we use contrastive unsupervised learning to learn the invariant causal features, which enforces invariance across augmentations of irrelevant parts or styles of the observation.
no code implementations • CVPR 2022 • Fei Xue, Ignas Budvytis, Daniel Olmeda Reino, Roberto Cipolla
Hierarchical frameworks consisting of both coarse and fine localization are often used as the standard pipeline for large-scale visual localization.
1 code implementation • CVPR 2022 • Gwangbin Bae, Ignas Budvytis, Roberto Cipolla
To this end, we propose MaGNet, a novel framework for fusing single-view depth probability with multi-view geometry, to improve the accuracy, robustness and efficiency of multi-view depth estimation.
no code implementations • 10 Dec 2021 • Stanislaw Szymanowicz, James Charles, Roberto Cipolla
The aim of this work is to detect and automatically generate high-level explanations of anomalous events in video.
no code implementations • 30 Nov 2021 • Akash Sengupta, Ignas Budvytis, Roberto Cipolla
This paper addresses the problem of 3D human body shape and pose estimation from RGB images.
1 code implementation • 10 Nov 2021 • Florian Langer, Ignas Budvytis, Roberto Cipolla
In this work we demonstrate how cross-domain keypoint matches from an RGB image to a rendered CAD model allow for more precise object pose predictions compared to ones obtained through direct predictions.
1 code implementation • ICCV 2021 • Akash Sengupta, Ignas Budvytis, Roberto Cipolla
Thus, it is desirable to estimate a distribution over 3D body shape and pose conditioned on the input image instead of a single 3D reconstruction.
Ranked #1 on
3D Human Shape Estimation
on SSP-3D
1 code implementation • ICCV 2021 • Gwangbin Bae, Ignas Budvytis, Roberto Cipolla
Experimental results show that the proposed method outperforms the state-of-the-art in ScanNet and NYUv2, and that the estimated uncertainty correlates well with the prediction error.
Ranked #1 on
Surface Normals Estimation
on ScanNetV2
no code implementations • 16 Jun 2021 • Stanislaw Szymanowicz, James Charles, Roberto Cipolla
In an effort to tackle this problem we make the following contributions: (1) we show how to build interpretable feature representations suitable for detecting anomalies with state of the art performance, (2) we propose an interpretable probabilistic anomaly detector which can describe the reason behind it's response using high level concepts, (3) we are the first to directly consider object interactions for anomaly detection and (4) we propose a new task of explaining anomalies and release a large dataset for evaluating methods on this task.
no code implementations • 27 Apr 2021 • Roberto Mecca, Fotios Logothetis, Ignas Budvytis, Roberto Cipolla
In order to fill the gap in evaluating near-field photometric stereo methods, we introduce LUCES the first real-world 'dataset for near-fieLd point light soUrCe photomEtric Stereo' of 14 objects of a varying of materials.
1 code implementation • ICCV 2021 • Anthony Hu, Zak Murez, Nikhil Mohan, Sofía Dudas, Jeffrey Hawke, Vijay Badrinarayanan, Roberto Cipolla, Alex Kendall
We present FIERY: a probabilistic future prediction model in bird's-eye view from monocular cameras.
Ranked #1 on
Bird's-Eye View Semantic Segmentation
on nuScenes
no code implementations • CVPR 2021 • Akash Sengupta, Ignas Budvytis, Roberto Cipolla
In contrast, we propose a new task: shape and pose estimation from a group of multiple images of a human subject, without constraints on subject pose, camera viewpoint or background conditions between images in the group.
Ranked #3 on
3D Human Shape Estimation
on SSP-3D
1 code implementation • 21 Sep 2020 • Akash Sengupta, Ignas Budvytis, Roberto Cipolla
Thus, we propose STRAPS (Synthetic Training for Real Accurate Pose and Shape), a system that utilises proxy representations, such as silhouettes and 2D joints, as inputs to a shape and pose regression neural network, which is trained with synthetic training data (generated on-the-fly during training using the SMPL statistical body model) to overcome data scarcity.
Ranked #1 on
3D Human Shape Estimation
on MoVi
3D human pose and shape estimation
3D Human Shape Estimation
+3
no code implementations • 12 Sep 2020 • Fotios Logothetis, Ignas Budvytis, Roberto Mecca, Roberto Cipolla
Secondly, we compute the depth by integrating the normal field in order to iteratively estimate light directions and attenuation which is used to compensate the input images to compute reflectance samples for the next iteration.
no code implementations • 11 Sep 2020 • Steven D. Morad, Roberto Mecca, Rudra P. K. Poudel, Stephan Liwicki, Roberto Cipolla
We present NavACL, a method of automatic curriculum learning tailored to the navigation task.
no code implementations • ICCV 2021 • Fotios Logothetis, Ignas Budvytis, Roberto Mecca, Roberto Cipolla
We show that global physical effects can be approximated on the observation map domain and this simplifies and speeds up the data creation procedure.
2 code implementations • ECCV 2020 • Benjamin Biggs, Oliver Boyne, James Charles, Andrew Fitzgibbon, Roberto Cipolla
We introduce an automatic, end-to-end method for recovering the 3D pose and shape of dogs from monocular internet images.
1 code implementation • CVPR 2020 • Thomas Roddick, Roberto Cipolla
Autonomous vehicles commonly rely on highly detailed birds-eye-view maps of their environment, which capture both static elements of the scene such as road layout as well as dynamic elements such as other cars and pedestrians.
1 code implementation • 19 Dec 2019 • Anthony Hu, Alex Kendall, Roberto Cipolla
We present a novel embedding approach for video instance segmentation.
no code implementations • 23 Sep 2019 • Ignas Budvytis, Marvin Teichmann, Tomas Vojir, Roberto Cipolla
We obtain smaller mean distance and angular errors than state-of-the-art 6-DoF pose estimation algorithms based on direct pose regression and pose estimation from scene coordinates on all datasets.
1 code implementation • ICCV 2019 • Chao Zhang, Stephan Liwicki, William Smith, Roberto Cipolla
For the spherical domain, several methods recently adopt an icosahedron mesh, but systems are typically rotation invariant or require significant memory and parameters, thus enabling execution only at very low resolutions.
Ranked #14 on
Semantic Segmentation
on Stanford2D3D Panoramic
21 code implementations • 12 Feb 2019 • Rudra P. K. Poudel, Stephan Liwicki, Roberto Cipolla
The encoder-decoder framework is state-of-the-art for offline semantic image segmentation.
Ranked #7 on
Semantic Segmentation
on SynPASS
1 code implementation • 20 Nov 2018 • Thomas Roddick, Alex Kendall, Roberto Cipolla
This allows us to reason holistically about the spatial configuration of the scene in a domain where scale is consistent and distances between objects are meaningful.
3D Object Detection From Monocular Images
Monocular 3D Object Detection
+1
no code implementations • 14 Nov 2018 • Benjamin Biggs, Thomas Roddick, Andrew Fitzgibbon, Roberto Cipolla
We present a system to recover the 3D shape and motion of a wide variety of quadrupeds from video.
no code implementations • ICCV 2019 • Fotios Logothetis, Roberto Mecca, Roberto Cipolla
In this work, we present a volumetric approach to the multi-view photometric stereo problem.
1 code implementation • ICLR 2019 • Marvin T. T. Teichmann, Roberto Cipolla
For the challenging semantic image segmentation task the most efficient models have traditionally combined the structured modelling capabilities of Conditional Random Fields (CRFs) with the feature extraction power of CNNs.
no code implementations • CVPR 2017 • Fotios Logothetis, Roberto Mecca, Roberto Cipolla
3D reconstruction from shading information through Photometric Stereo is considered a very challenging problem in Computer Vision.
16 code implementations • CVPR 2018 • Alex Kendall, Yarin Gal, Roberto Cipolla
Numerous deep learning applications benefit from multi-task learning with multiple regression and classification objectives.
1 code implementation • CVPR 2017 • Alex Kendall, Roberto Cipolla
Deep learning has shown to be effective for robust and real-time monocular image relocalisation.
16 code implementations • 22 Dec 2016 • Marvin Teichmann, Michael Weber, Marius Zoellner, Roberto Cipolla, Raquel Urtasun
While most approaches to semantic reasoning have focused on improving performance, in this paper we argue that computational times are very important in order to enable real time applications such as autonomous driving.
no code implementations • CVPR 2016 • Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla
Scene understanding is a prerequisite to many high level tasks for any automated intelligent machine operating in real world environments.
no code implementations • CVPR 2017 • Yani Ioannou, Duncan Robertson, Roberto Cipolla, Antonio Criminisi
We propose a new method for creating computationally efficient and compact convolutional neural networks (CNNs) using a novel sparse connection structure that resembles a tree root.
no code implementations • CVPR 2016 • Sukrit Shankar, Duncan Robertson, Yani Ioannou, Antonio Criminisi, Roberto Cipolla
Deep Convolutional Neural Networks (CNNs) have recently evinced immense success for various image recognition tasks.
1 code implementation • 22 Nov 2015 • Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla
Scene understanding is a prerequisite to many high level tasks for any automated intelligent machine operating in real world environments.
no code implementations • 20 Nov 2015 • Yani Ioannou, Duncan Robertson, Jamie Shotton, Roberto Cipolla, Antonio Criminisi
Applying our method to a near state-of-the-art network for CIFAR, we achieved comparable accuracy with 46% less compute and 55% fewer parameters.
1 code implementation • 19 Nov 2015 • Viorica Patraucean, Ankur Handa, Roberto Cipolla
At each time step, the system receives as input a video frame, predicts the optical flow based on the current observation and the LSTM memory state as a dense transformation map, and applies it to the current frame to generate the next frame.
no code implementations • 10 Nov 2015 • Ujwal Bonde, Vijay Badrinarayanan, Roberto Cipolla, Minh-Tri Pham
We present a novel deep architecture termed templateNet for depth based object instance recognition.
24 code implementations • 9 Nov 2015 • Alex Kendall, Vijay Badrinarayanan, Roberto Cipolla
Semantic segmentation is an important tool for visual scene understanding and a meaningful measure of uncertainty is essential for decision making.
no code implementations • 5 Nov 2015 • Vijay Badrinarayanan, Bamdev Mishra, Roberto Cipolla
Recent works have highlighted scale invariance or symmetry that is present in the weight space of a typical deep network and the adverse effect that it has on the Euclidean gradient based stochastic gradient descent optimization.
no code implementations • 3 Nov 2015 • Vijay Badrinarayanan, Bamdev Mishra, Roberto Cipolla
Consequently, training the network boils down to using stochastic gradient descent updates on the unit-norm manifold.
75 code implementations • 2 Nov 2015 • Vijay Badrinarayanan, Alex Kendall, Roberto Cipolla
We show that SegNet provides good performance with competitive inference time and more efficient inference memory-wise as compared to other architectures.
Ranked #3 on
Medical Image Segmentation
on RITE
1 code implementation • 19 Sep 2015 • Alex Kendall, Roberto Cipolla
Using a Bayesian convolutional neural network implementation we obtain an estimate of the model's relocalization uncertainty and improve state of the art localization accuracy on a large scale outdoor dataset.
5 code implementations • 27 May 2015 • Vijay Badrinarayanan, Ankur Handa, Roberto Cipolla
These methods lack a mechanism to map deep layer feature maps to input dimensions.
5 code implementations • ICCV 2015 • Alex Kendall, Matthew Grimes, Roberto Cipolla
We present a robust and real-time monocular six degree of freedom relocalization system.
no code implementations • 1 May 2015 • Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla
We are interested in automatic scene understanding from geometric cues.
no code implementations • CVPR 2015 • Sukrit Shankar, Vikas K. Garg, Roberto Cipolla
To ameliorate this limitation, we propose Deep-Carving, a novel training procedure with CNNs, that helps the net efficiently carve itself for the task of multiple attribute prediction.
no code implementations • CVPR 2014 • Wenhan Luo, Tae-Kyun Kim, Bjorn Stenger, Xiaowei Zhao, Roberto Cipolla
In this paper, we propose a label propagation framework to handle the multiple object tracking (MOT) problem for a generic object type (cf.
no code implementations • CVPR 2013 • Robert Anderson, Bjorn Stenger, Vincent Wan, Roberto Cipolla
This paper presents a complete system for expressive visual text-to-speech (VTTS), which is capable of producing expressive output, in the form of a 'talking head', given an input text and a set of continuous expression weights.
no code implementations • CVPR 2013 • Tsz-Ho Yu, Tae-Kyun Kim, Roberto Cipolla
This work addresses the challenging problem of unconstrained 3D human pose estimation (HPE) from a novel perspective.
no code implementations • NeurIPS 2008 • Tae-Kyun Kim, Roberto Cipolla
We present a new co-clustering problem of images and visual features.