no code implementations • 4 Dec 2023 • Niki Amini-Naieni, Tomas Jakab, Andrea Vedaldi, Ronald Clark
In this paper, we introduce the first method for obtaining calibrated uncertainties from NeRF models.
no code implementations • 29 Nov 2023 • Jacob Lin, Miguel Farinha, Edward Gryspeerdt, Ronald Clark
Volumetric phenomena, such as clouds and fog, present a significant challenge for 3D reconstruction systems due to their translucent nature and their complex interactions with light.
no code implementations • 27 Jul 2022 • Tristan Laidlow, Jan Czarnowski, Andrea Nicastro, Ronald Clark, Stefan Leutenegger
While systems that pass the output of traditional multi-view stereo approaches to a network for regularisation or refinement currently seem to get the best results, it may be preferable to treat deep neural networks as separate components whose results can be probabilistically fused into geometry-based systems.
no code implementations • CVPR 2022 • Ronald Clark
To the best of our knowledge, this is the first method that can achieve online photorealistic scene capture.
no code implementations • 5 Nov 2021 • Martin Piala, Ronald Clark
Volume rendering using neural fields has shown great promise in capturing and synthesizing novel views of 3D scenes.
1 code implementation • 1 May 2021 • Alexandru-Iosif Toma, Hussein Ali Jaafar, Hao-Ya Hsueh, Stephen James, Daniel Lenton, Ronald Clark, Sajad Saeedi
We propose waypoint planning networks (WPN), a hybrid algorithm based on LSTMs with a local kernel - a classic algorithm such as A*, and a global kernel using a learned algorithm.
2 code implementations • 15 Feb 2021 • Daniel Lenton, Stephen James, Ronald Clark, Andrew J. Davison
Spatial memory, or the ability to remember and recall specific locations and objects, is central to autonomous agents' ability to carry out tasks in real environments.
1 code implementation • 4 Feb 2021 • Daniel Lenton, Fabio Pardo, Fabian Falck, Stephen James, Ronald Clark
We introduce Ivy, a templated Deep Learning (DL) framework which abstracts existing DL frameworks.
no code implementations • ICLR 2021 • Daniel James Lenton, Stephen James, Ronald Clark, Andrew Davison
With our broad demonstrations, we show that ESMN represents a useful and general computation graph for embodied spatial reasoning, and the module forms a bridge between real-time mapping systems and differentiable memory architectures.
no code implementations • 30 Nov 2020 • Michal Pándy, Daniel Lenton, Ronald Clark
We demonstrate that challenging shortest path problems can be solved via direct spline regression from a neural network, trained in an unsupervised manner (i. e. without requiring ground truth optimal paths for training).
no code implementations • 10 Sep 2020 • Martin Fisch, Ronald Clark
Most realtime human pose estimation approaches are based on detecting joint positions.
1 code implementation • 31 Aug 2020 • Shuyu Lin, Ronald Clark
In this paper, we show that the performance of a learnt generative model is closely related to the model's ability to accurately represent the inferred \textbf{latent data distribution}, i. e. its topology and structural properties.
1 code implementation • IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020 • Shuyu Lin, Ronald Clark, Robert Birke, Sandro Schönborn, Niki Trigoni, Stephen Roberts
In this work, we propose a VAE-LSTM hybrid model as an unsupervised approach for anomaly detection in time series.
no code implementations • CVPR 2020 • Eduardo D. C. Carvalho, Ronald Clark, Andrea Nicastro, Paul H. J. Kelly
As Deep Learning continues to yield successful applications in Computer Vision, the ability to quantify all forms of uncertainty is a paramount requirement for its safe and reliable deployment in the real-world.
1 code implementation • 5 Mar 2020 • Wei Wang, Bing Wang, Peijun Zhao, Changhao Chen, Ronald Clark, Bo Yang, Andrew Markham, Niki Trigoni
In this paper, we present a novel end-to-end learning-based LiDAR relocalization framework, termed PointLoc, which infers 6-DoF poses directly using only a single point cloud as input, without requiring a pre-built map.
Robotics
1 code implementation • 14 Jan 2020 • Jan Czarnowski, Tristan Laidlow, Ronald Clark, Andrew J. Davison
The ability to estimate rich geometry and camera motion from monocular imagery is fundamental to future interactive robotics and augmented reality applications.
no code implementations • 9 Sep 2019 • Shuyu Lin, Stephen Roberts, Niki Trigoni, Ronald Clark
A trade-off exists between reconstruction quality and the prior regularisation in the Evidence Lower Bound (ELBO) loss that Variational Autoencoder (VAE) models use for learning.
1 code implementation • NeurIPS 2019 • Bo Yang, Jianan Wang, Ronald Clark, Qingyong Hu, Sen Wang, Andrew Markham, Niki Trigoni
The framework directly regresses 3D bounding boxes for all instances in a point cloud, while simultaneously predicting a point-level mask for each instance.
Ranked #13 on
3D Instance Segmentation
on S3DIS
(mPrec metric)
no code implementations • ICLR Workshop DeepGenStruct 2019 • Shuyu Lin, Ronald Clark, Robert Birke, Niki Trigoni, Stephen Roberts
In this paper, we present a new generative model for learning latent embeddings.
no code implementations • 3 Mar 2019 • Andrea Nicastro, Ronald Clark, Stefan Leutenegger
Detailed 3D reconstruction is an important challenge with application to robotics, augmented and virtual reality, which has seen impressive progress throughout the past years.
no code implementations • 16 Feb 2019 • Shuyu Lin, Ronald Clark, Robert Birke, Niki Trigoni, Stephen Roberts
Variational Auto-encoders (VAEs) have been very successful as methods for forming compressed latent representations of complex, often high-dimensional, data.
no code implementations • ECCV 2018 • Ronald Clark, Michael Bloesch, Jan Czarnowski, Stefan Leutenegger, Andrew J. Davison
In this paper, we propose LS-Net, a neural nonlinear least squares optimization algorithm which learns to effectively optimize these cost functions even in the presence of adversities.
no code implementations • 3 Sep 2018 • Wenbin Li, Sajad Saeedi, John McCormac, Ronald Clark, Dimos Tzoumanikas, Qing Ye, Yuzhong Huang, Rui Tang, Stefan Leutenegger
Datasets have gained an enormous amount of popularity in the computer vision community, from training and evaluation of Deep Learning-based methods to benchmarking Simultaneous Localization and Mapping (SLAM).
no code implementations • ECCV 2018 • Ronald Clark, Michael Bloesch, Jan Czarnowski, Stefan Leutenegger, Andrew J. Davison
In this paper, we propose a neural nonlinear least squares optimization algorithm which learns to effectively optimize these cost functions even in the presence of adversities.
no code implementations • 25 Aug 2018 • John McCormac, Ronald Clark, Michael Bloesch, Andrew J. Davison, Stefan Leutenegger
Reconstructed objects are stored in an optimisable 6DoF pose graph which is our only persistent map representation.
1 code implementation • CVPR 2018 • Michael Bloesch, Jan Czarnowski, Ronald Clark, Stefan Leutenegger, Andrew J. Davison
Our approach is suitable for use in a keyframe-based monocular dense SLAM system: While each keyframe with a code can produce a depth map, the code can be optimised efficiently jointly with pose variables and together with the codes of overlapping keyframes to attain global consistency.
3 code implementations • 3 Apr 2018 • Michael Bloesch, Jan Czarnowski, Ronald Clark, Stefan Leutenegger, Andrew J. Davison
Our approach is suitable for use in a keyframe-based monocular dense SLAM system: While each keyframe with a code can produce a depth map, the code can be optimised efficiently jointly with pose variables and together with the codes of overlapping keyframes to attain global consistency.
5 code implementations • 25 Sep 2017 • Sen Wang, Ronald Clark, Hongkai Wen, Niki Trigoni
This paper presents a novel end-to-end framework for monocular VO by using deep Recurrent Convolutional Neural Networks (RCNNs).
2 code implementations • 26 Aug 2017 • Bo Yang, Hongkai Wen, Sen Wang, Ronald Clark, Andrew Markham, Niki Trigoni
In this paper, we propose a novel 3D-RecGAN approach, which reconstructs the complete 3D structure of a given object from a single arbitrary depth view using generative adversarial networks.
no code implementations • CVPR 2017 • Ronald Clark, Sen Wang, Andrew Markham, Niki Trigoni, Hongkai Wen
Machine learning techniques, namely convolutional neural networks (CNN) and regression forests, have recently shown great promise in performing 6-DoF localization of monocular images.
no code implementations • 29 Jan 2017 • Ronald Clark, Sen Wang, Hongkai Wen, Andrew Markham, Niki Trigoni
In this paper we present an on-manifold sequence-to-sequence learning approach to motion estimation using visual and inertial sensors.