Search Results for author: Ronald Clark

Found 33 papers, 13 papers with code

VINet: Visual-Inertial Odometry as a Sequence-to-Sequence Learning Problem

no code implementations29 Jan 2017 Ronald Clark, Sen Wang, Hongkai Wen, Andrew Markham, Niki Trigoni

In this paper we present an on-manifold sequence-to-sequence learning approach to motion estimation using visual and inertial sensors.

Motion Estimation

VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization

no code implementations CVPR 2017 Ronald Clark, Sen Wang, Andrew Markham, Niki Trigoni, Hongkai Wen

Machine learning techniques, namely convolutional neural networks (CNN) and regression forests, have recently shown great promise in performing 6-DoF localization of monocular images.

Autonomous Driving Indoor Localization

3D Object Reconstruction from a Single Depth View with Adversarial Learning

2 code implementations26 Aug 2017 Bo Yang, Hongkai Wen, Sen Wang, Ronald Clark, Andrew Markham, Niki Trigoni

In this paper, we propose a novel 3D-RecGAN approach, which reconstructs the complete 3D structure of a given object from a single arbitrary depth view using generative adversarial networks.

3D Object Reconstruction Object

DeepVO: Towards End-to-End Visual Odometry with Deep Recurrent Convolutional Neural Networks

5 code implementations25 Sep 2017 Sen Wang, Ronald Clark, Hongkai Wen, Niki Trigoni

This paper presents a novel end-to-end framework for monocular VO by using deep Recurrent Convolutional Neural Networks (RCNNs).

Monocular Visual Odometry Motion Estimation

CodeSLAM - Learning a Compact, Optimisable Representation for Dense Visual SLAM

3 code implementations3 Apr 2018 Michael Bloesch, Jan Czarnowski, Ronald Clark, Stefan Leutenegger, Andrew J. Davison

Our approach is suitable for use in a keyframe-based monocular dense SLAM system: While each keyframe with a code can produce a depth map, the code can be optimised efficiently jointly with pose variables and together with the codes of overlapping keyframes to attain global consistency.

CodeSLAM — Learning a Compact, Optimisable Representation for Dense Visual SLAM

1 code implementation CVPR 2018 Michael Bloesch, Jan Czarnowski, Ronald Clark, Stefan Leutenegger, Andrew J. Davison

Our approach is suitable for use in a keyframe-based monocular dense SLAM system: While each keyframe with a code can produce a depth map, the code can be optimised efficiently jointly with pose variables and together with the codes of overlapping keyframes to attain global consistency.

Fusion++: Volumetric Object-Level SLAM

no code implementations25 Aug 2018 John McCormac, Ronald Clark, Michael Bloesch, Andrew J. Davison, Stefan Leutenegger

Reconstructed objects are stored in an optimisable 6DoF pose graph which is our only persistent map representation.

Loop Closure Detection Object

Learning to Solve Nonlinear Least Squares for Monocular Stereo

no code implementations ECCV 2018 Ronald Clark, Michael Bloesch, Jan Czarnowski, Stefan Leutenegger, Andrew J. Davison

In this paper, we propose a neural nonlinear least squares optimization algorithm which learns to effectively optimize these cost functions even in the presence of adversities.

InteriorNet: Mega-scale Multi-sensor Photo-realistic Indoor Scenes Dataset

no code implementations3 Sep 2018 Wenbin Li, Sajad Saeedi, John McCormac, Ronald Clark, Dimos Tzoumanikas, Qing Ye, Yuzhong Huang, Rui Tang, Stefan Leutenegger

Datasets have gained an enormous amount of popularity in the computer vision community, from training and evaluation of Deep Learning-based methods to benchmarking Simultaneous Localization and Mapping (SLAM).

Benchmarking Simultaneous Localization and Mapping

LS-Net: Learning to Solve Nonlinear Least Squares for Monocular Stereo

no code implementations ECCV 2018 Ronald Clark, Michael Bloesch, Jan Czarnowski, Stefan Leutenegger, Andrew J. Davison

In this paper, we propose LS-Net, a neural nonlinear least squares optimization algorithm which learns to effectively optimize these cost functions even in the presence of adversities.

WiSE-ALE: Wide Sample Estimator for Approximate Latent Embedding

no code implementations16 Feb 2019 Shuyu Lin, Ronald Clark, Robert Birke, Niki Trigoni, Stephen Roberts

Variational Auto-encoders (VAEs) have been very successful as methods for forming compressed latent representations of complex, often high-dimensional, data.

X-Section: Cross-Section Prediction for Enhanced RGBD Fusion

no code implementations3 Mar 2019 Andrea Nicastro, Ronald Clark, Stefan Leutenegger

Detailed 3D reconstruction is an important challenge with application to robotics, augmented and virtual reality, which has seen impressive progress throughout the past years.

3D Reconstruction Object

Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds

1 code implementation NeurIPS 2019 Bo Yang, Jianan Wang, Ronald Clark, Qingyong Hu, Sen Wang, Andrew Markham, Niki Trigoni

The framework directly regresses 3D bounding boxes for all instances in a point cloud, while simultaneously predicting a point-level mask for each instance.

Ranked #13 on 3D Instance Segmentation on S3DIS (mPrec metric)

3D Instance Segmentation Clustering +2

Balancing Reconstruction Quality and Regularisation in ELBO for VAEs

no code implementations9 Sep 2019 Shuyu Lin, Stephen Roberts, Niki Trigoni, Ronald Clark

A trade-off exists between reconstruction quality and the prior regularisation in the Evidence Lower Bound (ELBO) loss that Variational Autoencoder (VAE) models use for learning.

DeepFactors: Real-Time Probabilistic Dense Monocular SLAM

1 code implementation14 Jan 2020 Jan Czarnowski, Tristan Laidlow, Ronald Clark, Andrew J. Davison

The ability to estimate rich geometry and camera motion from monocular imagery is fundamental to future interactive robotics and augmented reality applications.

PointLoc: Deep Pose Regressor for LiDAR Point Cloud Localization

2 code implementations5 Mar 2020 Wei Wang, Bing Wang, Peijun Zhao, Changhao Chen, Ronald Clark, Bo Yang, Andrew Markham, Niki Trigoni

In this paper, we present a novel end-to-end learning-based LiDAR relocalization framework, termed PointLoc, which infers 6-DoF poses directly using only a single point cloud as input, without requiring a pre-built map.

Robotics

Scalable Uncertainty for Computer Vision with Functional Variational Inference

no code implementations CVPR 2020 Eduardo D. C. Carvalho, Ronald Clark, Andrea Nicastro, Paul H. J. Kelly

As Deep Learning continues to yield successful applications in Computer Vision, the ability to quantify all forms of uncertainty is a paramount requirement for its safe and reliable deployment in the real-world.

Depth Estimation Gaussian Processes +3

LaDDer: Latent Data Distribution Modelling with a Generative Prior

1 code implementation31 Aug 2020 Shuyu Lin, Ronald Clark

In this paper, we show that the performance of a learnt generative model is closely related to the model's ability to accurately represent the inferred \textbf{latent data distribution}, i. e. its topology and structural properties.

Representation Learning

Orientation Keypoints for 6D Human Pose Estimation

no code implementations10 Sep 2020 Martin Fisch, Ronald Clark

Most realtime human pose estimation approaches are based on detecting joint positions.

Pose Estimation

Unsupervised Path Regression Networks

no code implementations30 Nov 2020 Michal Pándy, Daniel Lenton, Ronald Clark

We demonstrate that challenging shortest path problems can be solved via direct spline regression from a neural network, trained in an unsupervised manner (i. e. without requiring ground truth optimal paths for training).

Motion Planning regression

Ego-Centric Spatial Memory Networks

no code implementations ICLR 2021 Daniel James Lenton, Stephen James, Ronald Clark, Andrew Davison

With our broad demonstrations, we show that ESMN represents a useful and general computation graph for embodied spatial reasoning, and the module forms a bridge between real-time mapping systems and differentiable memory architectures.

Inductive Bias Semantic Segmentation

Ivy: Templated Deep Learning for Inter-Framework Portability

1 code implementation4 Feb 2021 Daniel Lenton, Fabio Pardo, Fabian Falck, Stephen James, Ronald Clark

We introduce Ivy, a templated Deep Learning (DL) framework which abstracts existing DL frameworks.

End-to-End Egospheric Spatial Memory

2 code implementations15 Feb 2021 Daniel Lenton, Stephen James, Ronald Clark, Andrew J. Davison

Spatial memory, or the ability to remember and recall specific locations and objects, is central to autonomous agents' ability to carry out tasks in real environments.

General Reinforcement Learning Imitation Learning +3

Waypoint Planning Networks

1 code implementation1 May 2021 Alexandru-Iosif Toma, Hussein Ali Jaafar, Hao-Ya Hsueh, Stephen James, Daniel Lenton, Ronald Clark, Sajad Saeedi

We propose waypoint planning networks (WPN), a hybrid algorithm based on LSTMs with a local kernel - a classic algorithm such as A*, and a global kernel using a learned algorithm.

Motion Planning

TermiNeRF: Ray Termination Prediction for Efficient Neural Rendering

no code implementations5 Nov 2021 Martin Piala, Ronald Clark

Volume rendering using neural fields has shown great promise in capturing and synthesizing novel views of 3D scenes.

Neural Rendering

Volumetric Bundle Adjustment for Online Photorealistic Scene Capture

no code implementations CVPR 2022 Ronald Clark

To the best of our knowledge, this is the first method that can achieve online photorealistic scene capture.

Towards the Probabilistic Fusion of Learned Priors into Standard Pipelines for 3D Reconstruction

no code implementations27 Jul 2022 Tristan Laidlow, Jan Czarnowski, Andrea Nicastro, Ronald Clark, Stefan Leutenegger

While systems that pass the output of traditional multi-view stereo approaches to a network for regularisation or refinement currently seem to get the best results, it may be preferable to treat deep neural networks as separate components whose results can be probabilistically fused into geometry-based systems.

3D Reconstruction

Volumetric Cloud Field Reconstruction

no code implementations29 Nov 2023 Jacob Lin, Miguel Farinha, Edward Gryspeerdt, Ronald Clark

Volumetric phenomena, such as clouds and fog, present a significant challenge for 3D reconstruction systems due to their translucent nature and their complex interactions with light.

3D Reconstruction

Instant Uncertainty Calibration of NeRFs Using a Meta-calibrator

no code implementations4 Dec 2023 Niki Amini-Naieni, Tomas Jakab, Andrea Vedaldi, Ronald Clark

To address this, we introduce the concept of a meta-calibrator that performs uncertainty calibration for NeRFs with a single forward pass without the need for holding out any images from the target scene.

Image Reconstruction Medical Diagnosis +2

DIO: Dataset of 3D Mesh Models of Indoor Objects for Robotics and Computer Vision Applications

no code implementations19 Feb 2024 Nillan Nimal, Wenbin Li, Ronald Clark, Sajad Saeedi

These images were processed using a photogrammetry software known as Meshroom to generate a dense surface reconstruction of the scene.

Surface Reconstruction

DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion

1 code implementation25 Mar 2024 Yuanze Lin, Ronald Clark, Philip Torr

We present DreamPolisher, a novel Gaussian Splatting based method with geometric guidance, tailored to learn cross-view consistency and intricate detail from textual descriptions.

3D Generation Text to 3D

Cannot find the paper you are looking for? You can Submit a new open access paper.