Search Results for author: Ronald Clark

Found 33 papers, 13 papers with code

VINet: Visual-Inertial Odometry as a Sequence-to-Sequence Learning Problem

no code implementations • 29 Jan 2017 • Ronald Clark, Sen Wang, Hongkai Wen, Andrew Markham, Niki Trigoni

In this paper we present an on-manifold sequence-to-sequence learning approach to motion estimation using visual and inertial sensors.

Motion Estimation

Paper
Add Code

VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization

no code implementations • CVPR 2017 • Ronald Clark, Sen Wang, Andrew Markham, Niki Trigoni, Hongkai Wen

Machine learning techniques, namely convolutional neural networks (CNN) and regression forests, have recently shown great promise in performing 6-DoF localization of monocular images.

Autonomous Driving Indoor Localization

Paper
Add Code

3D Object Reconstruction from a Single Depth View with Adversarial Learning

2 code implementations • 26 Aug 2017 • Bo Yang, Hongkai Wen, Sen Wang, Ronald Clark, Andrew Markham, Niki Trigoni

In this paper, we propose a novel 3D-RecGAN approach, which reconstructs the complete 3D structure of a given object from a single arbitrary depth view using generative adversarial networks.

3D Object Reconstruction Object

129

Paper
Code

DeepVO: Towards End-to-End Visual Odometry with Deep Recurrent Convolutional Neural Networks

5 code implementations • 25 Sep 2017 • Sen Wang, Ronald Clark, Hongkai Wen, Niki Trigoni

This paper presents a novel end-to-end framework for monocular VO by using deep Recurrent Convolutional Neural Networks (RCNNs).

Monocular Visual Odometry Motion Estimation

318

Paper
Code

CodeSLAM - Learning a Compact, Optimisable Representation for Dense Visual SLAM

3 code implementations • 3 Apr 2018 • Michael Bloesch, Jan Czarnowski, Ronald Clark, Stefan Leutenegger, Andrew J. Davison

Our approach is suitable for use in a keyframe-based monocular dense SLAM system: While each keyframe with a code can produce a depth map, the code can be optimised efficiently jointly with pose variables and together with the codes of overlapping keyframes to attain global consistency.

189

Paper
Code

CodeSLAM â Learning a Compact, Optimisable Representation for Dense Visual SLAM

1 code implementation • CVPR 2018 • Michael Bloesch, Jan Czarnowski, Ronald Clark, Stefan Leutenegger, Andrew J. Davison

Paper
Code

Fusion++: Volumetric Object-Level SLAM

no code implementations • 25 Aug 2018 • John McCormac, Ronald Clark, Michael Bloesch, Andrew J. Davison, Stefan Leutenegger

Reconstructed objects are stored in an optimisable 6DoF pose graph which is our only persistent map representation.

Loop Closure Detection Object

Paper
Add Code

Learning to Solve Nonlinear Least Squares for Monocular Stereo

no code implementations • ECCV 2018 • Ronald Clark, Michael Bloesch, Jan Czarnowski, Stefan Leutenegger, Andrew J. Davison

In this paper, we propose a neural nonlinear least squares optimization algorithm which learns to effectively optimize these cost functions even in the presence of adversities.

Paper
Add Code

InteriorNet: Mega-scale Multi-sensor Photo-realistic Indoor Scenes Dataset

no code implementations • 3 Sep 2018 • Wenbin Li, Sajad Saeedi, John McCormac, Ronald Clark, Dimos Tzoumanikas, Qing Ye, Yuzhong Huang, Rui Tang, Stefan Leutenegger

Datasets have gained an enormous amount of popularity in the computer vision community, from training and evaluation of Deep Learning-based methods to benchmarking Simultaneous Localization and Mapping (SLAM).

Benchmarking Simultaneous Localization and Mapping

Paper
Add Code

LS-Net: Learning to Solve Nonlinear Least Squares for Monocular Stereo

no code implementations • ECCV 2018 • Ronald Clark, Michael Bloesch, Jan Czarnowski, Stefan Leutenegger, Andrew J. Davison

In this paper, we propose LS-Net, a neural nonlinear least squares optimization algorithm which learns to effectively optimize these cost functions even in the presence of adversities.

Paper
Add Code

WiSE-ALE: Wide Sample Estimator for Approximate Latent Embedding

no code implementations • 16 Feb 2019 • Shuyu Lin, Ronald Clark, Robert Birke, Niki Trigoni, Stephen Roberts

Variational Auto-encoders (VAEs) have been very successful as methods for forming compressed latent representations of complex, often high-dimensional, data.

Paper
Add Code

X-Section: Cross-Section Prediction for Enhanced RGBD Fusion

no code implementations • 3 Mar 2019 • Andrea Nicastro, Ronald Clark, Stefan Leutenegger

Detailed 3D reconstruction is an important challenge with application to robotics, augmented and virtual reality, which has seen impressive progress throughout the past years.

3D Reconstruction Object

Paper
Add Code

WiSE-ALE: Wide Sample Estimator for Aggregate Latent Embedding

no code implementations • ICLR Workshop DeepGenStruct 2019 • Shuyu Lin, Ronald Clark, Robert Birke, Niki Trigoni, Stephen Roberts

In this paper, we present a new generative model for learning latent embeddings.

Paper
Add Code

Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds

1 code implementation • NeurIPS 2019 • Bo Yang, Jianan Wang, Ronald Clark, Qingyong Hu, Sen Wang, Andrew Markham, Niki Trigoni

The framework directly regresses 3D bounding boxes for all instances in a point cloud, while simultaneously predicting a point-level mask for each instance.

Ranked #13 on 3D Instance Segmentation on S3DIS (mPrec metric)

3D Instance Segmentation Clustering +2

386

Paper
Code

Balancing Reconstruction Quality and Regularisation in ELBO for VAEs

no code implementations • 9 Sep 2019 • Shuyu Lin, Stephen Roberts, Niki Trigoni, Ronald Clark

A trade-off exists between reconstruction quality and the prior regularisation in the Evidence Lower Bound (ELBO) loss that Variational Autoencoder (VAE) models use for learning.

Paper
Add Code

DeepFactors: Real-Time Probabilistic Dense Monocular SLAM

1 code implementation • 14 Jan 2020 • Jan Czarnowski, Tristan Laidlow, Ronald Clark, Andrew J. Davison

The ability to estimate rich geometry and camera motion from monocular imagery is fundamental to future interactive robotics and augmented reality applications.

617

Paper
Code

PointLoc: Deep Pose Regressor for LiDAR Point Cloud Localization

2 code implementations • 5 Mar 2020 • Wei Wang, Bing Wang, Peijun Zhao, Changhao Chen, Ronald Clark, Bo Yang, Andrew Markham, Niki Trigoni

In this paper, we present a novel end-to-end learning-based LiDAR relocalization framework, termed PointLoc, which infers 6-DoF poses directly using only a single point cloud as input, without requiring a pre-built map.

Robotics

Paper
Code

Scalable Uncertainty for Computer Vision with Functional Variational Inference

no code implementations • CVPR 2020 • Eduardo D. C. Carvalho, Ronald Clark, Andrea Nicastro, Paul H. J. Kelly

As Deep Learning continues to yield successful applications in Computer Vision, the ability to quantify all forms of uncertainty is a paramount requirement for its safe and reliable deployment in the real-world.

Depth Estimation Gaussian Processes +3

Paper
Add Code

Anomaly Detection for Time Series Using VAE-LSTM Hybrid Model

1 code implementation • IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020 • Shuyu Lin, Ronald Clark, Robert Birke, Sandro Schönborn, Niki Trigoni, Stephen Roberts

In this work, we propose a VAE-LSTM hybrid model as an unsupervised approach for anomaly detection in time series.

Anomaly Detection Time Series +1

391

Paper
Code

LaDDer: Latent Data Distribution Modelling with a Generative Prior

1 code implementation • 31 Aug 2020 • Shuyu Lin, Ronald Clark

In this paper, we show that the performance of a learnt generative model is closely related to the model's ability to accurately represent the inferred \textbf{latent data distribution}, i. e. its topology and structural properties.

Representation Learning

Paper
Code

Orientation Keypoints for 6D Human Pose Estimation

no code implementations • 10 Sep 2020 • Martin Fisch, Ronald Clark

Most realtime human pose estimation approaches are based on detecting joint positions.

Pose Estimation

Paper
Add Code

Unsupervised Path Regression Networks

no code implementations • 30 Nov 2020 • Michal Pándy, Daniel Lenton, Ronald Clark

We demonstrate that challenging shortest path problems can be solved via direct spline regression from a neural network, trained in an unsupervised manner (i. e. without requiring ground truth optimal paths for training).

Motion Planning regression

Paper
Add Code

Ego-Centric Spatial Memory Networks

no code implementations • ICLR 2021 • Daniel James Lenton, Stephen James, Ronald Clark, Andrew Davison

With our broad demonstrations, we show that ESMN represents a useful and general computation graph for embodied spatial reasoning, and the module forms a bridge between real-time mapping systems and differentiable memory architectures.

Inductive Bias Semantic Segmentation

Paper
Add Code

Ivy: Templated Deep Learning for Inter-Framework Portability

1 code implementation • 4 Feb 2021 • Daniel Lenton, Fabio Pardo, Fabian Falck, Stephen James, Ronald Clark

We introduce Ivy, a templated Deep Learning (DL) framework which abstracts existing DL frameworks.

14,014

Paper
Code

End-to-End Egospheric Spatial Memory

2 code implementations • 15 Feb 2021 • Daniel Lenton, Stephen James, Ronald Clark, Andrew J. Davison

Spatial memory, or the ability to remember and recall specific locations and objects, is central to autonomous agents' ability to carry out tasks in real environments.

General Reinforcement Learning Imitation Learning +3

Paper
Code

Waypoint Planning Networks

1 code implementation • 1 May 2021 • Alexandru-Iosif Toma, Hussein Ali Jaafar, Hao-Ya Hsueh, Stephen James, Daniel Lenton, Ronald Clark, Sajad Saeedi

We propose waypoint planning networks (WPN), a hybrid algorithm based on LSTMs with a local kernel - a classic algorithm such as A*, and a global kernel using a learned algorithm.

Motion Planning

Paper
Code

TermiNeRF: Ray Termination Prediction for Efficient Neural Rendering

no code implementations • 5 Nov 2021 • Martin Piala, Ronald Clark

Volume rendering using neural fields has shown great promise in capturing and synthesizing novel views of 3D scenes.

Neural Rendering

Paper
Add Code

Volumetric Bundle Adjustment for Online Photorealistic Scene Capture

no code implementations • CVPR 2022 • Ronald Clark

To the best of our knowledge, this is the first method that can achieve online photorealistic scene capture.

Paper
Add Code

Towards the Probabilistic Fusion of Learned Priors into Standard Pipelines for 3D Reconstruction

no code implementations • 27 Jul 2022 • Tristan Laidlow, Jan Czarnowski, Andrea Nicastro, Ronald Clark, Stefan Leutenegger

While systems that pass the output of traditional multi-view stereo approaches to a network for regularisation or refinement currently seem to get the best results, it may be preferable to treat deep neural networks as separate components whose results can be probabilistically fused into geometry-based systems.

3D Reconstruction

Paper
Add Code

Volumetric Cloud Field Reconstruction

no code implementations • 29 Nov 2023 • Jacob Lin, Miguel Farinha, Edward Gryspeerdt, Ronald Clark

Volumetric phenomena, such as clouds and fog, present a significant challenge for 3D reconstruction systems due to their translucent nature and their complex interactions with light.

3D Reconstruction

Paper
Add Code

Instant Uncertainty Calibration of NeRFs Using a Meta-calibrator

no code implementations • 4 Dec 2023 • Niki Amini-Naieni, Tomas Jakab, Andrea Vedaldi, Ronald Clark

To address this, we introduce the concept of a meta-calibrator that performs uncertainty calibration for NeRFs with a single forward pass without the need for holding out any images from the target scene.

Image Reconstruction Medical Diagnosis +2

Paper
Add Code

DIO: Dataset of 3D Mesh Models of Indoor Objects for Robotics and Computer Vision Applications

no code implementations • 19 Feb 2024 • Nillan Nimal, Wenbin Li, Ronald Clark, Sajad Saeedi

These images were processed using a photogrammetry software known as Meshroom to generate a dense surface reconstruction of the scene.

Surface Reconstruction

Paper
Add Code

DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion

1 code implementation • 25 Mar 2024 • Yuanze Lin, Ronald Clark, Philip Torr

We present DreamPolisher, a novel Gaussian Splatting based method with geometric guidance, tailored to learn cross-view consistency and intricate detail from textual descriptions.

3D Generation Text to 3D

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.