Search Results for author: Oscar Mendez

Found 18 papers, 0 papers with code

Two Hands Are Better Than One: Resolving Hand to Hand Intersections via Occupancy Networks

no code implementations8 Apr 2024 Maksym Ivashechkin, Oscar Mendez, Richard Bowden

This work addresses the intersection of hands by exploiting an occupancy network that represents the hand's volume as a continuous manifold.

3D Hand Pose Estimation

BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation

no code implementations23 Dec 2023 Tavis Shore, Simon Hadfield, Oscar Mendez

Cross-view image matching for geo-localisation is a challenging problem due to the significant visual difference between aerial and ground-level viewpoints.

Navigate

Improving 3D Pose Estimation for Sign Language

no code implementations18 Aug 2023 Maksym Ivashechkin, Oscar Mendez, Richard Bowden

Given a 2D detection of keypoints in the image, we lift the skeleton to 3D using neural networks to predict both the joint rotations and bone lengths.

3D Pose Estimation valid

Learning Adaptive Neighborhoods for Graph Neural Networks

no code implementations ICCV 2023 Avishkar Saha, Oscar Mendez, Chris Russell, Richard Bowden

Our module can be readily integrated into existing pipelines involving graph convolution operations, replacing the predetermined or existing adjacency matrix with one that is learned, and optimized, as part of the general objective.

Node Classification Point Cloud Classification +1

AFT-VO: Asynchronous Fusion Transformers for Multi-View Visual Odometry Estimation

no code implementations26 Jun 2022 Nimet Kaygusuz, Oscar Mendez, Richard Bowden

To address this limitation, in this work, we propose AFT-VO, a novel transformer-based sensor fusion architecture to estimate VO from multiple sensors.

Motion Estimation Sensor Fusion +1

Generalizing to New Tasks via One-Shot Compositional Subgoals

no code implementations16 May 2022 Xihan Bian, Oscar Mendez, Simon Hadfield

In addition to improving learning efficiency for standard long-term tasks, this approach also makes it possible to perform one-shot generalization to previously unseen tasks, given only a single reference trajectory for the task in a different environment.

Imitation Learning

"The Pedestrian next to the Lamppost" Adaptive Object Graphs for Better Instantaneous Mapping

no code implementations CVPR 2022 Avishkar Saha, Oscar Mendez, Chris Russell, Richard Bowden

Estimating a semantically segmented bird's-eye-view (BEV) map from a single image has become a popular technique for autonomous control and navigation.

MDN-VO: Estimating Visual Odometry with Confidence

no code implementations23 Dec 2021 Nimet Kaygusuz, Oscar Mendez, Richard Bowden

Visual Odometry (VO) is used in many applications including robotics and autonomous systems.

Visual Odometry

Multi-Camera Sensor Fusion for Visual Odometry using Deep Uncertainty Estimation

no code implementations23 Dec 2021 Nimet Kaygusuz, Oscar Mendez, Richard Bowden

To address this issue, we propose a deep sensor fusion framework which estimates vehicle motion using both pose and uncertainty estimations from multiple on-board cameras.

Autonomous Driving Sensor Fusion +1

Markov Localisation using Heatmap Regression and Deep Convolutional Odometry

no code implementations1 Jun 2021 Oscar Mendez, Simon Hadfield, Richard Bowden

Recent advances in deep learning hardware allow large likelihood volumes to be stored directly on the GPU, along with the hardware necessary to efficiently perform GPU-bound 3D convolutions and this obviates many of the disadvantages of grid based methods.

regression

There and Back Again: Self-supervised Multispectral Correspondence Estimation

no code implementations19 Mar 2021 Celyn Walters, Oscar Mendez, Mark Johnson, Richard Bowden

In this work, we aim to address the dense correspondence estimation problem in a way that generalizes to more than one spectrum.

Autonomous Vehicles

Localisation via Deep Imagination: learn the features not the map

no code implementations19 Nov 2018 Jaime Spencer, Oscar Mendez, Richard Bowden, Simon Hadfield

In order to build the embedded map, we train a deep Siamese Fully Convolutional U-Net to perform dense feature extraction.

Taking the Scenic Route to 3D: Optimising Reconstruction From Moving Cameras

no code implementations ICCV 2017 Oscar Mendez, Simon Hadfield, Nicolas Pugeault, Richard Bowden

This approach is ill-suited for reconstruction applications, where learning about the environment is more valuable than speed of traversal.

SeDAR - Semantic Detection and Ranging: Humans can localise without LiDAR, can robots?

no code implementations5 Sep 2017 Oscar Mendez, Simon Hadfield, Nicolas Pugeault, Richard Bowden

Similarly, we do not extrude the 2D geometry present in the floorplan into 3D and try to align it to the real-world.

Cannot find the paper you are looking for? You can Submit a new open access paper.