Search Results for author: Andrew Markham

Found 71 papers, 34 papers with code

VINet: Visual-Inertial Odometry as a Sequence-to-Sequence Learning Problem

no code implementations • 29 Jan 2017 • Ronald Clark, Sen Wang, Hongkai Wen, Andrew Markham, Niki Trigoni

In this paper we present an on-manifold sequence-to-sequence learning approach to motion estimation using visual and inertial sensors.

Motion Estimation
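
To illustrate what "visual-inertial odometry as sequence-to-sequence learning" can look like in code, the hedged PyTorch sketch below fuses per-frame visual features with an encoding of the IMU sub-sequence between frames and regresses one relative pose per step. It is not the authors' implementation: VINet performs the regression on the SE(3) manifold, while this sketch outputs a plain 6-D vector, and all dimensions are placeholders.

```python
import torch
import torch.nn as nn

class VIOSeq2Seq(nn.Module):
    """Minimal sketch of visual-inertial odometry as sequence learning
    (illustrative only; the real model works on the SE(3) manifold)."""
    def __init__(self, vis_dim=512, imu_hidden=128, core_hidden=256):
        super().__init__()
        self.imu_rnn = nn.LSTM(6, imu_hidden, batch_first=True)        # gyro + accel
        self.core = nn.LSTM(vis_dim + imu_hidden, core_hidden, batch_first=True)
        self.pose = nn.Linear(core_hidden, 6)                          # relative pose per step

    def forward(self, vis_feats, imu):
        # vis_feats: (B, T, vis_dim) CNN features, one per frame pair
        # imu:       (B, T, K, 6) raw IMU samples between consecutive frames
        B, T, K, _ = imu.shape
        _, (h, _) = self.imu_rnn(imu.reshape(B * T, K, 6))
        imu_feat = h[-1].reshape(B, T, -1)                             # one IMU embedding per step
        fused, _ = self.core(torch.cat([vis_feats, imu_feat], dim=-1))
        return self.pose(fused)                                        # (B, T, 6)
```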

VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization

no code implementations • CVPR 2017 • Ronald Clark, Sen Wang, Andrew Markham, Niki Trigoni, Hongkai Wen

Machine learning techniques, namely convolutional neural networks (CNN) and regression forests, have recently shown great promise in performing 6-DoF localization of monocular images.

Autonomous Driving Indoor Localization

3D Object Reconstruction from a Single Depth View with Adversarial Learning

2 code implementations • 26 Aug 2017 • Bo Yang, Hongkai Wen, Sen Wang, Ronald Clark, Andrew Markham, Niki Trigoni

In this paper, we propose a novel 3D-RecGAN approach, which reconstructs the complete 3D structure of a given object from a single arbitrary depth view using generative adversarial networks.

3D Object Reconstruction Object
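
The approach pairs an encoder-decoder generator over voxel grids with a 3D discriminator. The code below is a simplified, hypothetical rendition of that pattern (layer counts, the 64^3 grid resolution and the loss setup are assumptions, not the published architecture).

```python
import torch
import torch.nn as nn

class VoxelCompletionG(nn.Module):
    """Generator sketch: partial occupancy from a voxelised depth view in,
    completed occupancy grid out."""
    def __init__(self):
        super().__init__()
        self.enc = nn.Sequential(
            nn.Conv3d(1, 32, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv3d(32, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2))
        self.dec = nn.Sequential(
            nn.ConvTranspose3d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose3d(32, 1, 4, stride=2, padding=1), nn.Sigmoid())

    def forward(self, partial):                 # (B, 1, 64, 64, 64)
        return self.dec(self.enc(partial))      # (B, 1, 64, 64, 64) occupancy

class VoxelD(nn.Module):
    """Discriminator sketch: real/fake logit for a full occupancy grid."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv3d(1, 32, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv3d(32, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.AdaptiveAvgPool3d(1), nn.Flatten(), nn.Linear(64, 1))

    def forward(self, voxels):
        return self.net(voxels)
```

Training would combine a voxel-wise reconstruction loss on the generator with the usual adversarial losses; the published model reconstructs at a much higher resolution than this sketch.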

Learning from lions: inferring the utility of agents from their trajectories

no code implementations • 7 Sep 2017 • Adam D. Cobb, Andrew Markham, Stephen J. Roberts

We build a model using Gaussian processes to infer a spatio-temporal vector field from observed agent trajectories.

Decision Making Gaussian Processes
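
As a rough sketch of fitting a spatio-temporal vector field with Gaussian processes, the snippet below regresses finite-difference velocities on (x, y, t) inputs with scikit-learn. The data are random placeholders, and the paper's kernel choices and utility model are not reproduced.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

# Hypothetical trajectory data: (x, y, t) positions along each agent's path and
# the finite-difference velocities observed there.
X = np.random.rand(200, 3)                       # (x, y, t) inputs
V = np.random.randn(200, 2) * 0.1                # (vx, vy) targets

kernel = RBF(length_scale=[1.0, 1.0, 5.0]) + WhiteKernel(noise_level=0.01)
gp_vx = GaussianProcessRegressor(kernel=kernel).fit(X, V[:, 0])
gp_vy = GaussianProcessRegressor(kernel=kernel).fit(X, V[:, 1])

# Query the inferred spatio-temporal vector field at a new location and time.
query = np.array([[0.5, 0.5, 0.0]])
print(gp_vx.predict(query), gp_vy.predict(query))
```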

IONet: Learning to Cure the Curse of Drift in Inertial Odometry

no code implementations • 30 Jan 2018 • Changhao Chen, Xiaoxuan Lu, Andrew Markham, Niki Trigoni

Inertial sensors play a pivotal role in indoor localization, which in turn lays the foundation for pervasive personal applications.

Indoor Localization
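
IONet treats inertial odometry as learning over independent windows of IMU measurements. The sketch below is a loose approximation rather than the published network: it maps a window of gyroscope and accelerometer samples to a 2D polar displacement (length and heading change) that can be chained into a trajectory, which is the general idea behind curing drift at the window level.

```python
import torch
import torch.nn as nn

class InertialOdometry(nn.Module):
    """Sketch: map a window of IMU samples to a polar displacement (dl, dpsi)."""
    def __init__(self, hid=128):
        super().__init__()
        self.rnn = nn.LSTM(6, hid, num_layers=2, batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hid, 2)       # (delta_length, delta_heading) per window

    def forward(self, imu_window):              # (B, N, 6): gyro + accel samples
        out, _ = self.rnn(imu_window)
        return self.head(out[:, -1])            # displacement over the whole window
```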

Dense 3D Object Reconstruction from a Single Depth View

2 code implementations • 1 Feb 2018 • Bo Yang, Stefano Rosa, Andrew Markham, Niki Trigoni, Hongkai Wen

Unlike existing work which typically requires multiple views of the same object or class labels to recover the full 3D geometry, the proposed 3D-RecGAN++ only takes the voxel grid representation of a depth view of the object as input, and is able to generate the complete 3D occupancy grid with a high resolution of 256^3 by recovering the occluded/missing regions.

3D Object Reconstruction Object

Defo-Net: Learning Body Deformation using Generative Adversarial Networks

1 code implementation • 16 Apr 2018 • Zhihua Wang, Stefano Rosa, Linhai Xie, Bo Yang, Sen Wang, Niki Trigoni, Andrew Markham

Modelling the physical properties of everyday objects is a fundamental prerequisite for autonomous robots.

Robotics

3D-PhysNet: Learning the Intuitive Physics of Non-Rigid Object Deformations

1 code implementation • 25 Apr 2018 • Zhihua Wang, Stefano Rosa, Bo Yang, Sen Wang, Niki Trigoni, Andrew Markham

This is further confounded by the fact that shape information about encountered objects in the real world is often impaired by occlusions, noise and missing regions, e.g. a robot manipulating an object will only be able to observe a partial view of the entire solid.

Robust Attentional Aggregation of Deep Feature Sets for Multi-view 3D Reconstruction

1 code implementation • 2 Aug 2018 • Bo Yang, Sen Wang, Andrew Markham, Niki Trigoni

However, GRU based approaches are unable to consistently estimate 3D shapes given different permutations of the same set of input images as the recurrent unit is permutation variant.

3D Object Reconstruction 3D Reconstruction +1
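
The remedy proposed here is to replace the recurrent unit with a permutation-invariant, attention-weighted aggregation over the set of per-view features. A minimal sketch of such an attentional pooling layer is shown below (dimensions are illustrative; this is not the exact published module).

```python
import torch
import torch.nn as nn

class AttentionalAggregation(nn.Module):
    """Pool a variable-sized set of view features into one vector,
    independently of the order of the views."""
    def __init__(self, dim=256):
        super().__init__()
        self.score = nn.Linear(dim, dim)                     # per-element attention scores

    def forward(self, feats):                                # feats: (B, n_views, dim)
        weights = torch.softmax(self.score(feats), dim=1)    # normalise over the set
        return (weights * feats).sum(dim=1)                  # (B, dim), permutation-invariant
```

Because the softmax is taken over the set dimension and the weighted elements are summed, permuting the input views leaves the output unchanged, which is exactly the property GRU-based aggregation lacks.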

Neural Allocentric Intuitive Physics Prediction from Real Videos

no code implementations • 7 Sep 2018 • Zhihua Wang, Stefano Rosa, Yishu Miao, Zihang Lai, Linhai Xie, Andrew Markham, Niki Trigoni

In this framework, real images are first converted to a synthetic domain representation that reduces complexity arising from lighting and texture.

GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks

no code implementations • 16 Sep 2018 • Yasin Almalioglu, Muhamad Risqi U. Saputra, Pedro P. B. de Gusmao, Andrew Markham, Niki Trigoni

In the last decade, supervised deep learning approaches have been extensively employed in visual odometry (VO) applications, but they are not feasible in environments where labelled data is not abundant.

Depth Estimation Monocular Visual Odometry +1

OxIOD: The Dataset for Deep Inertial Odometry

no code implementations • 20 Sep 2018 • Changhao Chen, Peijun Zhao, Chris Xiaoxuan Lu, Wei Wang, Andrew Markham, Niki Trigoni

Advances in micro-electro-mechanical systems (MEMS) enable inertial measurement units (IMUs) to be small, cheap, energy efficient, and widely used in smartphones, robots, and drones.

Transferring Physical Motion Between Domains for Neural Inertial Tracking

no code implementations • 4 Oct 2018 • Changhao Chen, Yishu Miao, Chris Xiaoxuan Lu, Phil Blunsom, Andrew Markham, Niki Trigoni

Inertial information processing plays a pivotal role in ego-motion awareness for mobile agents, as inertial measurements are entirely egocentric and not environment dependent.

Domain Adaptation

Learning with Stochastic Guidance for Navigation

1 code implementation • 27 Nov 2018 • Linhai Xie, Yishu Miao, Sen Wang, Phil Blunsom, Zhihua Wang, Changhao Chen, Andrew Markham, Niki Trigoni

Due to the sparse rewards and high degree of environment variation, reinforcement learning approaches such as Deep Deterministic Policy Gradient (DDPG) are plagued by issues of high variance when applied in complex real world environments.

Robotics

Selective Sensor Fusion for Neural Visual-Inertial Odometry

no code implementations • CVPR 2019 • Changhao Chen, Stefano Rosa, Yishu Miao, Chris Xiaoxuan Lu, Wei Wu, Andrew Markham, Niki Trigoni

Deep learning approaches for Visual-Inertial Odometry (VIO) have proven successful, but they rarely focus on incorporating robust fusion strategies for dealing with imperfect input sensory data.

Autonomous Driving Sensor Fusion
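
One fusion strategy in this line of work is a soft, feature-wise gating of each modality conditioned on both. The snippet below sketches that idea only; the mask construction and dimensions are assumptions rather than the published architecture.

```python
import torch
import torch.nn as nn

class SoftFusion(nn.Module):
    """Sketch of soft selective fusion: element-wise masks, conditioned on both
    modalities, re-weight visual and inertial features before concatenation."""
    def __init__(self, vis_dim=512, imu_dim=128):
        super().__init__()
        self.gate_v = nn.Sequential(nn.Linear(vis_dim + imu_dim, vis_dim), nn.Sigmoid())
        self.gate_i = nn.Sequential(nn.Linear(vis_dim + imu_dim, imu_dim), nn.Sigmoid())

    def forward(self, f_vis, f_imu):                 # (B, vis_dim), (B, imu_dim)
        joint = torch.cat([f_vis, f_imu], dim=-1)
        return torch.cat([self.gate_v(joint) * f_vis,
                          self.gate_i(joint) * f_imu], dim=-1)
```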

Learning Monocular Visual Odometry through Geometry-Aware Curriculum Learning

no code implementations • 25 Mar 2019 • Muhamad Risqi U. Saputra, Pedro P. B. de Gusmao, Sen Wang, Andrew Markham, Niki Trigoni

Inspired by the cognitive process of humans and animals, Curriculum Learning (CL) trains a model by gradually increasing the difficulty of the training data.

Monocular Visual Odometry Optical Flow Estimation

Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds

1 code implementation • NeurIPS 2019 • Bo Yang, Jianan Wang, Ronald Clark, Qingyong Hu, Sen Wang, Andrew Markham, Niki Trigoni

The framework directly regresses 3D bounding boxes for all instances in a point cloud, while simultaneously predicting a point-level mask for each instance.

Ranked #13 on 3D Instance Segmentation on S3DIS (mPrec metric)

3D Instance Segmentation Clustering +2

DynaNet: Neural Kalman Dynamical Model for Motion Estimation and Prediction

no code implementations • 11 Aug 2019 • Changhao Chen, Chris Xiaoxuan Lu, Bing Wang, Niki Trigoni, Andrew Markham

In addition, we show how DynaNet can indicate failures through investigation of properties such as the rate of innovation (Kalman gain).

Motion Estimation Sensor Fusion +1
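
To make the "rate of innovation (Kalman gain)" remark concrete, the textbook Kalman measurement update is sketched below in NumPy; monitoring the innovation y and the gain K over time is one way a Kalman-style model can flag unreliable observations. This is the classical update, not DynaNet's learned recurrent formulation.

```python
import numpy as np

def kalman_update(x, P, z, H, R):
    """One measurement update. The innovation y and the Kalman gain K are
    returned so they can be monitored as a health indicator."""
    y = z - H @ x                          # innovation (measurement residual)
    S = H @ P @ H.T + R                    # innovation covariance
    K = P @ H.T @ np.linalg.inv(S)         # Kalman gain
    x_new = x + K @ y
    P_new = (np.eye(len(x)) - K @ H) @ P
    return x_new, P_new, K, y
```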

Autonomous Learning for Face Recognition in the Wild via Ambient Wireless Cues

1 code implementation • 14 Aug 2019 • Chris Xiaoxuan Lu, Xuan Kan, Bowen Du, Changhao Chen, Hongkai Wen, Andrew Markham, Niki Trigoni, John Stankovic

Inspired by the fact that most people carry smart wireless devices with them, e.g. smartphones, we propose to use this wireless identifier as a supervisory label.

Face Recognition

AtLoc: Attention Guided Camera Localization

1 code implementation • 8 Sep 2019 • Bing Wang, Changhao Chen, Chris Xiaoxuan Lu, Peijun Zhao, Niki Trigoni, Andrew Markham

Deep learning has achieved impressive results in camera localization, but current single-image techniques typically suffer from a lack of robustness, leading to large outliers.

Camera Localization Visual Localization
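
The attention-guided idea can be sketched as a CNN encoder whose global feature is re-weighted by a simple self-attention gate before pose regression. The code below is an illustrative approximation; AtLoc's actual backbone and attention module differ.

```python
import torch
import torch.nn as nn

class AttentionPoseRegressor(nn.Module):
    """Sketch: encode an image, re-weight the global feature with a simplified
    attention gate, then regress translation + rotation (quaternion)."""
    def __init__(self, feat_dim=256):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 64, 7, stride=2, padding=3), nn.ReLU(),
            nn.Conv2d(64, feat_dim, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.q = nn.Linear(feat_dim, feat_dim)
        self.k = nn.Linear(feat_dim, feat_dim)
        self.v = nn.Linear(feat_dim, feat_dim)
        self.head = nn.Linear(feat_dim, 7)              # xyz (3) + quaternion (4)

    def forward(self, img):                             # (B, 3, H, W)
        f = self.encoder(img)                           # (B, feat_dim)
        gate = torch.sigmoid(self.q(f) * self.k(f))     # simplified attention gate
        f = f + gate * self.v(f)                        # residual re-weighting
        return self.head(f)
```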

Milli-RIO: Ego-Motion Estimation with Millimetre-Wave Radar and Inertial Measurement Unit Sensor

no code implementations • 12 Sep 2019 • Yasin Almalioglu, Mehmet Turan, Chris Xiaoxuan Lu, Niki Trigoni, Andrew Markham

With the fast-growing demand for location-based services in various indoor environments, robust indoor ego-motion estimation has attracted significant interest in recent decades.

Indoor Localization Motion Estimation +1

DeepPCO: End-to-End Point Cloud Odometry through Deep Parallel Neural Network

no code implementations • 13 Oct 2019 • Wei Wang, Muhamad Risqi U. Saputra, Peijun Zhao, Pedro Gusmao, Bo Yang, Changhao Chen, Andrew Markham, Niki Trigoni

There is considerable work in the area of visual odometry (VO), and recent advances in deep learning have brought novel approaches to VO, which directly learn salient features from raw images.

Translation Visual Odometry

Introducing an Explicit Symplectic Integration Scheme for Riemannian Manifold Hamiltonian Monte Carlo

1 code implementation • 14 Oct 2019 • Adam D. Cobb, Atılım Güneş Baydin, Andrew Markham, Stephen J. Roberts

We introduce a recent symplectic integration scheme derived for solving physically motivated systems with non-separable Hamiltonians.

Bayesian Inference
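
The scheme referred to appears to be the explicit integrator of Tao (2016) for non-separable Hamiltonians, which augments phase space with a copy (x, y) of (q, p) and alternates three exactly solvable flows. A minimal NumPy sketch for the toy Hamiltonian H(q, p) = 0.5 (q^2 + 1)(p^2 + 1) follows; it is illustrative and not the paper's RMHMC code.

```python
import numpy as np

# Non-separable test Hamiltonian H(q, p) = 0.5 * (q^2 + 1) * (p^2 + 1)
def dH_dq(q, p):
    return q * (p ** 2 + 1)

def dH_dp(q, p):
    return p * (q ** 2 + 1)

def tao_step(q, p, x, y, dt, omega=20.0):
    """One second-order step of Tao's explicit scheme in the extended
    phase space (q, p, x, y), built from a symmetric splitting."""
    def phi_a(q, p, x, y, h):            # exact flow of H(q, y)
        return q, p - h * dH_dq(q, y), x + h * dH_dp(q, y), y
    def phi_b(q, p, x, y, h):            # exact flow of H(x, p)
        return q + h * dH_dp(x, p), p, x, y - h * dH_dq(x, p)
    def phi_c(q, p, x, y, h):            # exact flow of the omega-coupling term
        c, s = np.cos(2 * omega * h), np.sin(2 * omega * h)
        dq, dp = (q - x) / 2, (p - y) / 2
        rq, rp = c * dq + s * dp, -s * dq + c * dp
        sq, sp = (q + x) / 2, (p + y) / 2
        return sq + rq, sp + rp, sq - rq, sp - rp

    q, p, x, y = phi_a(q, p, x, y, dt / 2)
    q, p, x, y = phi_b(q, p, x, y, dt / 2)
    q, p, x, y = phi_c(q, p, x, y, dt)
    q, p, x, y = phi_b(q, p, x, y, dt / 2)
    q, p, x, y = phi_a(q, p, x, y, dt / 2)
    return q, p, x, y

# Integrate a trajectory; (x, y) start as copies of (q, p).
q = p = 1.0
x, y = q, p
for _ in range(1000):
    q, p, x, y = tao_step(q, p, x, y, dt=0.01)
print(q, p)
```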

See Through Smoke: Robust Indoor Mapping with Low-cost mmWave Radar

1 code implementation • 1 Nov 2019 • Chris Xiaoxuan Lu, Stefano Rosa, Peijun Zhao, Bing Wang, Changhao Chen, John A. Stankovic, Niki Trigoni, Andrew Markham

This paper presents the design, implementation and evaluation of milliMap, a single-chip millimetre wave (mmWave) radar based indoor mapping system targeted towards low-visibility environments to assist in emergency response.

SelfVIO: Self-Supervised Deep Monocular Visual-Inertial Odometry and Depth Estimation

no code implementations • 22 Nov 2019 • Yasin Almalioglu, Mehmet Turan, Alp Eren Sari, Muhamad Risqi U. Saputra, Pedro P. B. de Gusmão, Andrew Markham, Niki Trigoni

In the last decade, numerous supervised deep learning approaches requiring large amounts of labeled data have been proposed for visual-inertial odometry (VIO) and depth map estimation.

Depth Estimation Pose Estimation +3

Snoopy: Sniffing Your Smartwatch Passwords via Deep Sequence Learning

1 code implementation • 10 Dec 2019 • Chris Xiaoxuan Lu, Bowen Du, Hongkai Wen, Sen Wang, Andrew Markham, Ivan Martinovic, Yiran Shen, Niki Trigoni

Demand for smartwatches has taken off in recent years, with new models that can run independently of smartphones and provide more useful features, making them first-class mobile platforms.

Learning Selective Sensor Fusion for States Estimation

no code implementations • 30 Dec 2019 • Changhao Chen, Stefano Rosa, Chris Xiaoxuan Lu, Bing Wang, Niki Trigoni, Andrew Markham

By integrating the observations from different sensors, these mobile agents are able to perceive the environment and estimate system states, e.g. locations and orientations.

Autonomous Vehicles Sensor Fusion

Deep Learning based Pedestrian Inertial Navigation: Methods, Dataset and On-Device Inference

no code implementations • 13 Jan 2020 • Changhao Chen, Peijun Zhao, Chris Xiaoxuan Lu, Wei Wang, Andrew Markham, Niki Trigoni

Modern inertial measurement units (IMUs) are small, cheap, energy efficient, and widely employed in smart devices and mobile robots.

PointLoc: Deep Pose Regressor for LiDAR Point Cloud Localization

2 code implementations • 5 Mar 2020 • Wei Wang, Bing Wang, Peijun Zhao, Changhao Chen, Ronald Clark, Bo Yang, Andrew Markham, Niki Trigoni

In this paper, we present a novel end-to-end learning-based LiDAR relocalization framework, termed PointLoc, which infers 6-DoF poses directly using only a single point cloud as input, without requiring a pre-built map.

Robotics

VMLoc: Variational Fusion For Learning-Based Multimodal Camera Localization

1 code implementation • 12 Mar 2020 • Kaichen Zhou, Changhao Chen, Bing Wang, Muhamad Risqi U. Saputra, Niki Trigoni, Andrew Markham

We conjecture that this is because of the naive approaches to feature space fusion through summation or concatenation which do not take into account the different strengths of each modality.

Camera Relocalization Visual Localization

Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges

2 code implementations • CVPR 2021 • Qingyong Hu, Bo Yang, Sheikh Khalid, Wen Xiao, Niki Trigoni, Andrew Markham

An essential prerequisite for unleashing the potential of supervised deep learning algorithms in the area of 3D scene understanding is the availability of large-scale and richly annotated datasets.

Scene Understanding Semantic Segmentation

RadarLoc: Learning to Relocalize in FMCW Radar

no code implementations • 22 Mar 2021 • Wei Wang, Pedro P. B. de Gusmão, Bo Yang, Andrew Markham, Niki Trigoni

There is considerable work in the field of deep camera relocalization, which directly estimates poses from raw images.

Camera Relocalization

SoundDet: Polyphonic Moving Sound Event Detection and Localization from Raw Waveform

no code implementations • 13 Jun 2021 • Yuhang He, Niki Trigoni, Andrew Markham

Specifically, SoundDet consists of a backbone neural network and two parallel heads for temporal detection and spatial localization, respectively.

Event Detection Sound Event Detection
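
A hedged sketch of that backbone-plus-two-heads layout is given below: a shared encoder over the raw multi-channel waveform feeds one head for per-frame event detection and another for spatial localisation. Channel counts, strides and the output parameterisation are assumptions, not the published design.

```python
import torch
import torch.nn as nn

class TwoHeadSoundNet(nn.Module):
    """Sketch: shared waveform encoder with parallel detection and location heads."""
    def __init__(self, n_ch=4, n_classes=10, hid=128):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv1d(n_ch, 64, 1024, stride=256), nn.ReLU(),   # learned front-end on raw audio
            nn.Conv1d(64, hid, 3, padding=1), nn.ReLU())
        self.detect = nn.Conv1d(hid, n_classes, 1)              # per-frame class logits
        self.locate = nn.Conv1d(hid, 3 * n_classes, 1)          # per-frame xyz per class

    def forward(self, wav):                                     # (B, n_ch, T) raw waveform
        f = self.backbone(wav)
        return self.detect(f), self.locate(f)
```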

Finite Basis Physics-Informed Neural Networks (FBPINNs): a scalable domain decomposition approach for solving differential equations

1 code implementation • 16 Jul 2021 • Ben Moseley, Andrew Markham, Tarje Nissen-Meyer

FBPINNs are designed to address the spectral bias of neural networks by using separate input normalisation over each subdomain, and reduce the complexity of the underlying optimisation problem by using many smaller neural networks in a parallel divide-and-conquer approach.
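
To make the domain-decomposition idea concrete, the sketch below implements a 1D "finite basis" network: several small MLPs, each with its own input normalisation, combined through smooth window functions so that every subdomain contributes only locally. The Gaussian window and the subdomain layout are simplifications of the published partition-of-unity windowing, and the physics (PDE) loss is omitted.

```python
import torch
import torch.nn as nn

class FBPINN1D(nn.Module):
    """Sketch: sum of windowed subdomain networks with per-subdomain
    input normalisation (illustrative, not the published FBPINN code)."""
    def __init__(self, n_sub=4, domain=(0.0, 1.0), width=16):
        super().__init__()
        lo, hi = domain
        self.register_buffer("centres", torch.linspace(lo, hi, n_sub))
        self.sigma = (hi - lo) / n_sub                 # subdomain scale / overlap
        self.nets = nn.ModuleList(
            nn.Sequential(nn.Linear(1, width), nn.Tanh(),
                          nn.Linear(width, width), nn.Tanh(),
                          nn.Linear(width, 1))
            for _ in range(n_sub))

    def forward(self, x):                              # x: (N, 1)
        out = 0.0
        for c, net in zip(self.centres, self.nets):
            xn = (x - c) / self.sigma                  # normalise input per subdomain
            w = torch.exp(-xn ** 2)                    # smooth, locally peaked window
            out = out + w * net(xn)
        return out
```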

Scaling physics-informed neural networks to large domains by using domain decomposition

no code implementations • NeurIPS Workshop DLDE 2021 • Ben Moseley, Andrew Markham, Tarje Nissen-Meyer

Recently, physics-informed neural networks (PINNs) have offered a powerful new paradigm for solving forward and inverse problems relating to differential equations.

CubeLearn: End-to-end Learning for Human Motion Recognition from Raw mmWave Radar Signals

1 code implementation • 7 Nov 2021 • Peijun Zhao, Chris Xiaoxuan Lu, Bing Wang, Niki Trigoni, Andrew Markham

To avoid the drawbacks of conventional DFT pre-processing, we propose a learnable pre-processing module, named CubeLearn, to directly extract features from raw radar signal and build an end-to-end deep neural network for mmWave FMCW radar motion recognition applications.

Activity Recognition
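
The learnable pre-processing can be sketched as a complex linear transform whose weights are initialised to the DFT matrix, so it starts out as a range-FFT but can adapt during end-to-end training. The code below is an illustrative single-axis version with a magnitude output; the published module is more elaborate.

```python
import numpy as np
import torch
import torch.nn as nn

class LearnableDFT(nn.Module):
    """Sketch: complex linear layer over raw radar samples, initialised to the DFT."""
    def __init__(self, n=256):
        super().__init__()
        k = np.arange(n)
        W = np.exp(-2j * np.pi * np.outer(k, k) / n)            # n x n DFT matrix
        self.real = nn.Parameter(torch.tensor(W.real, dtype=torch.float32))
        self.imag = nn.Parameter(torch.tensor(W.imag, dtype=torch.float32))

    def forward(self, x_real, x_imag):                          # raw I/Q samples: (..., n)
        re = x_real @ self.real.T - x_imag @ self.imag.T        # complex matrix product
        im = x_real @ self.imag.T + x_imag @ self.real.T
        return torch.sqrt(re ** 2 + im ** 2 + 1e-8)             # magnitude spectrum features
```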

DeepAoANet: Learning Angle of Arrival from Software Defined Radios with Deep Neural Networks

1 code implementation • 1 Dec 2021 • Zhuangzhuang Dai, Yuhang He, Tran Vu, Niki Trigoni, Andrew Markham

To demonstrate the utility of our approach we have collected IQ (In-phase and Quadrature components) samples from a four-element Uniform Linear Array (ULA) in various Line-of-Sight (LOS) and Non-Line-of-Sight (NLOS) environments, and published the dataset.

SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point Clouds

no code implementations • 12 Jan 2022 • Qingyong Hu, Bo Yang, Sheikh Khalid, Wen Xiao, Niki Trigoni, Andrew Markham

Each point in the dataset has been labelled with fine-grained semantic annotations, resulting in a dataset that is three times the size of the previous existing largest photogrammetric point cloud dataset.

Meta-Sampler: Almost-Universal yet Task-Oriented Sampling for Point Clouds

1 code implementation • 30 Mar 2022 • Ta-Ying Cheng, Qingyong Hu, Qian Xie, Niki Trigoni, Andrew Markham

In this work, we propose an almost-universal sampler, in our quest for a sampler that can learn to preserve the most useful points for a particular task, yet be inexpensive to adapt to different tasks, models, or datasets.

Computational Efficiency

RangeUDF: Semantic Surface Reconstruction from 3D Point Clouds

2 code implementations • 19 Apr 2022 • Bing Wang, Zhengdi Yu, Bo Yang, Jie Qin, Toby Breckon, Ling Shao, Niki Trigoni, Andrew Markham

We present RangeUDF, a new implicit representation based framework to recover the geometry and semantics of continuous 3D scene surfaces from point clouds.

Semantic Segmentation Surface Reconstruction

When the Sun Goes Down: Repairing Photometric Losses for All-Day Depth Estimation

no code implementations • 28 Jun 2022 • Madhu Vankadari, Stuart Golodetz, Sourav Garg, Sangyun Shin, Andrew Markham, Niki Trigoni

In this paper, we show how to use a combination of three techniques to allow the existing photometric losses to work for both day and nighttime images.

Depth Estimation Motion Estimation

Sample, Crop, Track: Self-Supervised Mobile 3D Object Detection for Urban Driving LiDAR

no code implementations • 21 Sep 2022 • Sangyun Shin, Stuart Golodetz, Madhu Vankadari, Kaichen Zhou, Andrew Markham, Niki Trigoni

Supervised approaches typically require the annotation of large training sets; there has thus been great interest in leveraging weakly, semi- or self-supervised methods to avoid this, with much success.

3D Object Detection Object +2

Tracking People in Highly Dynamic Industrial Environments

no code implementations • 1 Feb 2023 • Savvas Papaioannou, Andrew Markham, Niki Trigoni

We have conducted extensive real-world experiments in a construction site showing significant accuracy improvement via cross-modality training and the use of social forces.


Fusion of Radio and Camera Sensor Data for Accurate Indoor Positioning

no code implementations • 1 Feb 2023 • Savvas Papaioannou, Hongkai Wen, Andrew Markham, Niki Trigoni

In this paper, we propose a novel positioning system, RAVEL (Radio And Vision Enhanced Localization), which fuses anonymous visual detections captured by widely available camera infrastructure, with radio readings (e.g. WiFi radio data).

Decoupling Skill Learning from Robotic Control for Generalizable Object Manipulation

no code implementations • 7 Mar 2023 • Kai Lu, Bo Yang, Bing Wang, Andrew Markham

Our experiments on manipulating complex articulated objects show that the proposed approach is more generalizable to unseen objects with large intra-class variations, outperforming previous approaches.

Imitation Learning Reinforcement Learning (RL)

Fast model inference and training on-board of Satellites

2 code implementations • 17 Jul 2023 • Vít Růžička, Gonzalo Mateo-García, Chris Bridges, Chris Brunskill, Cormac Purcell, Nicolas Longépé, Andrew Markham

In this work we demonstrate the reliable use of RaVAEn onboard a satellite, achieving an encoding time of 0.110s for tiles of a 4.8x4.8 km$^2$ area.

Decision Making

Deep Learning for Visual Localization and Mapping: A Survey

no code implementations • 27 Aug 2023 • Changhao Chen, Bing Wang, Chris Xiaoxuan Lu, Niki Trigoni, Andrew Markham

Deep learning-based localization and mapping approaches have recently emerged as a new research direction and are receiving significant attention from both industry and academia.

Simultaneous Localization and Mapping Visual Localization +1

DynPoint: Dynamic Neural Point For View Synthesis

1 code implementation • NeurIPS 2023 • Kaichen Zhou, Jia-Xing Zhong, Sangyun Shin, Kai Lu, Yiyuan Yang, Andrew Markham, Niki Trigoni

The introduction of neural radiance fields has greatly improved the effectiveness of view synthesis for monocular videos.

Spherical Mask: Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical Representation

1 code implementation • 18 Dec 2023 • Sangyun Shin, Kaichen Zhou, Madhu Vankadari, Andrew Markham, Niki Trigoni

Coarse-to-fine 3D instance segmentation methods show weaker performance than recent Grouping-based, Kernel-based and Transformer-based methods.

3D Instance Segmentation Semantic Segmentation

MGDepth: Motion-Guided Cost Volume For Self-Supervised Monocular Depth In Dynamic Scenarios

no code implementations • 23 Dec 2023 • Kaichen Zhou, Jia-Xing Zhong, Jia-Wang Bian, Qian Xie, Jian-Qing Zheng, Niki Trigoni, Andrew Markham

Despite advancements in self-supervised monocular depth estimation, challenges persist in dynamic scenarios due to the dependence on assumptions about a static world.

Computational Efficiency Monocular Depth Estimation +1

Learning Continuous 3D Words for Text-to-Image Generation

no code implementations • 13 Feb 2024 • Ta-Ying Cheng, Matheus Gadelha, Thibault Groueix, Matthew Fisher, Radomir Mech, Andrew Markham, Niki Trigoni

We do this by engineering special sets of input tokens that can be transformed in a continuous manner -- we call them Continuous 3D Words.

Text-to-Image Generation

Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition

1 code implementation • 23 Feb 2024 • Chun-Hsiao Yeh, Ta-Ying Cheng, He-Yen Hsieh, Chuan-En Lin, Yi Ma, Andrew Markham, Niki Trigoni, H. T. Kung, Yubei Chen

First, current personalization techniques fail to reliably extend to multiple concepts -- we hypothesize this to be due to the mismatch between complex scenes and simple text descriptions in the pre-training dataset (e.g., LAION).

Image Generation

See, Imagine, Plan: Discovering and Hallucinating Tasks from a Single Image

no code implementations • 18 Mar 2024 • Chenyang Ma, Kai Lu, Ta-Ying Cheng, Niki Trigoni, Andrew Markham

Humans can not only recognize and understand the world in its current state but also envision future scenarios that extend beyond immediate perception.

Hallucination Motion Planning

WSCLoc: Weakly-Supervised Sparse-View Camera Relocalization

no code implementations • 22 Mar 2024 • Jialu Wang, Kaichen Zhou, Andrew Markham, Niki Trigoni

Despite the advancements in deep learning for camera relocalization tasks, obtaining ground truth pose labels required for the training process remains a costly endeavor.

Camera Relocalization Image Reconstruction +1

ZeST: Zero-Shot Material Transfer from a Single Image

no code implementations • 9 Apr 2024 • Ta-Ying Cheng, Prafull Sharma, Andrew Markham, Niki Trigoni, Varun Jampani

We propose ZeST, a method for zero-shot material transfer to an object in the input image given a material exemplar image.

Object
