Search Results for author: Adrian Hilton

Found 27 papers, 3 papers with code

ANIM: Accurate Neural Implicit Model for Human Reconstruction from a single RGB-D image

no code implementations • 15 Mar 2024 • Marco Pesavento, Yuanlu Xu, Nikolaos Sarafianos, Robert Maier, Ziyan Wang, Chun-Han Yao, Marco Volino, Edmond Boyer, Adrian Hilton, Tony Tung

In this paper, we explore the benefits of incorporating depth observations in the reconstruction process by introducing ANIM, a novel method that reconstructs arbitrary 3D human shapes from single-view RGB-D images with an unprecedented level of accuracy.

CAD -- Contextual Multi-modal Alignment for Dynamic AVQA

no code implementations • 25 Oct 2023 • Asmar Nadeem, Adrian Hilton, Robert Dawes, Graham Thomas, Armin Mustafa

In the context of Audio Visual Question Answering (AVQA) tasks, the audio visual modalities could be learnt on three levels: 1) Spatial, 2) Temporal, and 3) Semantic.

Ranked #3 on Audio-visual Question Answering on MUSIC-AVQA

Audio-visual Question Answering Audio-Visual Question Answering (AVQA) +2

PAT: Position-Aware Transformer for Dense Multi-Label Action Detection

no code implementations • 9 Aug 2023 • Faegheh Sardari, Armin Mustafa, Philip J. B. Jackson, Adrian Hilton

To address this issue, we (i) embed relative positional encoding in the self-attention mechanism and (ii) exploit multi-scale temporal relationships by designing a novel non-hierarchical network, in contrast to recent transformer-based approaches that use a hierarchical structure.

Ranked #1 on Action Detection on MultiTHUMOS

Action Detection Event Detection +1

SEM-POS: Grammatically and Semantically Correct Video Captioning

no code implementations • 26 Mar 2023 • Asmar Nadeem, Adrian Hilton, Robert Dawes, Graham Thomas, Armin Mustafa

Generating grammatically and semantically correct captions in video captioning is a challenging task.

POS Video Captioning

Super-resolution 3D Human Shape from a Single Low-Resolution Image

1 code implementation • 23 Aug 2022 • Marco Pesavento, Marco Volino, Adrian Hilton

The approach overcomes limitations of existing approaches that reconstruct 3D human shape from a single image, which require high-resolution images together with auxiliary data such as surface normal or a parametric model to reconstruct high-detail shape.

3D Human Reconstruction 3D Human Shape Estimation +2

Visually Supervised Speaker Detection and Localization via Microphone Array

no code implementations • 7 Mar 2022 • Davide Berghi, Adrian Hilton, Philip J. B. Jackson

We propose to generate weak labels using a pre-trained active speaker detector on pre-extracted face tracks.

Attention-based Multi-Reference Learning for Image Super-Resolution

1 code implementation • ICCV 2021 • Marco Pesavento, Marco Volino, Adrian Hilton

A novel hierarchical attention-based sampling approach is introduced to learn the similarity between low-resolution image features and multiple reference images based on a perceptual loss.

Image Super-Resolution

Super-Resolution Appearance Transfer for 4D Human Performances

no code implementations • 31 Aug 2021 • Marco Pesavento, Marco Volino, Adrian Hilton

Typically, the requirement to frame cameras to capture the volume of a dynamic performance ($>50\,\mathrm{m}^3$) results in the person occupying only a small proportion ($<10\%$) of the field of view.

4D reconstruction 4k +2

SyDog: A Synthetic Dog Dataset for Improved 2D Pose Estimation

no code implementations • 31 Jul 2021 • Moira Shooter, Charles Malleson, Adrian Hilton

Estimating the pose of animals can facilitate the understanding of animal motion which is fundamental in disciplines such as biomechanics, neuroscience, ethology, robotics and the entertainment industry.

Ranked #1 on Animal Pose Estimation on StanfordExtra

2D Pose Estimation Animal Pose Estimation

Temporal Consistency Loss for High Resolution Textured and Clothed 3D Human Reconstruction from Monocular Video

no code implementations • 19 Apr 2021 • Akin Caliskan, Armin Mustafa, Adrian Hilton

We present a novel method to learn temporally consistent 3D reconstruction of clothed people from a monocular video.

3D Human Reconstruction 3D Human Shape Estimation +2

Multi-person Implicit Reconstruction from a Single Image

no code implementations • CVPR 2021 • Armin Mustafa, Akin Caliskan, Lourdes Agapito, Adrian Hilton

We present a new end-to-end learning framework to obtain detailed and spatially coherent reconstructions of multiple people from a single image.

3D Human Reconstruction

Multi-View Consistency Loss for Improved Single-Image 3D Reconstruction of Clothed People

no code implementations • 29 Sep 2020 • Akin Caliskan, Armin Mustafa, Evren Imre, Adrian Hilton

This paper introduces two advances to overcome this limitation: firstly a new synthetic dataset of realistic clothed people, 3DVH; and secondly, a novel multiple-view loss function for training of monocular volumetric shape estimation, which is demonstrated to significantly improve generalisation and reconstruction accuracy.

3D Human Shape Estimation 3D Reconstruction

Spectral Analysis Network for Deep Representation Learning and Image Clustering

no code implementations • 11 Sep 2020 • Jinghua Wang, Adrian Hilton, Jianmin Jiang

This paper proposes a new network structure for unsupervised deep representation learning based on spectral analysis, a popular technique with solid theoretical foundations.

Clustering Image Clustering +1

Learning Dense Wide Baseline Stereo Matching for People

no code implementations • 2 Oct 2019 • Akin Caliskan, Armin Mustafa, Evren Imre, Adrian Hilton

We show that it is possible to learn stereo matching from a synthetic dataset of people and to improve performance on real datasets for stereo reconstruction of people from narrow- and wide-baseline stereo data.

Data Augmentation Stereo Matching

EdgeNet: Semantic Scene Completion from a Single RGB-D Image

1 code implementation • 8 Aug 2019 • Aloisio Dourado, Teofilo Emidio de Campos, Hansung Kim, Adrian Hilton

Semantic scene completion is the task of predicting a complete 3D representation of volumetric occupancy with corresponding semantic labels for a scene from a single point of view.

Ranked #22 on 3D Semantic Scene Completion on NYUv2

3D Semantic Scene Completion Edge Detection

Semantic Estimation of 3D Body Shape and Pose using Minimal Cameras

no code implementations • 8 Aug 2019 • Andrew Gilbert, Matthew Trumble, Adrian Hilton, John Collomosse

We aim to simultaneously estimate the 3D articulated pose and high fidelity volumetric occupancy of human performance, from multiple viewpoint video (MVV) with as few as two views.

Ranked #163 on 3D Human Pose Estimation on Human3.6M

3D Human Pose Estimation

U4D: Unsupervised 4D Dynamic Scene Understanding

no code implementations • ICCV 2019 • Armin Mustafa, Chris Russell, Adrian Hilton

We introduce the first approach to solve the challenging problem of unsupervised 4D visual scene understanding for complex dynamic scenes with multiple interacting people from multi-view video.

3D Pose Estimation Instance Segmentation +3

Temporally Coherent General Dynamic Scene Reconstruction

no code implementations • 18 Jul 2019 • Armin Mustafa, Marco Volino, Hansung Kim, Jean-Yves Guillemaut, Adrian Hilton

Existing techniques for dynamic scene reconstruction from multiple wide-baseline cameras primarily focus on reconstruction in controlled environments, with fixed calibrated cameras and strong prior constraints.

Segmentation Semantic Segmentation

Volumetric performance capture from minimal camera viewpoints

no code implementations • ECCV 2018 • Andrew Gilbert, Marco Volino, John Collomosse, Adrian Hilton

We present a convolutional autoencoder that enables high fidelity volumetric reconstructions of human performance to be captured from multi-view video comprising only a small set of camera views.

Deep Autoencoder for Combined Human Pose Estimation and body Model Upscaling

no code implementations • ECCV 2018 • Matthew Trumble, Andrew Gilbert, Adrian Hilton, John Collomosse

We present a method for simultaneously estimating 3D human pose and body shape from a sparse set of wide-baseline camera views.

Ranked #9 on 3D Human Pose Estimation on Total Capture

3D Human Pose Estimation

4D Temporally Coherent Light-field Video

no code implementations • 30 Apr 2018 • Armin Mustafa, Marco Volino, Jean-Yves Guillemaut, Adrian Hilton

Evaluation of the proposed light-field scene flow against existing multi-view dense correspondence approaches demonstrates a significant improvement in accuracy of temporal coherence.

Scene Flow Estimation

Semantic Scene Completion Combining Colour and Depth: preliminary experiments

no code implementations • 13 Feb 2018 • Andre Bernardes Soares Guedes, Teofilo Emidio de Campos, Adrian Hilton

Semantic scene completion is the task of producing a complete 3D voxel representation of volumetric occupancy with semantic labels for a scene from a single-view observation.

Ranked #23 on 3D Semantic Scene Completion on NYUv2

3D Semantic Scene Completion

Total capture: 3D human pose estimation fusing video and inertial sensors

no code implementations • BMVC 2017 • Matthew Trumble, Andrew Gilbert, Charles Malleson, Adrian Hilton, John Collomosse

We incorporate this model within a dual stream network integrating pose embeddings derived from MVV and a forward kinematic solve of the IMU data.

Ranked #11 on 3D Human Pose Estimation on Total Capture

3D Human Pose Estimation

Semantically Coherent Co-Segmentation and Reconstruction of Dynamic Scenes

no code implementations • CVPR 2017 • Armin Mustafa, Adrian Hilton

Semantic co-segmentation exploits the coherence in semantic class labels both spatially, between views at a single time instant, and temporally, between widely spaced time instants of dynamic objects with similar shape and appearance.

3D Reconstruction Segmentation

Temporally coherent 4D reconstruction of complex dynamic scenes

no code implementations • CVPR 2016 • Armin Mustafa, Hansung Kim, Jean-Yves Guillemaut, Adrian Hilton

Sparse-to-dense temporal correspondence is integrated with joint multi-view segmentation and reconstruction to obtain a complete 4D representation of static and dynamic objects.

4D reconstruction Camera Calibration +2

FaceDirector: Continuous Control of Facial Performance in Video

no code implementations • ICCV 2015 • Charles Malleson, Jean-Charles Bazin, Oliver Wang, Derek Bradley, Thabo Beeler, Adrian Hilton, Alexander Sorkine-Hornung

We present a method to continuously blend between multiple facial performances of an actor, which can contain different facial expressions or emotional states.

Audio-Visual Synchronization Continuous Control

General Dynamic Scene Reconstruction from Multiple View Video

no code implementations • ICCV 2015 • Armin Mustafa, Hansung Kim, Jean-Yves Guillemaut, Adrian Hilton

The primary contributions of this paper are twofold: an automatic method for initial coarse dynamic scene segmentation and reconstruction without prior knowledge of background appearance or structure; and a general robust approach for joint segmentation refinement and dense reconstruction of dynamic scenes from multiple wide-baseline static or moving cameras.

Scene Segmentation Segmentation
