no code implementations • 15 Jul 2024 • Marco Pesavento, Marco Volino, Adrian Hilton
The generated 2D normal maps are then processed by a multi-view attention-based neural implicit model that estimates an implicit representation of the 3D shape, ensuring the reproduction of details in both observed and occluded regions.
no code implementations • 6 Jun 2024 • Haosen Yang, Chenhao Zhang, Wenqing Wang, Marco Volino, Adrian Hilton, Li Zhang, Xiatian Zhu
To address these limitations, we propose a Localized Point Management (LPM) strategy, capable of identifying those error-contributing zones in the highest demand for both point addition and geometry calibration.
no code implementations • CVPR 2024 • Marco Pesavento, Yuanlu Xu, Nikolaos Sarafianos, Robert Maier, Ziyan Wang, Chun-Han Yao, Marco Volino, Edmond Boyer, Adrian Hilton, Tony Tung
In this paper, we explore the benefits of incorporating depth observations in the reconstruction process by introducing ANIM, a novel method that reconstructs arbitrary 3D human shapes from single-view RGB-D images with an unprecedented level of accuracy.
no code implementations • 4 Dec 2022 • Davide Berghi, Marco Volino, Philip J. B. Jackson
This is partly due to the lack of available datasets enabling audio-visual research in this direction.
1 code implementation • 23 Aug 2022 • Marco Pesavento, Marco Volino, Adrian Hilton
The approach overcomes limitations of existing approaches that reconstruct 3D human shape from a single image, which require high-resolution images together with auxiliary data such as surface normal or a parametric model to reconstruct high-detail shape.
1 code implementation • ICCV 2021 • Marco Pesavento, Marco Volino, Adrian Hilton
A novel hierarchical attention-based sampling approach is introduced to learn the similarity between low-resolution image features and multiple reference images based on a perceptual loss.
no code implementations • 31 Aug 2021 • Marco Pesavento, Marco Volino, Adrian Hilton
Typically the requirement to frame cameras to capture the volume of a dynamic performance ($>50m^3$) results in the person occupying only a small proportion $<$ 10% of the field of view.
no code implementations • 18 Jul 2019 • Armin Mustafa, Marco Volino, Hansung Kim, Jean-yves Guillemaut, Adrian Hilton
Existing techniques for dynamic scene reconstruction from multiple wide-baseline cameras primarily focus on reconstruction in controlled environments, with fixed calibrated cameras and strong prior constraints.
no code implementations • ECCV 2018 • Andrew Gilbert, Marco Volino, John Collomosse, Adrian Hilton
We present a convolutional autoencoder that enables high fidelity volumetric reconstructions of human performance to be captured from multi-view video comprising only a small set of camera views.
no code implementations • 30 Apr 2018 • Armin Mustafa, Marco Volino, Jean-yves Guillemaut, Adrian Hilton
Evaluation of the proposed light-field scene flow against existing multi-view dense correspondence approaches demonstrates a significant improvement in accuracy of temporal coherence.