Search Results for author: Alessio Del Bue

Found 71 papers, 29 papers with code

Gaussian Heritage: 3D Digitization of Cultural Heritage with Integrated Object Segmentation

no code implementations27 Sep 2024 Mahtab Dahaghin, Myrna Castillo, Kourosh Riahidehkordi, Matteo Toso, Alessio Del Bue

We propose a pipeline to generate a 3D replica of a scene using only RGB images (e. g. photos of a museum) and then extract a model for each item of interest (e. g. pieces in the exhibit).

Novel View Synthesis Semantic Segmentation

SelfGeo: Self-supervised and Geodesic-consistent Estimation of Keypoints on Deformable Shapes

no code implementations5 Aug 2024 Mohammad Zohaib, Luca Cosmo, Alessio Del Bue

Unsupervised 3D keypoints estimation from Point Cloud Data (PCD) is a complex task, even more challenging when an object shape is deforming.

6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model

no code implementations22 Jul 2024 Matteo Bortolon, Theodore Tsesmelis, Stuart James, Fabio Poiesi, Alessio Del Bue

Each Ellicell ray is associated with the rendering parameters of each ellipsoid, which in turn is used to obtain the best bindings between the target image pixels and the cast rays.

6D Pose Estimation Novel View Synthesis

I2EDL: Interactive Instruction Error Detection and Localization

no code implementations7 Jun 2024 Francesco Taioli, Stefano Rosa, Alberto Castellini, Lorenzo Natale, Alessio Del Bue, Alessandro Farinelli, Marco Cristani, Yiming Wang

We evaluate the proposed I2EDL on a dataset of instructions containing errors, and further devise a novel metric, the Success weighted by Interaction Number (SIN), to reflect both the navigation performance and the interaction effectiveness.

Vision and Language Navigation

Contrastive Gaussian Clustering: Weakly Supervised 3D Scene Segmentation

no code implementations19 Apr 2024 Myrna C. Silva, Mahtab Dahaghin, Matteo Toso, Alessio Del Bue

Recent works in novel-view synthesis have shown how to model the appearance of a scene via a cloud of 3D Gaussians, and how to generate accurate images from a given viewpoint by projecting on it the Gaussians before $\alpha$ blending their color.

Clustering Contrastive Learning +3

HAHA: Highly Articulated Gaussian Human Avatars with Textured Mesh Prior

1 code implementation1 Apr 2024 David Svitov, Pietro Morerio, Lourdes Agapito, Alessio Del Bue

We demonstrate the effectiveness of our approach on two open datasets: SnapshotPeople and X-Humans.

IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model

no code implementations19 Mar 2024 Matteo Bortolon, Theodore Tsesmelis, Stuart James, Fabio Poiesi, Alessio Del Bue

We introduce IFFNeRF to estimate the six degrees-of-freedom (6DoF) camera pose of a given image, building on the Neural Radiance Fields (NeRF) formulation.

Pose Estimation

Towards the Reusability and Compositionality of Causal Representations

no code implementations14 Mar 2024 Davide Talon, Phillip Lippe, Stuart James, Alessio Del Bue, Sara Magliacane

Causal Representation Learning (CRL) aims at identifying high-level causal factors and their relationships from high-dimensional observations, e. g., images.

Representation Learning Temporal Sequences

PRAGO: Differentiable Multi-View Pose Optimization From Objectness Detections

no code implementations13 Mar 2024 Matteo Taiana, Matteo Toso, Stuart James, Alessio Del Bue

Robustly estimating camera poses from a set of images is a fundamental task which remains challenging for differentiable methods, especially in the case of small and sparse camera pose graphs.

graph construction Pose Estimation +1

mmFUSION: Multimodal Fusion for 3D Objects Detection

no code implementations7 Nov 2023 Javed Ahmad, Alessio Del Bue

The strong multi-modal features from the mmFUSION framework are fed to a simple 3D detection head for 3D predictions.

3D Object Detection object-detection +1

Learnable Data Augmentation for One-Shot Unsupervised Domain Adaptation

1 code implementation3 Oct 2023 Julio Ivan Davila Carrazco, Pietro Morerio, Alessio Del Bue, Vittorio Murino

This paper presents a classification framework based on learnable data augmentation to tackle the One-Shot Unsupervised Domain Adaptation (OS-UDA) problem.

Data Augmentation Decoder +3

Leveraging Next-Active Objects for Context-Aware Anticipation in Egocentric Videos

no code implementations16 Aug 2023 Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue

Compared to existing video modeling architectures for action anticipation, NAOGAT captures the relationship between objects and the global scene context in order to predict detections for the next active object and anticipate relevant future actions given these detections, leveraging the objects' dynamics to improve accuracy.

Action Anticipation Active Object Localization +3

SC3K: Self-supervised and Coherent 3D Keypoints Estimation from Rotated, Noisy, and Decimated Point Cloud Data

1 code implementation ICCV 2023 Mohammad Zohaib, Alessio Del Bue

This paper proposes a new method to infer keypoints from arbitrary object categories in practical scenarios where point cloud data (PCD) are noisy, down-sampled and arbitrarily rotated.

Person Re-Identification without Identification via Event Anonymization

1 code implementation ICCV 2023 SHAFIQ AHMAD, Pietro Morerio, Alessio Del Bue

In this work, we also bring to the community the first ever event-based person ReId dataset gathered to evaluate the performance of our approach.

Event-based vision Image Reconstruction +2

Guided Attention for Next Active Object @ EGO4D STA Challenge

1 code implementation25 May 2023 Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue

In this technical report, we describe the Guided-Attention mechanism based solution for the short-term anticipation (STA) challenge for the EGO4D challenge.

Object Short-term Object Interaction Anticipation

Enhancing Next Active Object-based Egocentric Action Anticipation with Guided Attention

1 code implementation22 May 2023 Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue

To this end, we propose a novel approach that applies a guided attention mechanism between the objects, and the spatiotemporal features extracted from video clips, enhancing the motion and contextual information, and further decoding the object-centric and motion-centric information to address the problem of STA in egocentric videos.

Action Anticipation Object +1

Target-driven One-Shot Unsupervised Domain Adaptation

no code implementations8 May 2023 Julio Ivan Davila Carrazco, Suvarna Kishorkumar Kadam, Pietro Morerio, Alessio Del Bue, Vittorio Murino

Unlike existing methods, our augmentation module allows for strong transformations of the source samples, and the style of the single target sample available is exploited to guide the augmentation by ensuring perceptual similarity.

One-shot Unsupervised Domain Adaptation Unsupervised Domain Adaptation

3DoF Localization from a Single Image and an Object Map: the Flatlandia Problem and Dataset

1 code implementation13 Apr 2023 Matteo Toso, Matteo Taiana, Stuart James, Alessio Del Bue

Efficient visual localization is crucial to many applications, such as large-scale deployment of autonomous agents and augmented reality.

Privacy Preserving Visual Localization

Positional Diffusion: Ordering Unordered Sets with Diffusion Probabilistic Models

1 code implementation20 Mar 2023 Francesco Giuliari, Gianluca Scarpellini, Stuart James, Yiming Wang, Alessio Del Bue

We present Positional Diffusion, a plug-and-play graph formulation with Diffusion Probabilistic Models to address positional reasoning.

Graph Neural Network Sentence +2

Guiding Pseudo-labels with Uncertainty Estimation for Source-free Unsupervised Domain Adaptation

2 code implementations CVPR 2023 Mattia Litrico, Alessio Del Bue, Pietro Morerio

We propose a novel approach for the SF-UDA setting based on a loss reweighting strategy that brings robustness against the noise that inevitably affects the pseudo-labels.

Unsupervised Domain Adaptation

Self-improving object detection via disagreement reconciliation

no code implementations21 Feb 2023 Gianluca Scarpellini, Stefano Rosa, Pietro Morerio, Lorenzo Natale, Alessio Del Bue

Object detectors often experience a drop in performance when new environmental conditions are insufficiently represented in the training data.

Object object-detection +1

Anticipating Next Active Objects for Egocentric Videos

no code implementations13 Feb 2023 Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue

This paper addresses the problem of anticipating the next-active-object location in the future, for a given egocentric video clip where the contact might happen, before any action takes place.

Object

3DSGrasp: 3D Shape-Completion for Robotic Grasp

no code implementations2 Jan 2023 Seyed S. Mohammadi, Nuno F. Duarte, Dimitris Dimou, Yiming Wang, Matteo Taiana, Pietro Morerio, Atabak Dehban, Plinio Moreno, Alexandre Bernardino, Alessio Del Bue, Jose Santos-Victor

However, in practice, PCDs are often incomplete when objects are viewed from few and sparse viewpoints before the grasping action, leading to the generation of wrong or inaccurate grasp poses.

Decoder Robotic Grasping

Leveraging commonsense for object localisation in partial scenes

no code implementations1 Nov 2022 Francesco Giuliari, Geri Skenderi, Marco Cristani, Alessio Del Bue, Yiming Wang

With the proposed graph-based scene representation, we estimate the unknown position of the target object using a Graph Neural Network that implements a novel attentional message passing mechanism.

Graph Neural Network Object +1

VM-NeRF: Tackling Sparsity in NeRF with View Morphing

1 code implementation9 Oct 2022 Matteo Bortolon, Alessio Del Bue, Fabio Poiesi

A well-known limitation of NeRF methods is their reliance on data: the fewer the viewpoints, the higher the likelihood of overfitting.

Data Augmentation Novel View Synthesis

Learning Branched Fusion and Orthogonal Projection for Face-Voice Association

1 code implementation22 Aug 2022 Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Sajid Javed, Muhammad Haroon Yousaf, Alessio Del Bue

In addition, we leverage cross-modal verification and matching tasks to analyze the impact of multiple languages on face-voice association.

Metric Learning

Co-Located Human-Human Interaction Analysis using Nonverbal Cues: A Survey

no code implementations20 Jul 2022 Cigdem Beyan, Alessandro Vinciarelli, Alessio Del Bue

Automated co-located human-human interaction analysis has been addressed by the use of nonverbal communication as measurable evidence of social and psychological phenomena.

Privacy Preserving

PoserNet: Refining Relative Camera Poses Exploiting Object Detections

1 code implementation19 Jul 2022 Matteo Taiana, Matteo Toso, Stuart James, Alessio Del Bue

The estimation of the camera poses associated with a set of images commonly relies on feature matches between the images.

Graph Neural Network Object +1

GANzzle: Reframing jigsaw puzzle solving as a retrieval task using a generative mental image

1 code implementation12 Jul 2022 Davide Talon, Alessio Del Bue, Stuart James

Puzzle solving is a combinatorial challenge due to the difficulty of matching adjacent pieces.

Retrieval

Spatial Commonsense Graph for Object Localisation in Partial Scenes

1 code implementation CVPR 2022 Francesco Giuliari, Geri Skenderi, Marco Cristani, Yiming Wang, Alessio Del Bue

The SCG is used to estimate the unknown position of the target object in two steps: first, we feed the SCG into a novel Proximity Prediction Network, a graph neural network that uses attention to perform distance prediction between the node representing the target object and the nodes representing the observed objects in the SCG; second, we propose a Localisation Module based on circular intersection to estimate the object position using all the predicted pairwise distances in order to be independent of any reference system.

Graph Neural Network Object +1

Semantically Grounded Visual Embeddings for Zero-Shot Learning

no code implementations3 Jan 2022 Shah Nawaz, Jacopo Cavazza, Alessio Del Bue

Zero-shot learning methods rely on fixed visual and semantic embeddings, extracted from independent vision and language models, both pre-trained for other large-scale tasks.

Zero-Shot Learning

Fusion and Orthogonal Projection for Improved Face-Voice Association

2 code implementations20 Dec 2021 Muhammad Saad Saeed, Muhammad Haris Khan, Shah Nawaz, Muhammad Haroon Yousaf, Alessio Del Bue

Prior works adopt pairwise or triplet loss formulations to learn an embedding space amenable for associated matching and verification tasks.

Cross-Modal Retrieval Triplet

(Just) A Spoonful of Refinements Helps the Registration Error Go Down

1 code implementation ICCV 2021 Sérgio Agostinho, Aljoša Ošep, Alessio Del Bue, Laura Leal-Taixé

However, given the initial rotation estimate supplied by Kabsch, we show we can improve point correspondence learning during model training by extending the original optimization problem.

Point Cloud Registration

Consistent Mesh Colors for Multi-View Reconstructed 3D Scenes

no code implementations26 Jan 2021 Mohamed Dahy Elkhouly, Alessio Del Bue, Stuart James

We then use this color in a re-weighting ratio for the best-view texture, which is identified by prior mesh texturing work, to create a spatial consistent texture map.

LIGHTS: LIGHT Specularity Dataset for specular detection in Multi-view

no code implementations26 Jan 2021 Mohamed Dahy Elkhouly, Theodore Tsesmelis, Alessio Del Bue, Stuart James

Therefore, we propose a novel physically-based rendered LIGHT Specularity (LIGHTS) Dataset for the evaluation of the specular highlight detection task.

Highlight Detection

Where to Explore Next? ExHistCNN for History-aware Autonomous 3D Exploration

1 code implementation ECCV 2020 Yiming Wang, Alessio Del Bue

In this work we address the problem of autonomous 3D exploration of an unknown indoor environment using a depth camera.

3D Reconstruction

Single Image Human Proxemics Estimation for Visual Social Distancing

1 code implementation3 Nov 2020 Maya Aghaei, Matteo Bustreo, Yiming Wang, Gianluca Bailo, Pietro Morerio, Alessio Del Bue

In this work, we address the problem of estimating the so-called "Social Distancing" given a single uncalibrated image in unconstrained scenarios.

A Versatile Crack Inspection Portable System based on Classifier Ensemble and Controlled Illumination

no code implementations19 Oct 2020 Milind G. Padalkar, Carlos Beltrán-González, Matteo Bustreo, Alessio Del Bue, Vittorio Murino

This paper presents a novel setup for automatic visual inspection of cracks in ceramic tile as well as studies the effect of various classifiers and height-varying illumination conditions for this task.

The Visual Social Distancing Problem

no code implementations11 May 2020 Marco Cristani, Alessio Del Bue, Vittorio Murino, Francesco Setti, Alessandro Vinciarelli

One of the main and most effective measures to contain the recent viral outbreak is the maintenance of the so-called Social Distancing (SD).

Cross-modal Speaker Verification and Recognition: A Multilingual Perspective

no code implementations28 Apr 2020 Muhammad Saad Saeed, Shah Nawaz, Pietro Morerio, Arif Mahmood, Ignazio Gallo, Muhammad Haroon Yousaf, Alessio Del Bue

Recent years have seen a surge in finding association between faces and voices within a cross-modal biometric application along with speaker recognition.

Speaker Recognition Speaker Verification

An integrated light management system with real-time light measurement and human perception

no code implementations17 Apr 2020 Theodore Tsesmelis, Irtiza Hasan, Marco Cristani, Alessio Del Bue, Fabio Galasso

Illumination is important for well-being, productivity and safety across several environments, including offices, retail shops and industrial warehouses.

Management

re-OBJ: Jointly Learning the Foreground and Background for Object Instance Re-identification

1 code implementation17 Sep 2019 Vaibhav Bansal, Stuart James, Alessio Del Bue

Conventional approaches to object instance re-identification rely on matching appearances of the target objects among a set of frames.

Object

CvxPnPL: A Unified Convex Solution to the Absolute Pose Estimation Problem from Point and Line Correspondences

1 code implementation24 Jul 2019 Sérgio Agostinho, João Gomes, Alessio Del Bue

We present a new convex method to estimate 3D pose from mixed combinations of 2D-3D point and line correspondences, the Perspective-n-Points-and-Lines problem (PnPL).

Pose Estimation

Human-centric light sensing and estimation from RGBD images: The invisible light switch

no code implementations30 Jan 2019 Theodore Tsesmelis, Irtiza Hasan, Marco Cristani, Alessio Del Bue, Fabio Galasso

ILS may therefore dim those luminaires, which are not seen by the user, resulting in an effective energy saving, especially in large open offices (where light may otherwise be ON everywhere for a single person).

RGBD2lux: Dense light intensity estimation with an RGBD sensor

no code implementations20 Sep 2018 Theodore Tsesmelis, Irtiza Hasan, Marco Cristani, Fabio Galasso, Alessio Del Bue

The proposed method uses both depth data and images from the sensor to provide a dense measure of light intensity in the field of view of the camera.

Visual Graphs from Motion (VGfM): Scene understanding with object geometry reasoning

2 code implementations16 Jul 2018 Paul Gay, Stuart James, Alessio Del Bue

Recent approaches on visual scene understanding attempt to build a scene graph -- a computational representation of objects and their pairwise relationships.

3d scene graph generation Graph Generation +2

MX-LSTM: mixing tracklets and vislets to jointly forecast trajectories and head poses

no code implementations CVPR 2018 Irtiza Hasan, Francesco Setti, Theodore Tsesmelis, Alessio Del Bue, Fabio Galasso, Marco Cristani

Recent approaches on trajectory forecasting use tracklets to predict the future positions of pedestrians exploiting Long Short Term Memory (LSTM) architectures.

Trajectory Forecasting

Objects Localisation from Motion with Constraints

no code implementations28 Mar 2018 Paul Gay, Alessio Del Bue

This problem is modelled as the estimation of a set of quadrics given 2D conics fit to the object bounding boxes.

Object valid

A Benchmark and Evaluation of Non-Rigid Structure from Motion

no code implementations25 Jan 2018 Sebastian Hoppe Nesgaard Jensen, Mads Emil Brix Doest, Henrik Aanaes, Alessio Del Bue

To validate the applicability of this data set, and provide an investigation into the state of the art of NRSfM, including potential directions forward, we here present a benchmark and a scrupulous evaluation using this data set.

Practical Projective Structure From Motion (P2SfM)

no code implementations ICCV 2017 Ludovic Magerand, Alessio Del Bue

This paper presents a solution to the Projective Structure from Motion (PSfM) problem able to deal efficiently with missing data, outliers and, for the first time, large scale 3D reconstruction scenarios.

3D Reconstruction valid

Probabilistic Structure From Motion With Objects (PSfMO)

no code implementations ICCV 2017 Paul Gay, Cosimo Rubino, Vaibhav Bansal, Alessio Del Bue

We show that remarkable object localisation and volumetric occupancy can be recovered by including both geometrical constraints and prior information given by objects CAD models from the ShapeNet dataset.

3D Reconstruction Camera Calibration +1

Manifold Constrained Low-Rank Decomposition

no code implementations6 Aug 2017 Chen Chen, Baochang Zhang, Alessio Del Bue, Vittorio Murino

Low-rank decomposition (LRD) is a state-of-the-art method for visual data reconstruction and modelling.

Structure From Motion With Objects

no code implementations CVPR 2016 Marco Crocco, Cosimo Rubino, Alessio Del Bue

In practice, this work can be considered as the extension of Tomasi and Kanade factorization method using objects.

Camera Calibration Object +3

PiMPeR: Piecewise Dense 3D Reconstruction from Multi-View and Multi-Illumination Images

no code implementations16 Mar 2015 Reza Sabzevari, Vittori Murino, Alessio Del Bue

Unlike multi-view stereo and multi-view photometric stereo methods, this pipeline deals with wide-baseline images that are uncalibrated, in terms of both camera parameters and lighting conditions.

3D Reconstruction

3D Pose from Detections

no code implementations17 Feb 2015 Cosimo Rubino, Marco Crocco, Alessandro Perina, Vittorio Murino, Alessio Del Bue

We present a novel method to infer, in closed-form, a general 3D spatial occupancy and orientation of a collection of rigid objects given 2D image detections from a sequence of images.

Cannot find the paper you are looking for? You can Submit a new open access paper.