no code implementations • 27 Sep 2024 • Mahtab Dahaghin, Myrna Castillo, Kourosh Riahidehkordi, Matteo Toso, Alessio Del Bue
We propose a pipeline to generate a 3D replica of a scene using only RGB images (e. g. photos of a museum) and then extract a model for each item of interest (e. g. pieces in the exhibit).
no code implementations • 5 Aug 2024 • Mohammad Zohaib, Luca Cosmo, Alessio Del Bue
Unsupervised 3D keypoints estimation from Point Cloud Data (PCD) is a complex task, even more challenging when an object shape is deforming.
no code implementations • 22 Jul 2024 • Matteo Bortolon, Theodore Tsesmelis, Stuart James, Fabio Poiesi, Alessio Del Bue
Each Ellicell ray is associated with the rendering parameters of each ellipsoid, which in turn is used to obtain the best bindings between the target image pixels and the cast rays.
no code implementations • 7 Jun 2024 • Francesco Taioli, Stefano Rosa, Alberto Castellini, Lorenzo Natale, Alessio Del Bue, Alessandro Farinelli, Marco Cristani, Yiming Wang
We evaluate the proposed I2EDL on a dataset of instructions containing errors, and further devise a novel metric, the Success weighted by Interaction Number (SIN), to reflect both the navigation performance and the interaction effectiveness.
no code implementations • 19 Apr 2024 • Myrna C. Silva, Mahtab Dahaghin, Matteo Toso, Alessio Del Bue
Recent works in novel-view synthesis have shown how to model the appearance of a scene via a cloud of 3D Gaussians, and how to generate accurate images from a given viewpoint by projecting on it the Gaussians before $\alpha$ blending their color.
no code implementations • 16 Apr 2024 • Mattia Litrico, Davide Talon, Sebastiano Battiato, Alessio Del Bue, Mario Valerio Giuffrida, Pietro Morerio
We propose a novel approach for SF-OSDA that exploits the granularity of target-private categories by segregating their samples into multiple unknown classes.
1 code implementation • 1 Apr 2024 • David Svitov, Pietro Morerio, Lourdes Agapito, Alessio Del Bue
We demonstrate the effectiveness of our approach on two open datasets: SnapshotPeople and X-Humans.
no code implementations • 19 Mar 2024 • Matteo Bortolon, Theodore Tsesmelis, Stuart James, Fabio Poiesi, Alessio Del Bue
We introduce IFFNeRF to estimate the six degrees-of-freedom (6DoF) camera pose of a given image, building on the Neural Radiance Fields (NeRF) formulation.
no code implementations • 15 Mar 2024 • Francesco Taioli, Stefano Rosa, Alberto Castellini, Lorenzo Natale, Alessio Del Bue, Alessandro Farinelli, Marco Cristani, Yiming Wang
Moreover, we formally define the task of Instruction Error Detection and Localization, and establish an evaluation protocol on top of our benchmark dataset.
no code implementations • 14 Mar 2024 • Davide Talon, Phillip Lippe, Stuart James, Alessio Del Bue, Sara Magliacane
Causal Representation Learning (CRL) aims at identifying high-level causal factors and their relationships from high-dimensional observations, e. g., images.
no code implementations • 13 Mar 2024 • Matteo Taiana, Matteo Toso, Stuart James, Alessio Del Bue
Robustly estimating camera poses from a set of images is a fundamental task which remains challenging for differentiable methods, especially in the case of small and sparse camera pose graphs.
1 code implementation • CVPR 2024 • Gianluca Scarpellini, Stefano Fiorini, Francesco Giuliari, Pietro Morerio, Alessio Del Bue
Reassembly tasks play a fundamental role in many fields and multiple approaches exist to solve specific reassembly problems.
no code implementations • 7 Nov 2023 • Javed Ahmad, Alessio Del Bue
The strong multi-modal features from the mmFUSION framework are fed to a simple 3D detection head for 3D predictions.
1 code implementation • 3 Oct 2023 • Julio Ivan Davila Carrazco, Pietro Morerio, Alessio Del Bue, Vittorio Murino
This paper presents a classification framework based on learnable data augmentation to tackle the One-Shot Unsupervised Domain Adaptation (OS-UDA) problem.
no code implementations • 16 Aug 2023 • Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue
Compared to existing video modeling architectures for action anticipation, NAOGAT captures the relationship between objects and the global scene context in order to predict detections for the next active object and anticipate relevant future actions given these detections, leveraging the objects' dynamics to improve accuracy.
1 code implementation • ICCV 2023 • Mohammad Zohaib, Alessio Del Bue
This paper proposes a new method to infer keypoints from arbitrary object categories in practical scenarios where point cloud data (PCD) are noisy, down-sampled and arbitrarily rotated.
1 code implementation • ICCV 2023 • SHAFIQ AHMAD, Pietro Morerio, Alessio Del Bue
In this work, we also bring to the community the first ever event-based person ReId dataset gathered to evaluate the performance of our approach.
1 code implementation • 25 May 2023 • Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue
In this technical report, we describe the Guided-Attention mechanism based solution for the short-term anticipation (STA) challenge for the EGO4D challenge.
Ranked #3 on Short-term Object Interaction Anticipation on Ego4D
1 code implementation • 22 May 2023 • Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue
To this end, we propose a novel approach that applies a guided attention mechanism between the objects, and the spatiotemporal features extracted from video clips, enhancing the motion and contextual information, and further decoding the object-centric and motion-centric information to address the problem of STA in egocentric videos.
no code implementations • 8 May 2023 • Julio Ivan Davila Carrazco, Suvarna Kishorkumar Kadam, Pietro Morerio, Alessio Del Bue, Vittorio Murino
Unlike existing methods, our augmentation module allows for strong transformations of the source samples, and the style of the single target sample available is exploited to guide the augmentation by ensuring perceptual similarity.
One-shot Unsupervised Domain Adaptation Unsupervised Domain Adaptation
1 code implementation • 13 Apr 2023 • Matteo Toso, Matteo Taiana, Stuart James, Alessio Del Bue
Efficient visual localization is crucial to many applications, such as large-scale deployment of autonomous agents and augmented reality.
1 code implementation • 20 Mar 2023 • Francesco Giuliari, Gianluca Scarpellini, Stuart James, Yiming Wang, Alessio Del Bue
We present Positional Diffusion, a plug-and-play graph formulation with Diffusion Probabilistic Models to address positional reasoning.
2 code implementations • CVPR 2023 • Mattia Litrico, Alessio Del Bue, Pietro Morerio
We propose a novel approach for the SF-UDA setting based on a loss reweighting strategy that brings robustness against the noise that inevitably affects the pseudo-labels.
no code implementations • 21 Feb 2023 • Gianluca Scarpellini, Stefano Rosa, Pietro Morerio, Lorenzo Natale, Alessio Del Bue
Object detectors often experience a drop in performance when new environmental conditions are insufficiently represented in the training data.
no code implementations • 13 Feb 2023 • Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue
This paper addresses the problem of anticipating the next-active-object location in the future, for a given egocentric video clip where the contact might happen, before any action takes place.
1 code implementation • 7 Feb 2023 • Gianluca Scarpellini, Stefano Rosa, Pietro Morerio, Lorenzo Natale, Alessio Del Bue
When an object detector is deployed in a novel setting it often experiences a drop in performance.
no code implementations • 2 Jan 2023 • Seyed S. Mohammadi, Nuno F. Duarte, Dimitris Dimou, Yiming Wang, Matteo Taiana, Pietro Morerio, Atabak Dehban, Plinio Moreno, Alexandre Bernardino, Alessio Del Bue, Jose Santos-Victor
However, in practice, PCDs are often incomplete when objects are viewed from few and sparse viewpoints before the grasping action, leading to the generation of wrong or inaccurate grasp poses.
no code implementations • 1 Nov 2022 • Francesco Giuliari, Geri Skenderi, Marco Cristani, Alessio Del Bue, Yiming Wang
With the proposed graph-based scene representation, we estimate the unknown position of the target object using a Graph Neural Network that implements a novel attentional message passing mechanism.
1 code implementation • 9 Oct 2022 • Matteo Bortolon, Alessio Del Bue, Fabio Poiesi
A well-known limitation of NeRF methods is their reliance on data: the fewer the viewpoints, the higher the likelihood of overfitting.
1 code implementation • 8 Sep 2022 • Hebatallah A. Mohamed, Sebastiano Vascon, Feliks Hibraj, Stuart James, Diego Pilutti, Alessio Del Bue, Marcello Pelillo
Knowledge Graphs (KGs) have proven to be a reliable way of structuring data.
1 code implementation • 22 Aug 2022 • Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Sajid Javed, Muhammad Haroon Yousaf, Alessio Del Bue
In addition, we leverage cross-modal verification and matching tasks to analyze the impact of multiple languages on face-voice association.
no code implementations • 20 Jul 2022 • Cigdem Beyan, Alessandro Vinciarelli, Alessio Del Bue
Automated co-located human-human interaction analysis has been addressed by the use of nonverbal communication as measurable evidence of social and psychological phenomena.
1 code implementation • 19 Jul 2022 • Matteo Taiana, Matteo Toso, Stuart James, Alessio Del Bue
The estimation of the camera poses associated with a set of images commonly relies on feature matches between the images.
1 code implementation • 12 Jul 2022 • Davide Talon, Alessio Del Bue, Stuart James
Puzzle solving is a combinatorial challenge due to the difficulty of matching adjacent pieces.
1 code implementation • 21 Apr 2022 • Giancarlo Paoletti, Jacopo Cavazza, Cigdem Beyan, Alessio Del Bue
This paper presents a novel end-to-end method for the problem of skeleton-based unsupervised human action recognition.
1 code implementation • CVPR 2022 • Francesco Giuliari, Geri Skenderi, Marco Cristani, Yiming Wang, Alessio Del Bue
The SCG is used to estimate the unknown position of the target object in two steps: first, we feed the SCG into a novel Proximity Prediction Network, a graph neural network that uses attention to perform distance prediction between the node representing the target object and the nodes representing the observed objects in the SCG; second, we propose a Localisation Module based on circular intersection to estimate the object position using all the predicted pairwise distances in order to be independent of any reference system.
no code implementations • 3 Jan 2022 • Shah Nawaz, Jacopo Cavazza, Alessio Del Bue
Zero-shot learning methods rely on fixed visual and semantic embeddings, extracted from independent vision and language models, both pre-trained for other large-scale tasks.
2 code implementations • 20 Dec 2021 • Muhammad Saad Saeed, Muhammad Haris Khan, Shah Nawaz, Muhammad Haroon Yousaf, Alessio Del Bue
Prior works adopt pairwise or triplet loss formulations to learn an embedding space amenable for associated matching and verification tasks.
1 code implementation • IEEE International Conference on Image Processing 2021 • Seyed Saber Mohammadi, Yiming Wang, Alessio Del Bue
We address 3D shape classification with partial point cloud inputs captured from multiple viewpoints around the object.
Ranked #1 on 3D Point Cloud Classification on ModelNet40
1 code implementation • ICCV 2021 • Sérgio Agostinho, Aljoša Ošep, Alessio Del Bue, Laura Leal-Taixé
However, given the initial rotation estimate supplied by Kabsch, we show we can improve point correspondence learning during model training by extending the original optimization problem.
1 code implementation • 21 Apr 2021 • Gianluca Scarpellini, Pietro Morerio, Alessio Del Bue
Here we propose the first learning-based method for 3D human pose from a single stream of events.
Ranked #2 on 3D Human Pose Estimation on DHP19
no code implementations • 26 Jan 2021 • Mohamed Dahy Elkhouly, Alessio Del Bue, Stuart James
We then use this color in a re-weighting ratio for the best-view texture, which is identified by prior mesh texturing work, to create a spatial consistent texture map.
no code implementations • 26 Jan 2021 • Mohamed Dahy Elkhouly, Theodore Tsesmelis, Alessio Del Bue, Stuart James
Therefore, we propose a novel physically-based rendered LIGHT Specularity (LIGHTS) Dataset for the evaluation of the specular highlight detection task.
1 code implementation • 11 Dec 2020 • Paolo Soda, Natascha Claudia D'Amico, Jacopo Tessadori, Giovanni Valbusa, Valerio Guarrasi, Chandra Bortolotto, Muhammad Usman Akbar, Rosa Sicilia, Ermanno Cordelli, Deborah Fazzini, Michaela Cellina, Giancarlo Oliva, Giovanni Callea, Silvia Panella, Maurizio Cariati, Diletta Cozzi, Vittorio Miele, Elvira Stellato, Gian Paolo Carrafiello, Giulia Castorani, Annalisa Simeone, Lorenzo Preda, Giulio Iannello, Alessio Del Bue, Fabio Tedoldi, Marco Alì, Diego Sona, Sergio Papa
Exhaustive evaluation shows promising performance both in 10-fold and leave-one-centre-out cross-validation, implying that clinical data and images have the potential to provide useful information for the management of patients and hospital resources.
1 code implementation • ECCV 2020 • Yiming Wang, Alessio Del Bue
In this work we address the problem of autonomous 3D exploration of an unknown indoor environment using a depth camera.
1 code implementation • 3 Nov 2020 • Maya Aghaei, Matteo Bustreo, Yiming Wang, Gianluca Bailo, Pietro Morerio, Alessio Del Bue
In this work, we address the problem of estimating the so-called "Social Distancing" given a single uncalibrated image in unconstrained scenarios.
no code implementations • 19 Oct 2020 • Milind G. Padalkar, Carlos Beltrán-González, Matteo Bustreo, Alessio Del Bue, Vittorio Murino
This paper presents a novel setup for automatic visual inspection of cracks in ceramic tile as well as studies the effect of various classifiers and height-varying illumination conditions for this task.
no code implementations • 17 Sep 2020 • Yiming Wang, Francesco Giuliari, Riccardo Berra, Alberto Castellini, Alessio Del Bue, Alessandro Farinelli, Marco Cristani, Francesco Setti
Our POMP method uses as input the current pose of an agent (e. g. a robot) and a RGB-D frame.
1 code implementation • 21 Jun 2020 • Giancarlo Paoletti, Jacopo Cavazza, Cigdem Beyan, Alessio Del Bue
This paper tackles the problem of human action recognition, defined as classifying which action is displayed in a trimmed sequence, from skeletal data.
Ranked #1 on Skeleton Based Action Recognition on MSR ActionPairs
no code implementations • 11 May 2020 • Marco Cristani, Alessio Del Bue, Vittorio Murino, Francesco Setti, Alessandro Vinciarelli
One of the main and most effective measures to contain the recent viral outbreak is the maintenance of the so-called Social Distancing (SD).
no code implementations • 28 Apr 2020 • Muhammad Saad Saeed, Shah Nawaz, Pietro Morerio, Arif Mahmood, Ignazio Gallo, Muhammad Haroon Yousaf, Alessio Del Bue
Recent years have seen a surge in finding association between faces and voices within a cross-modal biometric application along with speaker recognition.
no code implementations • 20 Apr 2020 • Maya Aghaei, Matteo Bustreo, Pietro Morerio, Nicolo Carissimi, Alessio Del Bue, Vittorio Murino
The design of an automatic visual inspection system is usually performed in two stages.
no code implementations • 17 Apr 2020 • Theodore Tsesmelis, Irtiza Hasan, Marco Cristani, Alessio Del Bue, Fabio Galasso
Illumination is important for well-being, productivity and safety across several environments, including offices, retail shops and industrial warehouses.
no code implementations • 17 Apr 2020 • Avik Hati, Matteo Bustreo, Diego Sona, Vittorio Murino, Alessio Del Bue
We aim at digitally unwrapping the mummy and identify different segments such as body, bandages and jewelry.
1 code implementation • 17 Sep 2019 • Vaibhav Bansal, Stuart James, Alessio Del Bue
Conventional approaches to object instance re-identification rely on matching appearances of the target objects among a set of frames.
1 code implementation • 24 Jul 2019 • Sérgio Agostinho, João Gomes, Alessio Del Bue
We present a new convex method to estimate 3D pose from mixed combinations of 2D-3D point and line correspondences, the Perspective-n-Points-and-Lines problem (PnPL).
no code implementations • 30 Jan 2019 • Theodore Tsesmelis, Irtiza Hasan, Marco Cristani, Alessio Del Bue, Fabio Galasso
ILS may therefore dim those luminaires, which are not seen by the user, resulting in an effective energy saving, especially in large open offices (where light may otherwise be ON everywhere for a single person).
no code implementations • 7 Jan 2019 • Irtiza Hasan, Francesco Setti, Theodore Tsesmelis, Vasileios Belagiannis, Sikandar Amin, Alessio Del Bue, Marco Cristani, Fabio Galasso
In this work, we explore the correlation between people trajectories and their head orientations.
no code implementations • 20 Sep 2018 • Theodore Tsesmelis, Irtiza Hasan, Marco Cristani, Fabio Galasso, Alessio Del Bue
The proposed method uses both depth data and images from the sensor to provide a dense measure of light intensity in the field of view of the camera.
2 code implementations • 16 Jul 2018 • Paul Gay, Stuart James, Alessio Del Bue
Recent approaches on visual scene understanding attempt to build a scene graph -- a computational representation of objects and their pairwise relationships.
no code implementations • CVPR 2018 • Irtiza Hasan, Francesco Setti, Theodore Tsesmelis, Alessio Del Bue, Fabio Galasso, Marco Cristani
Recent approaches on trajectory forecasting use tracklets to predict the future positions of pedestrians exploiting Long Short Term Memory (LSTM) architectures.
no code implementations • 28 Mar 2018 • Paul Gay, Alessio Del Bue
This problem is modelled as the estimation of a set of quadrics given 2D conics fit to the object bounding boxes.
no code implementations • IEEE Winter Conference on Applications of Computer Vision (WACV) 2018 • Irtiza Hasan, Francesco Setti, Theodore Tsesmelis, Alessio Del Bue, Marco Cristani, Fabio Galasso
In this paper we show the importance of the head pose estimation in the task of trajectory forecasting.
no code implementations • 25 Jan 2018 • Sebastian Hoppe Nesgaard Jensen, Mads Emil Brix Doest, Henrik Aanaes, Alessio Del Bue
To validate the applicability of this data set, and provide an investigation into the state of the art of NRSfM, including potential directions forward, we here present a benchmark and a scrupulous evaluation using this data set.
no code implementations • ICCV 2017 • Ludovic Magerand, Alessio Del Bue
This paper presents a solution to the Projective Structure from Motion (PSfM) problem able to deal efficiently with missing data, outliers and, for the first time, large scale 3D reconstruction scenarios.
no code implementations • ICCV 2017 • Paul Gay, Cosimo Rubino, Vaibhav Bansal, Alessio Del Bue
We show that remarkable object localisation and volumetric occupancy can be recovered by including both geometrical constraints and prior information given by objects CAD models from the ShapeNet dataset.
no code implementations • 6 Aug 2017 • Chen Chen, Baochang Zhang, Alessio Del Bue, Vittorio Murino
Low-rank decomposition (LRD) is a state-of-the-art method for visual data reconstruction and modelling.
no code implementations • CVPR 2016 • Marco Crocco, Cosimo Rubino, Alessio Del Bue
In practice, this work can be considered as the extension of Tomasi and Kanade factorization method using objects.
no code implementations • CVPR 2015 • Baochang Zhang, Alessandro Perina, Vittorio Murino, Alessio Del Bue
The fact that image data samples lie on a manifold has been successfully exploited in many learning and inference problems.
no code implementations • 16 Mar 2015 • Reza Sabzevari, Vittori Murino, Alessio Del Bue
Unlike multi-view stereo and multi-view photometric stereo methods, this pipeline deals with wide-baseline images that are uncalibrated, in terms of both camera parameters and lighting conditions.
no code implementations • 17 Feb 2015 • Cosimo Rubino, Marco Crocco, Alessandro Perina, Vittorio Murino, Alessio Del Bue
We present a novel method to infer, in closed-form, a general 3D spatial occupancy and orientation of a collection of rigid objects given 2D image detections from a sequence of images.