Search Results for author: Luigi Di Stefano

Found 53 papers, 22 papers with code

Connecting NeRFs, Images, and Text

no code implementations11 Apr 2024 Francesco Ballerini, Pierluigi Zama Ramirez, Roberto Mirabella, Samuele Salti, Luigi Di Stefano

Neural Radiance Fields (NeRFs) have emerged as a standard framework for representing 3D scenes and objects, introducing a novel data type for information exchange and storage.

Representation Learning Retrieval +1

Deep Learning on 3D Neural Fields

no code implementations20 Dec 2023 Pierluigi Zama Ramirez, Luca De Luigi, Daniele Sirocchi, Adriano Cardace, Riccardo Spezialetti, Francesco Ballerini, Samuele Salti, Luigi Di Stefano

In recent years, Neural Fields (NFs) have emerged as an effective tool for encoding diverse continuous signals such as images, videos, audio, and 3D shapes.

Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping

no code implementations7 Dec 2023 Alex Costanzino, Pierluigi Zama Ramirez, Giuseppe Lisanti, Luigi Di Stefano

The paper explores the industrial multimodal Anomaly Detection (AD) task, which exploits point clouds and RGB images to localize anomalies.

Anomaly Detection

Neural Processing of Tri-Plane Hybrid Neural Fields

1 code implementation2 Oct 2023 Adriano Cardace, Pierluigi Zama Ramirez, Francesco Ballerini, Allan Zhou, Samuele Salti, Luigi Di Stefano

While processing a field with the same reconstruction quality, we achieve task performance far superior to frameworks that process large MLPs and, for the first time, almost on par with architectures handling explicit representations.

Looking at words and points with attention: a benchmark for text-to-shape coherence

no code implementations14 Sep 2023 Andrea Amaduzzi, Giuseppe Lisanti, Samuele Salti, Luigi Di Stefano

The refined dataset, the new metric and a set of text-shape pairs validated by the user study comprise a novel, fine-grained benchmark that we publicly release to foster research on text-to-shape coherence of text-conditioned 3D generative models.

Learning Depth Estimation for Transparent and Mirror Surfaces

no code implementations ICCV 2023 Alex Costanzino, Pierluigi Zama Ramirez, Matteo Poggi, Fabio Tosi, Stefano Mattoccia, Luigi Di Stefano

Inferring the depth of transparent or mirror (ToM) surfaces represents a hard challenge for either sensors, algorithms, or deep networks.

Monocular Depth Estimation

ReLight My NeRF: A Dataset for Novel View Synthesis and Relighting of Real World Objects

no code implementations CVPR 2023 Marco Toschi, Riccardo De Matteo, Riccardo Spezialetti, Daniele De Gregorio, Luigi Di Stefano, Samuele Salti

By leveraging the dataset, we perform an ablation study on the relighting capability of variants of the vanilla NeRF architecture and identify a lightweight architecture that can render novel views of an object under novel light conditions, which we use to establish a non-trivial baseline for the dataset.

Image Relighting Novel View Synthesis

Deep Learning on Implicit Neural Representations of Shapes

no code implementations10 Feb 2023 Luca De Luigi, Adriano Cardace, Riccardo Spezialetti, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano

Implicit Neural Representations (INRs) have emerged in the last few years as a powerful tool to encode continuously a variety of different signals like images, videos, audio and 3D shapes.

Learning Good Features to Transfer Across Tasks and Domains

no code implementations26 Jan 2023 Pierluigi Zama Ramirez, Adriano Cardace, Luca De Luigi, Alessio Tonioni, Samuele Salti, Luigi Di Stefano

Besides, we propose a set of strategies to constrain the learned feature spaces, to ease learning and increase the generalization capability of the mapping network, thereby considerably improving the final performance of our framework.

Monocular Depth Estimation Semantic Segmentation

ScanNeRF: a Scalable Benchmark for Neural Radiance Fields

no code implementations24 Nov 2022 Luca De Luigi, Damiano Bolognini, Federico Domeniconi, Daniele De Gregorio, Matteo Poggi, Luigi Di Stefano

In this paper, we propose the first-ever real benchmark thought for evaluating Neural Radiance Fields (NeRFs) and, in general, Neural Rendering (NR) frameworks.

Benchmarking Neural Rendering

Self-Distillation for Unsupervised 3D Domain Adaptation

no code implementations15 Oct 2022 Adriano Cardace, Riccardo Spezialetti, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano

In contrast, in this work, we focus on obtaining a discriminative feature space for the target domain enforcing consistency between a point cloud and its augmented version.

Classification Point Cloud Classification +2

Cross-Spectral Neural Radiance Fields

no code implementations1 Sep 2022 Matteo Poggi, Pierluigi Zama Ramirez, Fabio Tosi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

We propose X-NeRF, a novel method to learn a Cross-Spectral scene representation given images captured from cameras with different light spectrum sensitivity, based on the Neural Radiance Fields formulation.

RGB-Multispectral Matching: Dataset, Learning Methodology, Evaluation

no code implementations CVPR 2022 Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

We address the problem of registering synchronized color (RGB) and multi-spectral (MS) images featuring very different resolution by solving stereo matching correspondences.

Stereo Matching

Learning the Space of Deep Models

1 code implementation10 Jun 2022 Gianluca Berardi, Luca De Luigi, Samuele Salti, Luigi Di Stefano

In particular, we show that it is possible to use representation learning to learn a fixed-size, low-dimensional embedding space of trained deep models and that such space can be explored by interpolation or optimization to attain ready-to-use models.

Image Classification Representation Learning

Open Challenges in Deep Stereo: the Booster Dataset

no code implementations CVPR 2022 Pierluigi Zama Ramirez, Fabio Tosi, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

We present a novel high-resolution and challenging stereo dataset framing indoor scenes annotated with dense and accurate ground-truth disparities.

Neural Disparity Refinement for Arbitrary Resolution Stereo

1 code implementation28 Oct 2021 Filippo Aleotti, Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

We introduce a novel architecture for neural disparity refinement aimed at facilitating deployment of 3D computer vision on cheap and widespread consumer devices, such as mobile phones.

Zero-shot Generalization

Shallow Features Guide Unsupervised Domain Adaptation for Semantic Segmentation at Class Boundaries

1 code implementation6 Oct 2021 Adriano Cardace, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano

Although deep neural networks have achieved remarkable results for the task of semantic segmentation, they usually fail to generalize towards new domains, especially when performing synthetic-to-real adaptation.

Data Augmentation Segmentation +2

Effective Deployment of CNNs for 3DoF Pose Estimation and Grasping in Industrial Settings

1 code implementation24 Dec 2020 Daniele De Gregorio, Riccardo Zanella, Gianluca Palli, Luigi Di Stefano

In this paper we investigate how to effectively deploy deep learning in practical industrial settings, such as robotic grasping applications.

Pose Estimation Robotic Grasping

SAFFIRE: System for Autonomous Feature Filtering and Intelligent ROI Estimation

no code implementations4 Dec 2020 Marco Boschi, Luigi Di Stefano, Martino Alessandrini

The framework is specialized here in the context of a machine vision system for automated product inspection.

Learning to Orient Surfaces by Self-supervised Spherical CNNs

1 code implementation NeurIPS 2020 Riccardo Spezialetti, Federico Stella, Marlon Marcon, Luciano Silva, Samuele Salti, Luigi Di Stefano

In this work, we show the feasibility of learning a robust canonical orientation for surfaces represented as point clouds.

Continual Adaptation for Deep Stereo

1 code implementation10 Jul 2020 Matteo Poggi, Alessio Tonioni, Fabio Tosi, Stefano Mattoccia, Luigi Di Stefano

Thus, our network architecture and adaptation algorithms realize the first real-time self-adaptive deep stereo system and pave the way for a new paradigm that can facilitate practical deployment of end-to-end architectures for dense disparity regression.

Depth Estimation

Ambiguity in Sequential Data: Predicting Uncertain Futures with Recurrent Models

no code implementations10 Mar 2020 Alessandro Berlati, Oliver Scheel, Luigi Di Stefano, Federico Tombari

Ambiguity is inherently present in many machine learning tasks, but especially for sequential models seldom accounted for, as most only output a single prediction.

Time Series Time Series Analysis +1

Boosting Object Recognition in Point Clouds by Saliency Detection

no code implementations6 Nov 2019 Marlon Marcon, Riccardo Spezialetti, Samuele Salti, Luciano Silva, Luigi Di Stefano

Object recognition in 3D point clouds is a challenging task, mainly when time is an important factor to deal with, such as in industrial applications.

Object Recognition Saliency Detection

Shooting Labels: 3D Semantic Labeling by Virtual Reality

1 code implementation11 Oct 2019 Pierluigi Zama Ramirez, Claudio Paternesi, Luca De Luigi, Luigi Lella, Daniele De Gregorio, Luigi Di Stefano

Availability of a few, large-size, annotated datasets, like ImageNet, Pascal VOC and COCO, has lead deep learning to revolutionize computer vision research by achieving astonishing results in several vision tasks. We argue that new tools to facilitate generation of annotated datasets may help spreading data-driven AI throughout applications and domains.

3D Semantic Segmentation

Learning an Effective Equivariant 3D Descriptor Without Supervision

no code implementations ICCV 2019 Riccardo Spezialetti, Samuele Salti, Luigi Di Stefano

Establishing correspondences between 3D shapes is a fundamental task in 3D Computer Vision, typically addressed by matching local descriptors.

Performance Evaluation of Learned 3D Features

no code implementations15 Sep 2019 Riccardo Spezialetti, Samuele Salti, Luigi Di Stefano

Matching surfaces is a challenging 3D Computer Vision problem typically addressed by local features.

Object Recognition

Unsupervised Domain Adaptation for Depth Prediction from Images

1 code implementation9 Sep 2019 Alessio Tonioni, Matteo Poggi, Stefano Mattoccia, Luigi Di Stefano

Extensive experimental results based on standard datasets and evaluation protocols prove that our technique can address effectively the domain shift issue with both stereo and monocular depth prediction architectures and outperforms other state-of-the-art unsupervised loss functions that may be alternatively deployed to pursue domain adaptation.

Depth Estimation Depth Prediction +1

Semi-Automatic Labeling for Deep Learning in Robotics

1 code implementation5 Aug 2019 Daniele De Gregorio, Alessio Tonioni, Gianluca Palli, Luigi Di Stefano

In this paper, we propose Augmented Reality Semi-automatic labeling (ARS), a semi-automatic method which leverages on moving a 2D camera by means of a robot, proving precise camera tracking, and an augmented reality pen to define initial object bounding box, to create large labeled datasets with minimal human intervention.

Object object-detection +1

Real-Time Highly Accurate Dense Depth on a Power Budget using an FPGA-CPU Hybrid SoC

no code implementations17 Jul 2019 Oscar Rahnama, Tommaso Cavallari, Stuart Golodetz, Alessio Tonioni, Thomas Joy, Luigi Di Stefano, Simon Walker, Philip H. S. Torr

Obtaining highly accurate depth from stereo images in real time has many applications across computer vision and robotics, but in some contexts, upper bounds on power consumption constrain the feasible hardware to embedded platforms such as FPGAs.

Learning to Adapt for Stereo

1 code implementation CVPR 2019 Alessio Tonioni, Oscar Rahnama, Thomas Joy, Luigi Di Stefano, Thalaiyasingam Ajanthan, Philip H. S. Torr

Real world applications of stereo depth estimation require models that are robust to dynamic variations in the environment.

Autonomous Driving Stereo Depth Estimation

Domain invariant hierarchical embedding for grocery products recognition

no code implementations2 Feb 2019 Alessio Tonioni, Luigi Di Stefano

Moreover, there exist a significant domain shift between the images that should be recognized at test time, taken in stores by cheap cameras, and those available for training, usually just one or a few studio-quality images per product.

Real-Time RGB-D Camera Pose Estimation in Novel Scenes using a Relocalisation Cascade

1 code implementation29 Oct 2018 Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, Victor A. Prisacariu, Luigi Di Stefano, Philip H. S. Torr

The adapted forests achieved relocalisation performance that was on par with that of offline forests, and our approach was able to estimate the camera pose in close to real time.

Pose Estimation

Exploiting Semantics in Adversarial Training for Image-Level Domain Adaptation

no code implementations13 Oct 2018 Pierluigi Zama Ramirez, Alessio Tonioni, Luigi Di Stefano

To prove the effectiveness of our proposal, we show how a semantic segmentation CNN trained on images from the synthetic GTA dataset adapted by our method can improve performance by more than 16% mIoU with respect to the same model trained on synthetic images.

Domain Adaptation Segmentation +2

Real-time self-adaptive deep stereo

1 code implementation CVPR 2019 Alessio Tonioni, Fabio Tosi, Matteo Poggi, Stefano Mattoccia, Luigi Di Stefano

Deep convolutional neural networks trained end-to-end are the state-of-the-art methods to regress dense disparity maps from stereo pairs.

Stereo Depth Estimation

Geometry meets semantics for semi-supervised monocular depth estimation

1 code implementation9 Oct 2018 Pierluigi Zama Ramirez, Matteo Poggi, Fabio Tosi, Stefano Mattoccia, Luigi Di Stefano

For unsupervised training of these models, geometry has been effectively exploited by suitable images warping losses computed from views acquired by a stereo rig or a moving camera.

Depth Prediction Monocular Depth Estimation +1

A deep learning pipeline for product recognition on store shelves

no code implementations3 Oct 2018 Alessio Tonioni, Eugenio Serra, Luigi Di Stefano

Then, available product databases usually include just one or a few studio-quality images per product (referred to herein as reference images), whilst at test time recognition is performed on pictures displaying a portion of a shelf containing several products and taken in the store by cheap cameras (referred to as query images).

Image Retrieval object-detection +2

Unsupervised Adaptation for Deep Stereo

1 code implementation ICCV 2017 Alessio Tonioni, Matteo Poggi, Stefano Mattoccia, Luigi Di Stefano

Recent ground-breaking works have shown that deep neural networks can be trained end-to-end to regress dense disparity maps directly from image pairs.

Product recognition in store shelves as a sub-graph isomorphism problem

no code implementations26 Jul 2017 Alessio Tonioni, Luigi Di Stefano

The arrangement of products in store shelves is carefully planned to maximize sales and keep customers happy.

SkiMap: An Efficient Mapping Framework for Robot Navigation

1 code implementation19 Apr 2017 Daniele De Gregorio, Luigi Di Stefano

We present a novel mapping framework for robot navigation which features a multi-level querying system capable to obtain rapidly representations as diverse as a 3D voxel grid, a 2. 5D height map and a 2D occupancy grid.

Robot Navigation

On-the-Fly Adaptation of Regression Forests for Online Camera Relocalisation

no code implementations CVPR 2017 Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, Luigi Di Stefano, Philip H. S. Torr

Camera relocalisation is an important problem in computer vision, with applications in simultaneous localisation and mapping, virtual/augmented reality and navigation.

Camera Relocalization regression

Learning a Descriptor-Specific 3D Keypoint Detector

no code implementations ICCV 2015 Samuele Salti, Federico Tombari, Riccardo Spezialetti, Luigi Di Stefano

Keypoint detection represents the first stage in the majority of modern computer vision pipelines based on automatically established correspondences between local descriptors.

Binary Classification Keypoint Detection

Volume-based Semantic Labeling with Signed Distance Functions

no code implementations13 Nov 2015 Tommaso Cavallari, Luigi Di Stefano

Research works on the two topics of Semantic Segmentation and SLAM (Simultaneous Localization and Mapping) have been following separate tracks.

Segmentation Semantic Segmentation +1

Keypoints from Symmetries by Wave Propagation

no code implementations CVPR 2013 Samuele Salti, Alessandro Lanza, Luigi Di Stefano

The paper conjectures and demonstrates that repeatable keypoints based on salient symmetries at different scales can be detected by a novel analysis grounded on the wave equation rather than the heat equation underlying traditional Gaussian scale-space theory.

Joint Detection, Tracking and Mapping by Semantic Bundle Adjustment

no code implementations CVPR 2013 Nicola Fioraio, Luigi Di Stefano

In this paper we propose a novel Semantic Bundle Adjustment framework whereby known rigid stationary objects are detected while tracking the camera and mapping the environment.

3D Reconstruction Object +2

Cannot find the paper you are looking for? You can Submit a new open access paper.