Search Results for author: Luigi Di Stefano

Found 53 papers, 22 papers with code

Connecting NeRFs, Images, and Text

no code implementations • 11 Apr 2024 • Francesco Ballerini, Pierluigi Zama Ramirez, Roberto Mirabella, Samuele Salti, Luigi Di Stefano

Neural Radiance Fields (NeRFs) have emerged as a standard framework for representing 3D scenes and objects, introducing a novel data type for information exchange and storage.

Representation Learning Retrieval +1

Paper
Add Code

Test Time Training for Industrial Anomaly Segmentation

no code implementations • 4 Apr 2024 • Alex Costanzino, Pierluigi Zama Ramirez, Mirko Del Moro, Agostino Aiezzo, Giuseppe Lisanti, Samuele Salti, Luigi Di Stefano

Anomaly Detection and Segmentation (AD&S) is crucial for industrial quality control.

Anomaly Detection Segmentation

Paper
Add Code

Deep Learning on 3D Neural Fields

no code implementations • 20 Dec 2023 • Pierluigi Zama Ramirez, Luca De Luigi, Daniele Sirocchi, Adriano Cardace, Riccardo Spezialetti, Francesco Ballerini, Samuele Salti, Luigi Di Stefano

In recent years, Neural Fields (NFs) have emerged as an effective tool for encoding diverse continuous signals such as images, videos, audio, and 3D shapes.

Paper
Add Code

Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping

no code implementations • 7 Dec 2023 • Alex Costanzino, Pierluigi Zama Ramirez, Giuseppe Lisanti, Luigi Di Stefano

The paper explores the industrial multimodal Anomaly Detection (AD) task, which exploits point clouds and RGB images to localize anomalies.

Anomaly Detection

Paper
Add Code

Neural Processing of Tri-Plane Hybrid Neural Fields

1 code implementation • 2 Oct 2023 • Adriano Cardace, Pierluigi Zama Ramirez, Francesco Ballerini, Allan Zhou, Samuele Salti, Luigi Di Stefano

While processing a field with the same reconstruction quality, we achieve task performance far superior to frameworks that process large MLPs and, for the first time, almost on par with architectures handling explicit representations.

Paper
Code

Looking at words and points with attention: a benchmark for text-to-shape coherence

no code implementations • 14 Sep 2023 • Andrea Amaduzzi, Giuseppe Lisanti, Samuele Salti, Luigi Di Stefano

The refined dataset, the new metric and a set of text-shape pairs validated by the user study comprise a novel, fine-grained benchmark that we publicly release to foster research on text-to-shape coherence of text-conditioned 3D generative models.

Paper
Add Code

Learning Depth Estimation for Transparent and Mirror Surfaces

no code implementations • ICCV 2023 • Alex Costanzino, Pierluigi Zama Ramirez, Matteo Poggi, Fabio Tosi, Stefano Mattoccia, Luigi Di Stefano

Inferring the depth of transparent or mirror (ToM) surfaces represents a hard challenge for either sensors, algorithms, or deep networks.

Monocular Depth Estimation

Paper
Add Code

ReLight My NeRF: A Dataset for Novel View Synthesis and Relighting of Real World Objects

no code implementations • CVPR 2023 • Marco Toschi, Riccardo De Matteo, Riccardo Spezialetti, Daniele De Gregorio, Luigi Di Stefano, Samuele Salti

By leveraging the dataset, we perform an ablation study on the relighting capability of variants of the vanilla NeRF architecture and identify a lightweight architecture that can render novel views of an object under novel light conditions, which we use to establish a non-trivial baseline for the dataset.

Image Relighting Novel View Synthesis

Paper
Add Code

Exploiting the Complementarity of 2D and 3D Networks to Address Domain-Shift in 3D Semantic Segmentation

1 code implementation • 6 Apr 2023 • Adriano Cardace, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano

3D semantic segmentation is a critical task in many real-world applications, such as autonomous driving, robotics, and mixed reality.

3D Semantic Segmentation Autonomous Driving +4

Paper
Code

Deep Learning on Implicit Neural Representations of Shapes

no code implementations • 10 Feb 2023 • Luca De Luigi, Adriano Cardace, Riccardo Spezialetti, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano

Implicit Neural Representations (INRs) have emerged in the last few years as a powerful tool to encode continuously a variety of different signals like images, videos, audio and 3D shapes.

Paper
Add Code

Learning Good Features to Transfer Across Tasks and Domains

no code implementations • 26 Jan 2023 • Pierluigi Zama Ramirez, Adriano Cardace, Luca De Luigi, Alessio Tonioni, Samuele Salti, Luigi Di Stefano

Besides, we propose a set of strategies to constrain the learned feature spaces, to ease learning and increase the generalization capability of the mapping network, thereby considerably improving the final performance of our framework.

Monocular Depth Estimation Semantic Segmentation

Paper
Add Code

Booster: a Benchmark for Depth from Images of Specular and Transparent Surfaces

no code implementations • 19 Jan 2023 • Pierluigi Zama Ramirez, Alex Costanzino, Fabio Tosi, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

Estimating depth from images nowadays yields outstanding results, both in terms of in-domain accuracy and generalization.

Monocular Depth Estimation

Paper
Add Code

ScanNeRF: a Scalable Benchmark for Neural Radiance Fields

no code implementations • 24 Nov 2022 • Luca De Luigi, Damiano Bolognini, Federico Domeniconi, Daniele De Gregorio, Matteo Poggi, Luigi Di Stefano

In this paper, we propose the first-ever real benchmark thought for evaluating Neural Radiance Fields (NeRFs) and, in general, Neural Rendering (NR) frameworks.

Benchmarking Neural Rendering

Paper
Add Code

Self-Distillation for Unsupervised 3D Domain Adaptation

no code implementations • 15 Oct 2022 • Adriano Cardace, Riccardo Spezialetti, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano

In contrast, in this work, we focus on obtaining a discriminative feature space for the target domain enforcing consistency between a point cloud and its augmented version.

Classification Point Cloud Classification +2

Paper
Add Code

Cross-Spectral Neural Radiance Fields

no code implementations • 1 Sep 2022 • Matteo Poggi, Pierluigi Zama Ramirez, Fabio Tosi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

We propose X-NeRF, a novel method to learn a Cross-Spectral scene representation given images captured from cameras with different light spectrum sensitivity, based on the Neural Radiance Fields formulation.

Paper
Add Code

RGB-Multispectral Matching: Dataset, Learning Methodology, Evaluation

no code implementations • CVPR 2022 • Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

We address the problem of registering synchronized color (RGB) and multi-spectral (MS) images featuring very different resolution by solving stereo matching correspondences.

Stereo Matching

Paper
Add Code

Learning the Space of Deep Models

1 code implementation • 10 Jun 2022 • Gianluca Berardi, Luca De Luigi, Samuele Salti, Luigi Di Stefano

In particular, we show that it is possible to use representation learning to learn a fixed-size, low-dimensional embedding space of trained deep models and that such space can be explored by interpolation or optimization to attain ready-to-use models.

Image Classification Representation Learning

Paper
Code

Open Challenges in Deep Stereo: the Booster Dataset

no code implementations • CVPR 2022 • Pierluigi Zama Ramirez, Fabio Tosi, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

We present a novel high-resolution and challenging stereo dataset framing indoor scenes annotated with dense and accurate ground-truth disparities.

Paper
Add Code

Neural Disparity Refinement for Arbitrary Resolution Stereo

1 code implementation • 28 Oct 2021 • Filippo Aleotti, Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

We introduce a novel architecture for neural disparity refinement aimed at facilitating deployment of 3D computer vision on cheap and widespread consumer devices, such as mobile phones.

Zero-shot Generalization

Paper
Code

RefRec: Pseudo-labels Refinement via Shape Reconstruction for Unsupervised 3D Domain Adaptation

1 code implementation • 21 Oct 2021 • Adriano Cardace, Riccardo Spezialetti, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano

Unsupervised Domain Adaptation (UDA) for point cloud classification is an emerging research problem with relevant practical motivations.

Multi-Task Learning Point Cloud Classification +1

Paper
Code

Plugging Self-Supervised Monocular Depth into Unsupervised Domain Adaptation for Semantic Segmentation

1 code implementation • 13 Oct 2021 • Adriano Cardace, Luca De Luigi, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano

We further rely on depth to generate a large and varied set of samples to Self-Train the final model.

Autonomous Driving Monocular Depth Estimation +3

Paper
Code

Shallow Features Guide Unsupervised Domain Adaptation for Semantic Segmentation at Class Boundaries

1 code implementation • 6 Oct 2021 • Adriano Cardace, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano

Although deep neural networks have achieved remarkable results for the task of semantic segmentation, they usually fail to generalize towards new domains, especially when performing synthetic-to-real adaptation.

Data Augmentation Segmentation +2

Paper
Code

Effective Deployment of CNNs for 3DoF Pose Estimation and Grasping in Industrial Settings

1 code implementation • 24 Dec 2020 • Daniele De Gregorio, Riccardo Zanella, Gianluca Palli, Luigi Di Stefano

In this paper we investigate how to effectively deploy deep learning in practical industrial settings, such as robotic grasping applications.

Pose Estimation Robotic Grasping

Paper
Code

SAFFIRE: System for Autonomous Feature Filtering and Intelligent ROI Estimation

no code implementations • 4 Dec 2020 • Marco Boschi, Luigi Di Stefano, Martino Alessandrini

The framework is specialized here in the context of a machine vision system for automated product inspection.

Paper
Add Code

Learning to Orient Surfaces by Self-supervised Spherical CNNs

1 code implementation • NeurIPS 2020 • Riccardo Spezialetti, Federico Stella, Marlon Marcon, Luciano Silva, Samuele Salti, Luigi Di Stefano

In this work, we show the feasibility of learning a robust canonical orientation for surfaces represented as point clouds.

Paper
Code

Continual Adaptation for Deep Stereo

1 code implementation • 10 Jul 2020 • Matteo Poggi, Alessio Tonioni, Fabio Tosi, Stefano Mattoccia, Luigi Di Stefano

Thus, our network architecture and adaptation algorithms realize the first real-time self-adaptive deep stereo system and pave the way for a new paradigm that can facilitate practical deployment of end-to-end architectures for dense disparity regression.

Depth Estimation

416

Paper
Code

Distilled Semantics for Comprehensive Scene Understanding from Videos

1 code implementation • CVPR 2020 • Fabio Tosi, Filippo Aleotti, Pierluigi Zama Ramirez, Matteo Poggi, Samuele Salti, Luigi Di Stefano, Stefano Mattoccia

Whole understanding of the surroundings is paramount to autonomous systems.

Knowledge Distillation Monocular Depth Estimation +3

Paper
Code

Ambiguity in Sequential Data: Predicting Uncertain Futures with Recurrent Models

no code implementations • 10 Mar 2020 • Alessandro Berlati, Oliver Scheel, Luigi Di Stefano, Federico Tombari

Ambiguity is inherently present in many machine learning tasks, but especially for sequential models seldom accounted for, as most only output a single prediction.

Time Series Time Series Analysis +1

Paper
Add Code

Boosting Object Recognition in Point Clouds by Saliency Detection

no code implementations • 6 Nov 2019 • Marlon Marcon, Riccardo Spezialetti, Samuele Salti, Luciano Silva, Luigi Di Stefano

Object recognition in 3D point clouds is a challenging task, mainly when time is an important factor to deal with, such as in industrial applications.

Object Recognition Saliency Detection

Paper
Add Code

Shooting Labels: 3D Semantic Labeling by Virtual Reality

1 code implementation • 11 Oct 2019 • Pierluigi Zama Ramirez, Claudio Paternesi, Luca De Luigi, Luigi Lella, Daniele De Gregorio, Luigi Di Stefano

Availability of a few, large-size, annotated datasets, like ImageNet, Pascal VOC and COCO, has lead deep learning to revolutionize computer vision research by achieving astonishing results in several vision tasks. We argue that new tools to facilitate generation of annotated datasets may help spreading data-driven AI throughout applications and domains.

3D Semantic Segmentation

Paper
Code

Learning an Effective Equivariant 3D Descriptor Without Supervision

no code implementations • ICCV 2019 • Riccardo Spezialetti, Samuele Salti, Luigi Di Stefano

Establishing correspondences between 3D shapes is a fundamental task in 3D Computer Vision, typically addressed by matching local descriptors.

Paper
Add Code

Performance Evaluation of Learned 3D Features

no code implementations • 15 Sep 2019 • Riccardo Spezialetti, Samuele Salti, Luigi Di Stefano

Matching surfaces is a challenging 3D Computer Vision problem typically addressed by local features.

Object Recognition

Paper
Add Code

Unsupervised Domain Adaptation for Depth Prediction from Images

1 code implementation • 9 Sep 2019 • Alessio Tonioni, Matteo Poggi, Stefano Mattoccia, Luigi Di Stefano

Extensive experimental results based on standard datasets and evaluation protocols prove that our technique can address effectively the domain shift issue with both stereo and monocular depth prediction architectures and outperforms other state-of-the-art unsupervised loss functions that may be alternatively deployed to pursue domain adaptation.

Depth Estimation Depth Prediction +1

Paper
Code

Semi-Automatic Labeling for Deep Learning in Robotics

1 code implementation • 5 Aug 2019 • Daniele De Gregorio, Alessio Tonioni, Gianluca Palli, Luigi Di Stefano

In this paper, we propose Augmented Reality Semi-automatic labeling (ARS), a semi-automatic method which leverages on moving a 2D camera by means of a robot, proving precise camera tracking, and an augmented reality pen to define initial object bounding box, to create large labeled datasets with minimal human intervention.

Object object-detection +1

Paper
Code

Real-Time Highly Accurate Dense Depth on a Power Budget using an FPGA-CPU Hybrid SoC

no code implementations • 17 Jul 2019 • Oscar Rahnama, Tommaso Cavallari, Stuart Golodetz, Alessio Tonioni, Thomas Joy, Luigi Di Stefano, Simon Walker, Philip H. S. Torr

Obtaining highly accurate depth from stereo images in real time has many applications across computer vision and robotics, but in some contexts, upper bounds on power consumption constrain the feasible hardware to embedded platforms such as FPGAs.

Paper
Add Code

Learning Across Tasks and Domains

2 code implementations • ICCV 2019 • Pierluigi Zama Ramirez, Alessio Tonioni, Samuele Salti, Luigi Di Stefano

Recent works have proven that many relevant visual tasks are closely related one to another.

Domain Adaptation Monocular Depth Estimation +1

Paper
Code

Learning to Adapt for Stereo

1 code implementation • CVPR 2019 • Alessio Tonioni, Oscar Rahnama, Thomas Joy, Luigi Di Stefano, Thalaiyasingam Ajanthan, Philip H. S. Torr

Real world applications of stereo depth estimation require models that are robust to dynamic variations in the environment.

Autonomous Driving Stereo Depth Estimation

Paper
Code

Domain invariant hierarchical embedding for grocery products recognition

no code implementations • 2 Feb 2019 • Alessio Tonioni, Luigi Di Stefano

Moreover, there exist a significant domain shift between the images that should be recognized at test time, taken in stores by cheap cameras, and those available for training, usually just one or a few studio-quality images per product.

Paper
Add Code

Real-Time RGB-D Camera Pose Estimation in Novel Scenes using a Relocalisation Cascade

1 code implementation • 29 Oct 2018 • Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, Victor A. Prisacariu, Luigi Di Stefano, Philip H. S. Torr

The adapted forests achieved relocalisation performance that was on par with that of offline forests, and our approach was able to estimate the camera pose in close to real time.

Pose Estimation

190

Paper
Code

Exploiting Semantics in Adversarial Training for Image-Level Domain Adaptation

no code implementations • 13 Oct 2018 • Pierluigi Zama Ramirez, Alessio Tonioni, Luigi Di Stefano

To prove the effectiveness of our proposal, we show how a semantic segmentation CNN trained on images from the synthetic GTA dataset adapted by our method can improve performance by more than 16% mIoU with respect to the same model trained on synthetic images.

Domain Adaptation Segmentation +2

Paper
Add Code

Real-time self-adaptive deep stereo

1 code implementation • CVPR 2019 • Alessio Tonioni, Fabio Tosi, Matteo Poggi, Stefano Mattoccia, Luigi Di Stefano

Deep convolutional neural networks trained end-to-end are the state-of-the-art methods to regress dense disparity maps from stereo pairs.

Stereo Depth Estimation

416

Paper
Code

Let's take a Walk on Superpixels Graphs: Deformable Linear Objects Segmentation and Model Estimation

2 code implementations • 10 Oct 2018 • Daniele De Gregorio, Gianluca Palli, Luigi Di Stefano

While robotic manipulation of rigid objects is quite straightforward, coping with deformable objects is an open issue.

Pose Estimation Semantic Segmentation +1

Paper
Code

Geometry meets semantics for semi-supervised monocular depth estimation

1 code implementation • 9 Oct 2018 • Pierluigi Zama Ramirez, Matteo Poggi, Fabio Tosi, Stefano Mattoccia, Luigi Di Stefano

For unsupervised training of these models, geometry has been effectively exploited by suitable images warping losses computed from views acquired by a stereo rig or a moving camera.

Depth Prediction Monocular Depth Estimation +1

108

Paper
Code

A deep learning pipeline for product recognition on store shelves

no code implementations • 3 Oct 2018 • Alessio Tonioni, Eugenio Serra, Luigi Di Stefano

Then, available product databases usually include just one or a few studio-quality images per product (referred to herein as reference images), whilst at test time recognition is performed on pictures displaying a portion of a shelf containing several products and taken in the store by cheap cameras (referred to as query images).

Image Retrieval object-detection +2

Paper
Add Code

Unsupervised Adaptation for Deep Stereo

1 code implementation • ICCV 2017 • Alessio Tonioni, Matteo Poggi, Stefano Mattoccia, Luigi Di Stefano

Recent ground-breaking works have shown that deep neural networks can be trained end-to-end to regress dense disparity maps directly from image pairs.

Paper
Code

Product recognition in store shelves as a sub-graph isomorphism problem

no code implementations • 26 Jul 2017 • Alessio Tonioni, Luigi Di Stefano

The arrangement of products in store shelves is carefully planned to maximize sales and keep customers happy.

Paper
Add Code

SkiMap: An Efficient Mapping Framework for Robot Navigation

1 code implementation • 19 Apr 2017 • Daniele De Gregorio, Luigi Di Stefano

We present a novel mapping framework for robot navigation which features a multi-level querying system capable to obtain rapidly representations as diverse as a 3D voxel grid, a 2. 5D height map and a 2D occupancy grid.

Robot Navigation

266

Paper
Code

On-the-Fly Adaptation of Regression Forests for Online Camera Relocalisation

no code implementations • CVPR 2017 • Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, Luigi Di Stefano, Philip H. S. Torr

Camera relocalisation is an important problem in computer vision, with applications in simultaneous localisation and mapping, virtual/augmented reality and navigation.

Camera Relocalization regression

Paper
Add Code

Learning a Descriptor-Specific 3D Keypoint Detector

no code implementations • ICCV 2015 • Samuele Salti, Federico Tombari, Riccardo Spezialetti, Luigi Di Stefano

Keypoint detection represents the first stage in the majority of modern computer vision pipelines based on automatically established correspondences between local descriptors.

Binary Classification Keypoint Detection

Paper
Add Code

Volume-based Semantic Labeling with Signed Distance Functions

no code implementations • 13 Nov 2015 • Tommaso Cavallari, Luigi Di Stefano

Research works on the two topics of Semantic Segmentation and SLAM (Simultaneous Localization and Mapping) have been following separate tracks.

Segmentation Semantic Segmentation +1

Paper
Add Code

Large-Scale and Drift-Free Surface Reconstruction Using Online Subvolume Registration

no code implementations • CVPR 2015 • Nicola Fioraio, Jonathan Taylor, Andrew Fitzgibbon, Luigi Di Stefano, Shahram Izadi

Our method supports online model correction, without needing to reprocess or store any input depth data.

Pose Estimation Surface Reconstruction

Paper
Add Code

Keypoints from Symmetries by Wave Propagation

no code implementations • CVPR 2013 • Samuele Salti, Alessandro Lanza, Luigi Di Stefano

The paper conjectures and demonstrates that repeatable keypoints based on salient symmetries at different scales can be detected by a novel analysis grounded on the wave equation rather than the heat equation underlying traditional Gaussian scale-space theory.

Paper
Add Code

Joint Detection, Tracking and Mapping by Semantic Bundle Adjustment

no code implementations • CVPR 2013 • Nicola Fioraio, Luigi Di Stefano

In this paper we propose a novel Semantic Bundle Adjustment framework whereby known rigid stationary objects are detected while tracking the camera and mapping the environment.

3D Reconstruction Object +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.