Search Results for author: Carlo Masone

Found 23 papers, 16 papers with code

The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement

no code implementations16 Apr 2024 Gabriele Trivigno, Carlo Masone, Barbara Caputo, Torsten Sattler

This involves training an implicit scene representation or learning features while optimizing a camera pose-based loss.

Retrieval

JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition

1 code implementation28 Mar 2024 Gabriele Berton, Gabriele Trivigno, Barbara Caputo, Carlo Masone

As a mitigation to this problem, we propose a novel Joint Image and Sequence Training protocol (JIST) that leverages large uncurated sets of images through a multi-task learning framework.

Multi-Task Learning Visual Place Recognition

Segmentation Re-thinking Uncertainty Estimation Metrics for Semantic Segmentation

no code implementations28 Mar 2024 Qitian Ma, Shyam Nanda Rai, Carlo Masone, Tatiana Tommasi

In the domain of computer vision, semantic segmentation emerges as a fundamental application within machine learning, wherein individual pixels of an image are classified into distinct semantic categories.

Decision Making Image Segmentation +3

EarthLoc: Astronaut Photography Localization by Indexing Earth from Space

1 code implementation11 Mar 2024 Gabriele Berton, Alex Stoken, Barbara Caputo, Carlo Masone

Astronaut photography, spanning six decades of human spaceflight, presents a unique Earth observations dataset with immense value for both scientific research and disaster response.

Data Augmentation Disaster Response +2

Communication-Efficient Heterogeneous Federated Learning with Generalized Heavy-Ball Momentum

no code implementations30 Nov 2023 Riccardo Zaccone, Carlo Masone, Marco Ciccone

Federated Learning (FL) is the state-of-the-art approach for learning from decentralized data in privacy-constrained scenarios.

Federated Learning

Mask2Anomaly: Mask Transformer for Universal Open-set Segmentation

no code implementations8 Sep 2023 Shyam Nandan Rai, Fabio Cermelli, Barbara Caputo, Carlo Masone

Segmenting unknown or anomalous object instances is a critical task in autonomous driving applications, and it is approached traditionally as a per-pixel classification problem.

Autonomous Driving Classification +3

Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition

1 code implementation17 Jul 2023 Gabriele Trivigno, Gabriele Berton, Juan Aragon, Barbara Caputo, Carlo Masone

Our method, Divide&Classify (D&C), enjoys the fast inference of classification solutions and an accuracy competitive with retrieval methods on the fine-grained, city-wide setting.

Image Retrieval Retrieval +1

Are Local Features All You Need for Cross-Domain Visual Place Recognition?

1 code implementation12 Apr 2023 Giovanni Barbarani, Mohamad Mostafa, Hajali Bayramov, Gabriele Trivigno, Gabriele Berton, Carlo Masone, Barbara Caputo

Despite recent advances, recognizing the same place when the query comes from a significantly different distribution is still a major hurdle for state of the art retrieval methods.

Re-Ranking Retrieval +1

Divide&Classify: Fine-Grained Classification for City-Wide Visual Geo-Localization

1 code implementation ICCV 2023 Gabriele Trivigno, Gabriele Berton, Juan Aragon, Barbara Caputo, Carlo Masone

In this paper we investigate whether we can effectively approach this task as a classification problem, thus bypassing the need for a similarity search.

Image Retrieval Retrieval +1

Hierarchical Instance Mixing across Domains in Aerial Segmentation

no code implementations12 Oct 2022 Edoardo Arnaudo, Antonio Tavera, Fabrizio Dominici, Carlo Masone, Barbara Caputo

We investigate the task of unsupervised domain adaptation in aerial semantic segmentation and discover that the current state-of-the-art algorithms designed for autonomous driving based on domain mixing do not translate well to the aerial setting.

Autonomous Driving Segmentation +2

Learning Sequential Descriptors for Sequence-based Visual Place Recognition

1 code implementation8 Jul 2022 Riccardo Mereu, Gabriele Trivigno, Gabriele Berton, Carlo Masone, Barbara Caputo

In robotics, Visual Place Recognition is a continuous process that receives as input a video stream to produce a hypothesis of the robot's current position within a map of known places.

Position Visual Place Recognition

Augmentation Invariance and Adaptive Sampling in Semantic Segmentation of Agricultural Aerial Images

1 code implementation17 Apr 2022 Antonio Tavera, Edoardo Arnaudo, Carlo Masone, Barbara Caputo

We observe that the existing methods used for this task are designed without considering two characteristics of the aerial data: (i) the top-down perspective implies that the model cannot rely on a fixed semantic structure of the scene, because the same scene may be experienced with different rotations of the sensor; (ii) there can be a strong imbalance in the distribution of semantic classes because the relevant objects of the scene may appear at extremely different scales (e. g., a field of crops and a small vehicle).

Semantic Segmentation

Deep Visual Geo-localization Benchmark

1 code implementation CVPR 2022 Gabriele Berton, Riccardo Mereu, Gabriele Trivigno, Carlo Masone, Gabriela Csurka, Torsten Sattler, Barbara Caputo

In this paper, we propose a new open-source benchmarking framework for Visual Geo-localization (VG) that allows to build, train, and test a wide range of commonly used architectures, with the flexibility to change individual components of a geo-localization pipeline.

Benchmarking Data Augmentation

Rethinking Visual Geo-localization for Large-Scale Applications

2 code implementations CVPR 2022 Gabriele Berton, Carlo Masone, Barbara Caputo

Visual Geo-localization (VG) is the task of estimating the position where a given photo was taken by comparing it with a large database of images of known locations.

Contrastive Learning Image Classification +2

Learning Semantics for Visual Place Recognition through Multi-Scale Attention

1 code implementation24 Jan 2022 Valerio Paolicelli, Antonio Tavera, Carlo Masone, Gabriele Berton, Barbara Caputo

In this paper we address the task of visual place recognition (VPR), where the goal is to retrieve the correct GPS coordinates of a given query image against a huge geotagged gallery.

Segmentation Visual Place Recognition

Pixel-by-Pixel Cross-Domain Alignment for Few-Shot Semantic Segmentation

1 code implementation22 Oct 2021 Antonio Tavera, Fabio Cermelli, Carlo Masone, Barbara Caputo

The pixel-wise adversarial training is assisted by a novel sample selection procedure, that handles the imbalance between source and target data, and a knowledge distillation strategy, that avoids overfitting towards the few target images.

Autonomous Driving Cross-Domain Few-Shot +3

Reimagine BiSeNet for Real-Time Domain Adaptation in Semantic Segmentation

1 code implementation22 Oct 2021 Antonio Tavera, Carlo Masone, Barbara Caputo

To the best of our knowledge, we are the first to present a real-time adversarial approach for assessing the domain adaption problem in semantic segmentation.

Domain Adaptation Real-Time Semantic Segmentation +1

Viewpoint Invariant Dense Matching for Visual Geolocalization

1 code implementation ICCV 2021 Gabriele Berton, Carlo Masone, Valerio Paolicelli, Barbara Caputo

Dense local features matching is robust against changes in illumination and occlusions, but not against viewpoint shifts which are a fundamental aspect of geolocalization.

Re-Ranking Retrieval

Adaptive-Attentive Geolocalization from few queries: a hybrid approach

2 code implementations14 Oct 2020 Gabriele Moreno Berton, Valerio Paolicelli, Carlo Masone, Barbara Caputo

We address the task of cross-domain visual place recognition, where the goal is to geolocalize a given query image against a labeled gallery, in the case where the query and the gallery belong to different visual domains.

Unsupervised Domain Adaptation Visual Place Recognition

IDDA: a large-scale multi-domain dataset for autonomous driving

no code implementations17 Apr 2020 Emanuele Alberti, Antonio Tavera, Carlo Masone, Barbara Caputo

To support work in this direction, this paper contributes a new large scale, synthetic dataset for semantic segmentation with more than 100 different source visual domains.

Autonomous Driving Domain Adaptation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.