Search Results for author: Carlo Masone

Found 23 papers, 16 papers with code

The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement

no code implementations • 16 Apr 2024 • Gabriele Trivigno, Carlo Masone, Barbara Caputo, Torsten Sattler

This involves training an implicit scene representation or learning features while optimizing a camera pose-based loss.

Paper
Add Code

JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition

1 code implementation • 28 Mar 2024 • Gabriele Berton, Gabriele Trivigno, Barbara Caputo, Carlo Masone

As a mitigation to this problem, we propose a novel Joint Image and Sequence Training protocol (JIST) that leverages large uncurated sets of images through a multi-task learning framework.

Multi-Task Learning Visual Place Recognition

Paper
Code

Segmentation Re-thinking Uncertainty Estimation Metrics for Semantic Segmentation

no code implementations • 28 Mar 2024 • Qitian Ma, Shyam Nanda Rai, Carlo Masone, Tatiana Tommasi

In the domain of computer vision, semantic segmentation emerges as a fundamental application within machine learning, wherein individual pixels of an image are classified into distinct semantic categories.

Decision Making Image Segmentation +3

Paper
Add Code

EarthLoc: Astronaut Photography Localization by Indexing Earth from Space

1 code implementation • 11 Mar 2024 • Gabriele Berton, Alex Stoken, Barbara Caputo, Carlo Masone

Astronaut photography, spanning six decades of human spaceflight, presents a unique Earth observations dataset with immense value for both scientific research and disaster response.

Data Augmentation Disaster Response +2

Paper
Code

Communication-Efficient Heterogeneous Federated Learning with Generalized Heavy-Ball Momentum

no code implementations • 30 Nov 2023 • Riccardo Zaccone, Carlo Masone, Marco Ciccone

Federated Learning (FL) is the state-of-the-art approach for learning from decentralized data in privacy-constrained scenarios.

Federated Learning

Paper
Add Code

The Robust Semantic Segmentation UNCV2023 Challenge Results

no code implementations • 27 Sep 2023 • Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli, Carlo Masone, Andrea Pilzer, Elisa Ricci, Andrei Bursuc, Arno Solin, Martin Trapp, Rui Li, Angela Yao, Wenlong Chen, Ivor Simpson, Neill D. F. Campbell, Gianni Franchi

This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023.

Autonomous Driving Segmentation +2

Paper
Add Code

Mask2Anomaly: Mask Transformer for Universal Open-set Segmentation

no code implementations • 8 Sep 2023 • Shyam Nandan Rai, Fabio Cermelli, Barbara Caputo, Carlo Masone

Segmenting unknown or anomalous object instances is a critical task in autonomous driving applications, and it is approached traditionally as a per-pixel classification problem.

Autonomous Driving Classification +3

Paper
Add Code

EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition

3 code implementations • ICCV 2023 • Gabriele Berton, Gabriele Trivigno, Barbara Caputo, Carlo Masone

Visual Place Recognition is a task that aims to predict the place of an image (called query) based solely on its visual features.

Ranked #1 on Visual Place Recognition on SF-XL test v1

Image Retrieval Retrieval +1

122

Paper
Code

Unmasking Anomalies in Road-Scene Segmentation

1 code implementation • ICCV 2023 • Shyam Nandan Rai, Fabio Cermelli, Dario Fontanel, Carlo Masone, Barbara Caputo

We propose a paradigm change by shifting from a per-pixel classification to a mask classification.

Ranked #1 on Scene Segmentation on StreetHazards (using extra training data)

Anomaly Detection Classification +3

Paper
Code

Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition

1 code implementation • 17 Jul 2023 • Gabriele Trivigno, Gabriele Berton, Juan Aragon, Barbara Caputo, Carlo Masone

Our method, Divide&Classify (D&C), enjoys the fast inference of classification solutions and an accuracy competitive with retrieval methods on the fine-grained, city-wide setting.

Image Retrieval Retrieval +1

Paper
Code

Are Local Features All You Need for Cross-Domain Visual Place Recognition?

1 code implementation • 12 Apr 2023 • Giovanni Barbarani, Mohamad Mostafa, Hajali Bayramov, Gabriele Trivigno, Gabriele Berton, Carlo Masone, Barbara Caputo

Despite recent advances, recognizing the same place when the query comes from a significantly different distribution is still a major hurdle for state of the art retrieval methods.

Re-Ranking Retrieval +1

Paper
Code

Divide&Classify: Fine-Grained Classification for City-Wide Visual Geo-Localization

1 code implementation • ICCV 2023 • Gabriele Trivigno, Gabriele Berton, Juan Aragon, Barbara Caputo, Carlo Masone

In this paper we investigate whether we can effectively approach this task as a classification problem, thus bypassing the need for a similarity search.

Image Retrieval Retrieval +1

Paper
Code

Hierarchical Instance Mixing across Domains in Aerial Segmentation

no code implementations • 12 Oct 2022 • Edoardo Arnaudo, Antonio Tavera, Fabrizio Dominici, Carlo Masone, Barbara Caputo

We investigate the task of unsupervised domain adaptation in aerial semantic segmentation and discover that the current state-of-the-art algorithms designed for autonomous driving based on domain mixing do not translate well to the aerial setting.

Autonomous Driving Segmentation +2

Paper
Add Code

Learning Sequential Descriptors for Sequence-based Visual Place Recognition

1 code implementation • 8 Jul 2022 • Riccardo Mereu, Gabriele Trivigno, Gabriele Berton, Carlo Masone, Barbara Caputo

In robotics, Visual Place Recognition is a continuous process that receives as input a video stream to produce a hypothesis of the robot's current position within a map of known places.

Position Visual Place Recognition

Paper
Code

Augmentation Invariance and Adaptive Sampling in Semantic Segmentation of Agricultural Aerial Images

1 code implementation • 17 Apr 2022 • Antonio Tavera, Edoardo Arnaudo, Carlo Masone, Barbara Caputo

We observe that the existing methods used for this task are designed without considering two characteristics of the aerial data: (i) the top-down perspective implies that the model cannot rely on a fixed semantic structure of the scene, because the same scene may be experienced with different rotations of the sensor; (ii) there can be a strong imbalance in the distribution of semantic classes because the relevant objects of the scene may appear at extremely different scales (e. g., a field of crops and a small vehicle).

Semantic Segmentation

Paper
Code

Deep Visual Geo-localization Benchmark

1 code implementation • CVPR 2022 • Gabriele Berton, Riccardo Mereu, Gabriele Trivigno, Carlo Masone, Gabriela Csurka, Torsten Sattler, Barbara Caputo

In this paper, we propose a new open-source benchmarking framework for Visual Geo-localization (VG) that allows to build, train, and test a wide range of commonly used architectures, with the flexibility to change individual components of a geo-localization pipeline.

Benchmarking Data Augmentation

151

Paper
Code

Rethinking Visual Geo-localization for Large-Scale Applications

2 code implementations • CVPR 2022 • Gabriele Berton, Carlo Masone, Barbara Caputo

Visual Geo-localization (VG) is the task of estimating the position where a given photo was taken by comparing it with a large database of images of known locations.

Ranked #1 on Visual Place Recognition on SF-XL test v2

Contrastive Learning Image Classification +2

256

Paper
Code

Learning Semantics for Visual Place Recognition through Multi-Scale Attention

1 code implementation • 24 Jan 2022 • Valerio Paolicelli, Antonio Tavera, Carlo Masone, Gabriele Berton, Barbara Caputo

In this paper we address the task of visual place recognition (VPR), where the goal is to retrieve the correct GPS coordinates of a given query image against a huge geotagged gallery.

Segmentation Visual Place Recognition

Paper
Code

Pixel-by-Pixel Cross-Domain Alignment for Few-Shot Semantic Segmentation

1 code implementation • 22 Oct 2021 • Antonio Tavera, Fabio Cermelli, Carlo Masone, Barbara Caputo

The pixel-wise adversarial training is assisted by a novel sample selection procedure, that handles the imbalance between source and target data, and a knowledge distillation strategy, that avoids overfitting towards the few target images.

Autonomous Driving Cross-Domain Few-Shot +3

Paper
Code

Reimagine BiSeNet for Real-Time Domain Adaptation in Semantic Segmentation

1 code implementation • 22 Oct 2021 • Antonio Tavera, Carlo Masone, Barbara Caputo

To the best of our knowledge, we are the first to present a real-time adversarial approach for assessing the domain adaption problem in semantic segmentation.

Domain Adaptation Real-Time Semantic Segmentation +1

Paper
Code

Viewpoint Invariant Dense Matching for Visual Geolocalization

1 code implementation • ICCV 2021 • Gabriele Berton, Carlo Masone, Valerio Paolicelli, Barbara Caputo

Dense local features matching is robust against changes in illumination and occlusions, but not against viewpoint shifts which are a fundamental aspect of geolocalization.

Re-Ranking Retrieval

Paper
Code

Adaptive-Attentive Geolocalization from few queries: a hybrid approach

2 code implementations • 14 Oct 2020 • Gabriele Moreno Berton, Valerio Paolicelli, Carlo Masone, Barbara Caputo

We address the task of cross-domain visual place recognition, where the goal is to geolocalize a given query image against a labeled gallery, in the case where the query and the gallery belong to different visual domains.

Unsupervised Domain Adaptation Visual Place Recognition

Paper
Code

IDDA: a large-scale multi-domain dataset for autonomous driving

no code implementations • 17 Apr 2020 • Emanuele Alberti, Antonio Tavera, Carlo Masone, Barbara Caputo

To support work in this direction, this paper contributes a new large scale, synthetic dataset for semantic segmentation with more than 100 different source visual domains.

Autonomous Driving Domain Adaptation +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.