no code implementations • 10 Feb 2025 • Gabriele Berton, Alex Stoken, Carlo Masone
We first produce full localization information for 300, 000 manually weakly labeled astronaut photos through an automated pipeline, and then use these images to train a model, called AstroLoc.
1 code implementation • 26 Nov 2024 • Claudia Cuttano, Gabriele Trivigno, Gabriele Rosi, Carlo Masone, Giuseppe Averta
Referring Video Object Segmentation (RVOS) relies on natural language expressions to segment an object in a video clip.
Ranked #4 on
Referring Video Object Segmentation
on MeViS
Natural Language Understanding
Referring Video Object Segmentation
+3
no code implementations • 4 Jun 2024 • Gabriele Berton, Lorenz Junglas, Riccardo Zaccone, Thomas Pollok, Barbara Caputo, Carlo Masone
Mesh-based scene representation offers a promising direction for simplifying large-scale hierarchical visual localization pipelines, combining a visual place recognition step based on global features (retrieval) and a visual localization step based on local features.
1 code implementation • 3 Jun 2024 • Giovanni Barbarani, Francesco Vaccarino, Gabriele Trivigno, Marco Guerra, Gabriele Berton, Carlo Masone
In computer vision, keypoint detection is a fundamental task, with applications spanning from robotics to image retrieval; however, existing learning-based methods suffer from scale dependency and lack flexibility.
1 code implementation • 8 May 2024 • Gabriele Berton, Gabriele Goletto, Gabriele Trivigno, Alex Stoken, Barbara Caputo, Carlo Masone
Precise, pixel-wise geolocalization of astronaut photography is critical to unlocking the potential of this unique type of remotely sensed Earth data, particularly for its use in disaster management and climate change research.
no code implementations • 20 Apr 2024 • Mattia Dutto, Gabriele Berton, Debora Caldarola, Eros Fanì, Gabriele Trivigno, Carlo Masone
Visual Place Recognition (VPR) aims to estimate the location of an image by treating it as a retrieval problem.
no code implementations • CVPR 2024 • Gabriele Trivigno, Carlo Masone, Barbara Caputo, Torsten Sattler
This involves training an implicit scene representation or learning features while optimizing a camera pose-based loss.
no code implementations • 28 Mar 2024 • Qitian Ma, Shyam Nanda Rai, Carlo Masone, Tatiana Tommasi
In the domain of computer vision, semantic segmentation emerges as a fundamental application within machine learning, wherein individual pixels of an image are classified into distinct semantic categories.
1 code implementation • 28 Mar 2024 • Gabriele Berton, Gabriele Trivigno, Barbara Caputo, Carlo Masone
As a mitigation to this problem, we propose a novel Joint Image and Sequence Training protocol (JIST) that leverages large uncurated sets of images through a multi-task learning framework.
1 code implementation • CVPR 2024 • Gabriele Berton, Alex Stoken, Barbara Caputo, Carlo Masone
Astronaut photography, spanning six decades of human spaceflight, presents a unique Earth observations dataset with immense value for both scientific research and disaster response.
no code implementations • 30 Nov 2023 • Riccardo Zaccone, Carlo Masone, Marco Ciccone
Federated Learning (FL) has emerged as the state-of-the-art approach for learning from decentralized data in privacy-constrained scenarios.
no code implementations • 27 Sep 2023 • Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli, Carlo Masone, Andrea Pilzer, Elisa Ricci, Andrei Bursuc, Arno Solin, Martin Trapp, Rui Li, Angela Yao, Wenlong Chen, Ivor Simpson, Neill D. F. Campbell, Gianni Franchi
This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023.
no code implementations • 8 Sep 2023 • Shyam Nandan Rai, Fabio Cermelli, Barbara Caputo, Carlo Masone
Segmenting unknown or anomalous object instances is a critical task in autonomous driving applications, and it is approached traditionally as a per-pixel classification problem.
4 code implementations • ICCV 2023 • Gabriele Berton, Gabriele Trivigno, Barbara Caputo, Carlo Masone
Visual Place Recognition is a task that aims to predict the place of an image (called query) based solely on its visual features.
1 code implementation • ICCV 2023 • Shyam Nandan Rai, Fabio Cermelli, Dario Fontanel, Carlo Masone, Barbara Caputo
We propose a paradigm change by shifting from a per-pixel classification to a mask classification.
Ranked #1 on
Scene Segmentation
on StreetHazards
(using extra training data)
1 code implementation • 17 Jul 2023 • Gabriele Trivigno, Gabriele Berton, Juan Aragon, Barbara Caputo, Carlo Masone
Our method, Divide&Classify (D&C), enjoys the fast inference of classification solutions and an accuracy competitive with retrieval methods on the fine-grained, city-wide setting.
1 code implementation • 12 Apr 2023 • Giovanni Barbarani, Mohamad Mostafa, Hajali Bayramov, Gabriele Trivigno, Gabriele Berton, Carlo Masone, Barbara Caputo
Despite recent advances, recognizing the same place when the query comes from a significantly different distribution is still a major hurdle for state of the art retrieval methods.
1 code implementation • ICCV 2023 • Gabriele Trivigno, Gabriele Berton, Juan Aragon, Barbara Caputo, Carlo Masone
In this paper we investigate whether we can effectively approach this task as a classification problem, thus bypassing the need for a similarity search.
no code implementations • 12 Oct 2022 • Edoardo Arnaudo, Antonio Tavera, Fabrizio Dominici, Carlo Masone, Barbara Caputo
We investigate the task of unsupervised domain adaptation in aerial semantic segmentation and discover that the current state-of-the-art algorithms designed for autonomous driving based on domain mixing do not translate well to the aerial setting.
1 code implementation • 8 Jul 2022 • Riccardo Mereu, Gabriele Trivigno, Gabriele Berton, Carlo Masone, Barbara Caputo
In robotics, Visual Place Recognition is a continuous process that receives as input a video stream to produce a hypothesis of the robot's current position within a map of known places.
1 code implementation • 17 Apr 2022 • Antonio Tavera, Edoardo Arnaudo, Carlo Masone, Barbara Caputo
We observe that the existing methods used for this task are designed without considering two characteristics of the aerial data: (i) the top-down perspective implies that the model cannot rely on a fixed semantic structure of the scene, because the same scene may be experienced with different rotations of the sensor; (ii) there can be a strong imbalance in the distribution of semantic classes because the relevant objects of the scene may appear at extremely different scales (e. g., a field of crops and a small vehicle).
1 code implementation • CVPR 2022 • Gabriele Berton, Riccardo Mereu, Gabriele Trivigno, Carlo Masone, Gabriela Csurka, Torsten Sattler, Barbara Caputo
In this paper, we propose a new open-source benchmarking framework for Visual Geo-localization (VG) that allows to build, train, and test a wide range of commonly used architectures, with the flexibility to change individual components of a geo-localization pipeline.
2 code implementations • CVPR 2022 • Gabriele Berton, Carlo Masone, Barbara Caputo
Visual Geo-localization (VG) is the task of estimating the position where a given photo was taken by comparing it with a large database of images of known locations.
Ranked #2 on
Visual Place Recognition
on Nardo-Air R
1 code implementation • 24 Jan 2022 • Valerio Paolicelli, Antonio Tavera, Carlo Masone, Gabriele Berton, Barbara Caputo
In this paper we address the task of visual place recognition (VPR), where the goal is to retrieve the correct GPS coordinates of a given query image against a huge geotagged gallery.
1 code implementation • 22 Oct 2021 • Antonio Tavera, Fabio Cermelli, Carlo Masone, Barbara Caputo
The pixel-wise adversarial training is assisted by a novel sample selection procedure, that handles the imbalance between source and target data, and a knowledge distillation strategy, that avoids overfitting towards the few target images.
1 code implementation • 22 Oct 2021 • Antonio Tavera, Carlo Masone, Barbara Caputo
To the best of our knowledge, we are the first to present a real-time adversarial approach for assessing the domain adaption problem in semantic segmentation.
1 code implementation • ICCV 2021 • Gabriele Berton, Carlo Masone, Valerio Paolicelli, Barbara Caputo
Dense local features matching is robust against changes in illumination and occlusions, but not against viewpoint shifts which are a fundamental aspect of geolocalization.
2 code implementations • 14 Oct 2020 • Gabriele Moreno Berton, Valerio Paolicelli, Carlo Masone, Barbara Caputo
We address the task of cross-domain visual place recognition, where the goal is to geolocalize a given query image against a labeled gallery, in the case where the query and the gallery belong to different visual domains.
no code implementations • 17 Apr 2020 • Emanuele Alberti, Antonio Tavera, Carlo Masone, Barbara Caputo
To support work in this direction, this paper contributes a new large scale, synthetic dataset for semantic segmentation with more than 100 different source visual domains.