no code implementations • 21 Dec 2024 • Javier Montalvo, Roberto Alcover-Couso, Pablo Carballeira, Álvaro García-Martín, Juan C. SanMiguel, Marcos Escudero-Viñolo
This paper introduces a novel synthetic dataset that captures urban scenes under a variety of weather conditions, providing pixel-perfect, ground-truth-aligned images to facilitate effective feature alignment across domains.
no code implementations • 12 Dec 2024 • Roberto Alcover-Couso, Marcos Escudero-Viñolo, Juan C. SanMiguel, Jesus Bescos
Segmentation models are typically constrained by the categories defined during training.
Open Vocabulary Semantic Segmentation Open-Vocabulary Semantic Segmentation +2
no code implementations • 12 Dec 2024 • Kirill Sirotkin, Marcos Escudero-Viñolo, Pablo Carballeira, Mayug Maniparambil, Catarina Barata, Noel E. O'Connor
When applied to the Conceptual Captions dataset for creating gender counterfactuals, our method results in higher visual and semantic fidelity than state-of-the-art alternatives, while maintaining the performance of models trained using only real data on non-human-centric tasks.
no code implementations • 24 Sep 2024 • Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Jose M Martínez
In this paper, we leverage the abundance of freely accessible trained models to introduce a cost-free approach to model merging.
1 code implementation • Applied Intelligence 2024 • Kirill Sirotkin, Marcos Escudero-Viñolo, Pablo Carballeira, Álvaro García-Martín
At a cost of extra training of only 0. 16% model parameters, in case of ResNet-50, we acquire a proxy task that (i) has a stronger correlation with end-to-end finetuned performance, (ii) improves the linear probing performance in the many- and few-shot learning regimes and (iii) in some cases, outperforms both linear probing and end-to-end finetuning, reaching the state-of-the-art performance on a pathology dataset.
Ranked #1 on Classification on MHIST (using extra training data)
no code implementations • 1 Jul 2024 • Roberto Alcover-Couso, Marcos Escudero-Viñolo, Juan C. SanMiguel, Jesus Bescós
In unsupervised domain adaptation (UDA), where models are trained on source data (e. g., synthetic) and adapted to target data (e. g., real-world) without target annotations, addressing the challenge of significant class imbalance remains an open issue.
no code implementations • 27 Sep 2023 • Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli, Carlo Masone, Andrea Pilzer, Elisa Ricci, Andrei Bursuc, Arno Solin, Martin Trapp, Rui Li, Angela Yao, Wenlong Chen, Ivor Simpson, Neill D. F. Campbell, Gianni Franchi
This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023.
no code implementations • 25 Jan 2023 • Iván de Andrés Tamé, Kirill Sirotkin, Pablo Carballeira, Marcos Escudero-Viñolo
Deep learning technologies have already demonstrated a high potential to build diagnosis support systems from medical imaging data, such as Chest X-Ray images.
no code implementations • 28 Nov 2022 • Elena Luna, Juan Carlos San Miguel, José María Martínez, Marcos Escudero-Viñolo
This letter focuses on the task of Multi-Target Multi-Camera vehicle tracking.
no code implementations • 4 May 2022 • Alejandro López-Cifuentes, Marcos Escudero-Viñolo, Jesús Bescós, Juan C. SanMiguel
Feature-based Knowledge Distillation is a subfield of KD that relies on intermediate network representations, either unaltered or depth-reduced via maximum activation maps, as the source knowledge.
no code implementations • CVPR 2022 • Kirill Sirotkin, Pablo Carballeira, Marcos Escudero-Viñolo
We show that there is a correlation between the type of the SSL model and the number of biases that it incorporates.
no code implementations • 8 Feb 2021 • Elena Luna, Juan C. SanMiguel, Jose M. Martínez, Marcos Escudero-Viñolo
Multi-Target Multi-Camera (MTMC) vehicle tracking is an essential task of visual traffic monitoring, one of the main research fields of Intelligent Transportation Systems.
no code implementations • 26 Aug 2020 • Alejandro López-Cifuentes, Marcos Escudero-Viñolo, Jesús Bescós
Action recognition is currently one of the top-challenging research fields in computer vision.
no code implementations • 25 May 2020 • Andrija Gajic, Ester Gonzalez-Sosa, Diego Gonzalez-Morin, Marcos Escudero-Viñolo, Alvaro Villegas
The objective of this work is to segment human body parts from egocentric video using semantic segmentation networks.
1 code implementation • 5 Sep 2019 • Alejandro López-Cifuentes, Marcos Escudero-Viñolo, Jesús Bescós, Álvaro García-Martín
Scene recognition is currently one of the top-challenging research fields in computer vision.
Ranked #1 on Scene Recognition on ADE20K
no code implementations • 27 Dec 2018 • Alejandro López-Cifuentes, Marcos Escudero-Viñolo, Jesús Bescós, Pablo Carballeira
Contrarily to the majority of the methods of the state-of-the-art, the proposed approach is scene-agnostic, not requiring a tailored adaptation to the target scenario\textemdash e. g., via fine-tunning.