Search Results for author: Alessandro Conti

Found 11 papers, 8 papers with code

On Large Multimodal Models as Open-World Image Classifiers

1 code implementation27 Mar 2025 Alessandro Conti, Massimiliano Mancini, Enrico Fini, Yiming Wang, Paolo Rota, Elisa Ricci

Despite this remarkable capability, most existing studies on LMM classification performance are surprisingly limited in scope, often assuming a closed-world setting with a predefined set of categories.

image-classification Image Classification

Compositional Caching for Training-free Open-vocabulary Attribute Detection

no code implementations CVPR 2025 Marco Garosi, Alessandro Conti, Gaowen Liu, Elisa Ricci, Massimiliano Mancini

Those are aggregated at inference time based on the similarity between the input and cache images, refining the predictions of underlying Vision-Language Models (VLMs).

Attribute Open Vocabulary Attribute Detection

Vocabulary-free Image Classification and Semantic Segmentation

1 code implementation16 Apr 2024 Alessandro Conti, Enrico Fini, Massimiliano Mancini, Paolo Rota, Yiming Wang, Elisa Ricci

To address VIC, we propose Category Search from External Databases (CaSED), a training-free method that leverages a pre-trained vision-language model and an external database.

Classification image-classification +6

The Unreasonable Effectiveness of Large Language-Vision Models for Source-free Video Domain Adaptation

1 code implementation ICCV 2023 Giacomo Zara, Alessandro Conti, Subhankar Roy, Stéphane Lathuilière, Paolo Rota, Elisa Ricci

Source-Free Video Unsupervised Domain Adaptation (SFVUDA) task consists in adapting an action recognition model, trained on a labelled source dataset, to an unlabelled target dataset, without accessing the actual source data.

Action Recognition Unsupervised Domain Adaptation

Vocabulary-free Image Classification

1 code implementation NeurIPS 2023 Alessandro Conti, Enrico Fini, Massimiliano Mancini, Paolo Rota, Yiming Wang, Elisa Ricci

We thus formalize a novel task, termed as Vocabulary-free Image Classification (VIC), where we aim to assign to an input image a class that resides in an unconstrained language-induced semantic space, without the prerequisite of a known vocabulary.

Classification image-classification +6

Cannot find the paper you are looking for? You can Submit a new open access paper.