Search Results for author: Enrico Fini

Found 23 papers, 17 papers with code

On Large Multimodal Models as Open-World Image Classifiers

1 code implementation27 Mar 2025 Alessandro Conti, Massimiliano Mancini, Enrico Fini, Yiming Wang, Paolo Rota, Elisa Ricci

Despite this remarkable capability, most existing studies on LMM classification performance are surprisingly limited in scope, often assuming a closed-world setting with a predefined set of categories.

image-classification Image Classification

FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

no code implementations19 Feb 2025 Roman Bachmann, Jesse Allardice, David Mizrahi, Enrico Fini, Oğuzhan Fatih Kar, Elmira Amirloo, Alaaeldin El-Nouby, Amir Zamir, Afshin Dehghan

A key finding is that FlexTok enables next-token prediction to describe images in a coarse-to-fine "visual vocabulary", and that the number of tokens to generate depends on the complexity of the generation task.

Image Generation

Retrieval-enriched zero-shot image classification in low-resource domains

no code implementations1 Nov 2024 Nicola Dall'Asen, Yiming Wang, Enrico Fini, Elisa Ricci

Low-resource domains, characterized by scarce data and annotations, present significant challenges for language and visual understanding tasks, with the latter much under-explored in the literature.

image-classification Image Classification +3

Visual Scratchpads: Enabling Global Reasoning in Vision

no code implementations10 Oct 2024 Aryo Lotfi, Enrico Fini, Samy Bengio, Moin Nabi, Emmanuel Abbe

Modern vision models have achieved remarkable success in benchmarks where local features provide critical information about the target.

Out-of-Distribution Generalization

Vocabulary-free Image Classification and Semantic Segmentation

1 code implementation16 Apr 2024 Alessandro Conti, Enrico Fini, Massimiliano Mancini, Paolo Rota, Yiming Wang, Elisa Ricci

To address VIC, we propose Category Search from External Databases (CaSED), a training-free method that leverages a pre-trained vision-language model and an external database.

Classification image-classification +6

Continual Contrastive Spoken Language Understanding

no code implementations4 Oct 2023 Umberto Cappellazzo, Enrico Fini, Muqiao Yang, Daniele Falavigna, Alessio Brutti, Bhiksha Raj

In this paper, we investigate the problem of learning sequence-to-sequence models for spoken language understanding in a class-incremental learning (CIL) setting and we propose COCONUT, a CIL method that relies on the combination of experience replay and contrastive learning.

class-incremental learning Class Incremental Learning +4

Vocabulary-free Image Classification

1 code implementation NeurIPS 2023 Alessandro Conti, Enrico Fini, Massimiliano Mancini, Paolo Rota, Yiming Wang, Elisa Ricci

We thus formalize a novel task, termed as Vocabulary-free Image Classification (VIC), where we aim to assign to an input image a class that resides in an unconstrained language-induced semantic space, without the prerequisite of a known vocabulary.

Classification image-classification +6

Improved baselines for vision-language pre-training

1 code implementation15 May 2023 Enrico Fini, Pietro Astolfi, Adriana Romero-Soriano, Jakob Verbeek, Michal Drozdzal

Indeed, we find that a simple CLIP baseline can also be improved substantially, up to a 25% relative improvement on downstream zero-shot tasks, by using well-known training techniques that are popular in other subfields.

Contrastive Learning Data Augmentation +1

StyLIP: Multi-Scale Style-Conditioned Prompt Learning for CLIP-based Domain Generalization

no code implementations18 Feb 2023 Shirsha Bose, Ankit Jha, Enrico Fini, Mainak Singha, Elisa Ricci, Biplab Banerjee

Our method focuses on a domain-agnostic prompt learning strategy, aiming to disentangle the visual style and content information embedded in CLIP's pre-trained vision encoder, enabling effortless adaptation to novel domains during inference.

Domain Generalization Prompt Learning +1

A soft nearest-neighbor framework for continual semi-supervised learning

2 code implementations ICCV 2023 Zhiqi Kang, Enrico Fini, Moin Nabi, Elisa Ricci, Karteek Alahari

Despite significant advances, the performance of state-of-the-art continual learning approaches hinges on the unrealistic scenario of fully labeled data.

Continual Learning

Uncertainty-aware Contrastive Distillation for Incremental Semantic Segmentation

1 code implementation26 Mar 2022 Guanglei Yang, Enrico Fini, Dan Xu, Paolo Rota, Mingli Ding, Moin Nabi, Xavier Alameda-Pineda, Elisa Ricci

This problem has been widely investigated in the research community and several Incremental Learning (IL) approaches have been proposed in the past years.

Contrastive Learning image-classification +6

Continual Attentive Fusion for Incremental Learning in Semantic Segmentation

1 code implementation1 Feb 2022 Guanglei Yang, Enrico Fini, Dan Xu, Paolo Rota, Mingli Ding, Hao Tang, Xavier Alameda-Pineda, Elisa Ricci

To fill this gap, in this paper we introduce a novel attentive feature distillation approach to mitigate catastrophic forgetting while accounting for semantic spatial- and channel-level dependencies.

Incremental Learning Semantic Segmentation

Self-Supervised Models are Continual Learners

1 code implementation CVPR 2022 Enrico Fini, Victor G. Turrisi da Costa, Xavier Alameda-Pineda, Elisa Ricci, Karteek Alahari, Julien Mairal

Self-supervised models have been shown to produce comparable or better visual representations than their supervised counterparts when trained offline on unlabeled data at scale.

Continual Learning Representation Learning

Neighborhood Contrastive Learning for Novel Class Discovery

1 code implementation CVPR 2021 Zhun Zhong, Enrico Fini, Subhankar Roy, Zhiming Luo, Elisa Ricci, Nicu Sebe

In this paper, we address Novel Class Discovery (NCD), the task of unveiling new classes in a set of unlabeled samples given a labeled dataset with known classes.

Clustering Contrastive Learning +1

Online Continual Learning under Extreme Memory Constraints

1 code implementation ECCV 2020 Enrico Fini, Stéphane Lathuilière, Enver Sangineto, Moin Nabi, Elisa Ricci

Continual Learning (CL) aims to develop agents emulating the human ability to sequentially learn new tasks while being able to retain knowledge obtained from past experiences.

Continual Learning

Supervised online diarization with sample mean loss for multi-domain data

1 code implementation4 Nov 2019 Enrico Fini, Alessio Brutti

Recently, a fully supervised speaker diarization approach was proposed (UIS-RNN) which models speakers using multiple instances of a parameter-sharing recurrent neural network.

Clustering speaker-diarization +1

Cannot find the paper you are looking for? You can Submit a new open access paper.