no code implementations • 21 Sep 2023 • Alberto Baldrati, Marco Bertini, Tiberio Uricchio, Alberto del Bimbo
Given the recent advances in multimodal image pretraining where visual models trained with semantically dense textual supervision tend to have better generalization capabilities than those trained using categorical attributes or through unsupervised techniques, in this work we investigate how recent CLIP model can be applied in several tasks in artwork domain.
1 code implementation • 22 Aug 2023 • Alberto Baldrati, Marco Bertini, Tiberio Uricchio, Alberto del Bimbo
Given a query composed of a reference image and a relative caption, the Composed Image Retrieval goal is to retrieve images visually similar to the reference one that integrates the modifications expressed by the caption.
Ranked #6 on Image Retrieval on CIRR
1 code implementation • ICIAP 2022 • Simone Ricci, Tiberio Uricchio, Alberto del Bimbo
In this paper, we introduced the novel concept of advisor network to address the problem of noisy labels in image classification.
Ranked #6 on Image Classification on Clothing1M (using extra training data)
2 code implementations • CVPRW 2022 • Alberto Baldrati, Marco Bertini, Tiberio Uricchio, Alberto del Bimbo
The proposed method is based on an initial training stage where a simple combination of visual and textual features is used, to fine-tune the CLIP text encoder.
Ranked #3 on Image Retrieval on LaSCo
Composed Image Retrieval (CoIR) Content-Based Image Retrieval +2
2 code implementations • CVPR 2022 • Alberto Baldrati, Marco Bertini, Tiberio Uricchio, Alberto del Bimbo
the visual content of the query image.
Ranked #9 on Image Retrieval on CIRR
1 code implementation • 14 May 2021 • Fabio Zappardino, Tiberio Uricchio, Lorenzo Seidenari, Alberto del Bimbo
To understand human behavior we must not just recognize individual actions but model possibly complex group activity and interactions.
Ranked #8 on Group Activity Recognition on Volleyball
no code implementations • 21 Apr 2020 • Federico Vaccaro, Marco Bertini, Tiberio Uricchio, Alberto del Bimbo
In this paper, we address the problem of image retrieval by learning images representation based on the activations of a Convolutional Neural Network.
1 code implementation • 4 May 2017 • Federico Becattini, Tiberio Uricchio, Lorenzo Seidenari, Lamberto Ballan, Alberto del Bimbo
In this paper we deal with the problem of predicting action progress in videos.
no code implementations • 16 May 2016 • Tiberio Uricchio, Lamberto Ballan, Lorenzo Seidenari, Alberto del Bimbo
Automatic image annotation is among the fundamental problems in computer vision and pattern recognition, and it is becoming increasingly important in order to develop algorithms that are able to search and browse large-scale image collections.
1 code implementation • 28 Mar 2015 • Xirong Li, Tiberio Uricchio, Lamberto Ballan, Marco Bertini, Cees G. M. Snoek, Alberto del Bimbo
Where previous reviews on content-based image retrieval emphasize on what can be seen in an image to bridge the semantic gap, this survey considers what people tag about an image.