Search Results for author: Lorenzo Agnolucci

Found 13 papers, 11 papers with code

Image Intrinsic Scale Assessment: Bridging the Gap Between Quality and Resolution

no code implementations10 Feb 2025 Vlad Hosu, Lorenzo Agnolucci, Daisuke Iso, Dietmar Saupe

To bridge this gap, we introduce the Image Intrinsic Scale (IIS), defined as the largest scale where an image exhibits its highest perceived quality.

Image Quality Assessment

Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion

1 code implementation6 Feb 2025 Marco Mistretta, Alberto Baldrati, Lorenzo Agnolucci, Marco Bertini, Andrew D. Bagdanov

In this paper, we show that the common practice of individually exploiting the text or image encoders of these powerful multi-modal models is highly suboptimal for intra-modal tasks like image-to-image retrieval.

Image Classification Image Retrieval +2

AIM 2024 Challenge on UHD Blind Photo Quality Assessment

1 code implementation24 Sep 2024 Vlad Hosu, Marcos V. Conde, Lorenzo Agnolucci, Nabajeet Barman, Saman Zadtootaghaj, Radu Timofte

By pushing the boundaries of NR-IQA for high-resolution photos, the UHD-IQA Challenge aims to stimulate the development of practical models that can keep pace with the rapidly evolving landscape of digital photography.

4k Computational Efficiency +3

UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality Assessment

1 code implementation25 Jun 2024 Vlad Hosu, Lorenzo Agnolucci, Oliver Wiedemann, Daisuke Iso, Dietmar Saupe

We introduce a novel Image Quality Assessment (IQA) dataset comprising 6073 UHD-1 (4K) images, annotated at a fixed width of 3840 pixels.

4k Blind Image Quality Assessment +1

iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval

2 code implementations5 May 2024 Lorenzo Agnolucci, Alberto Baldrati, Marco Bertini, Alberto del Bimbo

Given a query consisting of a reference image and a relative caption, Composed Image Retrieval (CIR) aims to retrieve target images visually similar to the reference one while incorporating the changes specified in the relative caption.

Benchmarking Retrieval +1

Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment

1 code implementation17 Mar 2024 Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini

At the same time, we force CLIP to generate consistent representations for images with similar content and the same level of degradation.

Perceptual Quality Improvement in Videoconferencing using Keyframes-based GAN

1 code implementation7 Nov 2023 Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini, Alberto del Bimbo

Given that, in this context, the speaker is typically in front of the camera and remains the same for the entire duration of the transmission, we can maintain a set of reference keyframes of the person from the higher-quality I-frames that are transmitted within the video stream and exploit them to guide the visual quality improvement; a novel aspect of this approach is the update policy that maintains and updates a compact and effective set of reference keyframes.

Video Compression

Reference-based Restoration of Digitized Analog Videotapes

2 code implementations20 Oct 2023 Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini, Alberto del Bimbo

We design a transformer-based Swin-UNet network that exploits both neighboring and reference frames via our Multi-Reference Spatial Feature Fusion (MRSFF) blocks.

Analog Video Restoration Artifact Detection

ARNIQA: Learning Distortion Manifold for Image Quality Assessment

1 code implementation20 Oct 2023 Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini, Alberto del Bimbo

In this work, we propose a self-supervised approach named ARNIQA (leArning distoRtion maNifold for Image Quality Assessment) for modeling the image distortion manifold to obtain quality representations in an intrinsic manner.

Mapping Memes to Words for Multimodal Hateful Meme Classification

1 code implementation12 Oct 2023 Giovanni Burbi, Alberto Baldrati, Lorenzo Agnolucci, Marco Bertini, Alberto del Bimbo

Multimodal image-text memes are prevalent on the internet, serving as a unique form of communication that combines visual and textual elements to convey humor, ideas, or emotions.

Hateful Meme Classification Language Modeling +1

ECO: Ensembling Context Optimization for Vision-Language Models

no code implementations26 Jul 2023 Lorenzo Agnolucci, Alberto Baldrati, Francesco Todino, Federico Becattini, Marco Bertini, Alberto del Bimbo

Among these, the CLIP model has shown remarkable capabilities for zero-shot transfer by matching an image and a custom textual prompt in its latent space.

Classification Image Classification

Zero-Shot Composed Image Retrieval with Textual Inversion

2 code implementations ICCV 2023 Alberto Baldrati, Lorenzo Agnolucci, Marco Bertini, Alberto del Bimbo

Composed Image Retrieval (CIR) aims to retrieve a target image based on a query composed of a reference image and a relative caption that describes the difference between the two images.

Retrieval Zero-Shot Composed Image Retrieval (ZS-CIR)

Cannot find the paper you are looking for? You can Submit a new open access paper.