Search Results for author: Federico Cocchi

Found 3 papers, 1 papers with code

Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs

no code implementations • 23 Apr 2024 • Davide Caffagni, Federico Cocchi, Nicholas Moratelli, Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara

Multimodal LLMs are the natural evolution of LLMs, and enlarge their capabilities so as to work beyond the pure textual modality.

Question Answering Retrieval +1

Paper
Add Code

The (R)Evolution of Multimodal Large Language Models: A Survey

no code implementations • 19 Feb 2024 • Davide Caffagni, Federico Cocchi, Luca Barsellotti, Nicholas Moratelli, Sara Sarto, Lorenzo Baraldi, Marcella Cornia, Rita Cucchiara

Connecting text and visual modalities plays an essential role in generative intelligence.

Image Generation Instruction Following +1

Paper
Add Code

Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models

1 code implementation • 27 Nov 2023 • Samuele Poppi, Tobia Poppi, Federico Cocchi, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara

We show how this can be done by fine-tuning a CLIP model on synthetic data obtained from a large language model trained to convert between safe and unsafe sentences, and a text-to-image generator.

Cross-Modal Retrieval Image Retrieval +5

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.