Search Results for author: Rita Ramos

Found 4 papers, 4 papers with code

Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

1 code implementation4 Jun 2024 Wenyan Li, Jiaang Li, Rita Ramos, Raphael Tang, Desmond Elliott

Recent advances in retrieval-augmented models for image captioning highlight the benefit of retrieving related captions for efficient, lightweight models with strong domain-transfer capabilities.

Image Captioning Retrieval

LMCap: Few-shot Multilingual Image Captioning by Retrieval Augmented Language Model Prompting

1 code implementation31 May 2023 Rita Ramos, Bruno Martins, Desmond Elliott

Multilingual image captioning has recently been tackled by training with large-scale machine translated data, which is an expensive, noisy, and time-consuming process.

Decoder Image Captioning +2

Retrieval-augmented Image Captioning

1 code implementation16 Feb 2023 Rita Ramos, Desmond Elliott, Bruno Martins

The encoder in our model jointly processes the image and retrieved captions using a pretrained V&L BERT, while the decoder attends to the multimodal encoder representations, benefiting from the extra textual evidence from the retrieved captions.

Decoder Image Captioning +2

SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation

1 code implementation CVPR 2023 Rita Ramos, Bruno Martins, Desmond Elliott, Yova Kementchedjhieva

Recent advances in image captioning have focused on scaling the data and model size, substantially increasing the cost of pre-training and finetuning.

Decoder Image Captioning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.