Search Results for author: Yuduo Wang

Found 1 papers, 1 papers with code

RSAdapter: Adapting Multimodal Models for Remote Sensing Visual Question Answering

1 code implementation19 Oct 2023 Yuduo Wang, Pedram Ghamisi

In recent years, with the rapid advancement of transformer models, transformer-based multimodal architectures have found wide application in various downstream tasks, including but not limited to Image Captioning, Visual Question Answering (VQA), and Image-Text Generation.

Image Captioning Question Answering +2

Cannot find the paper you are looking for? You can Submit a new open access paper.