no code implementations • 17 Mar 2024 • Igor Sterner, Weizhe Lin, Jinghong Chen, Bill Byrne
Two approaches have emerged to input images into large language models (LLMs).
Image Captioning Question Answering +1