Search Results for author: Stefan Fauth

Found 1 papers, 1 papers with code

Zero-shot audio captioning with audio-language model guidance and audio context keywords

1 code implementation14 Nov 2023 Leonard Salewski, Stefan Fauth, A. Sophia Koepke, Zeynep Akata

In particular, our framework exploits a pre-trained large language model (LLM) for generating the text which is guided by a pre-trained audio-language model to produce captions that describe the audio content.

Descriptive Image Captioning +5

Cannot find the paper you are looking for? You can Submit a new open access paper.