Music Captioning
5 papers with code • 0 benchmarks • 1 datasets
Benchmarks
These leaderboards are used to track progress in Music Captioning
Most implemented papers
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning
To fill this gap, we present a methodology for generating question-answer pairs from existing audio captioning datasets and introduce the MusicQA Dataset designed for answering open-ended music-related questions.
ALCAP: Alignment-Augmented Music Captioner
Music captioning has gained significant attention in the wake of the rising prominence of streaming media platforms.
LP-MusicCaps: LLM-Based Pseudo Music Captioning
In addition, we trained a transformer-based music captioning model with the dataset and evaluated it under zero-shot and transfer-learning settings.
MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response
Large Language Models (LLMs) have shown immense potential in multimodal applications, yet the convergence of textual and musical domains remains not well-explored.
The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation
We introduce the Song Describer dataset (SDD), a new crowdsourced corpus of high-quality audio-caption pairs, designed for the evaluation of music-and-language models.