TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Text-to-Music Generation	MusicCaps	MusicGen w/o melody (1.5B)	FAD VGG	3.4	# 4
Text-to-Music Generation	MusicCaps	MusicGen w/ random melody (1.5B)	FAD VGG	5.0	# 9
Text-to-Music Generation	MusicCaps	MusicGen w/o melody (3.3B)	FAD VGG	3.8	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/simple-and-controllable-music-generation/text-to-music-generation-on-musiccaps)](https://paperswithcode.com/sota/text-to-music-generation-on-musiccaps?p=simple-and-controllable-music-generation)`

Simple and Controllable Music Generation

NeurIPS 2023 · Jade Copet, Felix Kreuk, Itai Gat, Tal Remez, David Kant, Gabriel Synnaeve, Yossi Adi, Alexandre Défossez ·

We tackle the task of conditional music generation. We introduce MusicGen, a single Language Model (LM) that operates over several streams of compressed discrete music representation, i.e., tokens. Unlike prior work, MusicGen is comprised of a single-stage transformer LM together with efficient token interleaving patterns, which eliminates the need for cascading several models, e.g., hierarchically or upsampling. Following this approach, we demonstrate how MusicGen can generate high-quality samples, both mono and stereo, while being conditioned on textual description or melodic features, allowing better controls over the generated output. We conduct extensive empirical evaluation, considering both automatic and human studies, showing the proposed approach is superior to the evaluated baselines on a standard text-to-music benchmark. Through ablation studies, we shed light over the importance of each of the components comprising MusicGen. Music samples, code, and models are available at https://github.com/facebookresearch/audiocraft

PDF Abstract NeurIPS 2023 PDF NeurIPS 2023 Abstract

Code

Add Remove Mark official

facebookresearch/audiocraft official

19,648

collabora/whisperspeech

↳ Quickstart in

Colab

3,352

Tasks

Add Remove

Language Modelling

Music Generation

Text-to-Music Generation

Datasets

MusicCaps

Results from the Paper

Add Remove

Ranked #4 on Text-to-Music Generation on MusicCaps

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Text-to-Music Generation	MusicCaps	MusicGen w/o melody (1.5B)	FAD VGG	3.4	# 4	Compare
Text-to-Music Generation	MusicCaps	MusicGen w/ random melody (1.5B)	FAD VGG	5.0	# 9	Compare
Text-to-Music Generation	MusicCaps	MusicGen w/o melody (3.3B)	FAD VGG	3.8	# 6	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Simple and Controllable Music Generation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove