Search Results for author: CJ Carr

Found 8 papers, 5 papers with code

Scaling Transformers for Low-Bitrate High-Quality Speech Coding

1 code implementation • 29 Nov 2024 • Julian D Parker, Anton Smirnov, Jordi Pons, CJ Carr, Zack Zukowski, Zach Evans, Xubo Liu

The tokenization of speech with neural audio codec models is a vital part of modern AI pipelines for the generation or understanding of speech, alone or in a multimodal context.

Quantization

Stable Audio Open

1 code implementation • 19 Jul 2024 • Zach Evans, Julian D. Parker, CJ Carr, Zack Zukowski, Josiah Taylor, Jordi Pons

Open generative models are vitally important for the community, allowing for fine-tunes and serving as baselines when presenting new models.

Audio Generation, Text-to-Music Generation

Long-form music generation with latent diffusion

1 code implementation • 16 Apr 2024 • Zach Evans, Julian D. Parker, CJ Carr, Zack Zukowski, Josiah Taylor, Jordi Pons

Audio-based generative models for music have seen great strides recently, but so far have not managed to produce full-length music tracks with coherent musical structure from text prompts.

Audio Generation, Music Generation

Fast Timing-Conditioned Latent Audio Diffusion

2 code implementations • 7 Feb 2024 • Zach Evans, CJ Carr, Josiah Taylor, Scott H. Hawley, Jordi Pons

Generating long-form 44.1kHz stereo audio from text prompts can be computationally demanding.

 Ranked #1 on Text-to-Music Generation on MusicCaps (KL_passt metric)

Audio Generation, Text-to-Music Generation

ProgGP: From GuitarPro Tablature Neural Generation To Progressive Metal Production

no code implementations • 11 Jul 2023 • Jackson Loth, Pedro Sarmento, CJ Carr, Zack Zukowski, Mathieu Barthet

Recent work in the field of symbolic music generation has shown value in using a tokenization based on the GuitarPro format, a symbolic representation supporting guitar expressive attributes, as an input and output representation.

Music Generation

DadaGP: A Dataset of Tokenized GuitarPro Songs for Sequence Models

1 code implementation • 30 Jul 2021 • Pedro Sarmento, Adarsh Kumar, CJ Carr, Zack Zukowski, Mathieu Barthet, Yi-Hsuan Yang

In this work, we present DadaGP, a new symbolic music dataset comprising 26,181 song scores in the GuitarPro format covering 739 musical genres, along with an accompanying tokenized format well-suited for generative sequence models such as the Transformer.

Decoder, Genre classification

Generating Albums with SampleRNN to Imitate Metal, Rock, and Punk Bands

no code implementations • 16 Nov 2018 • CJ Carr, Zack Zukowski

This early example of neural synthesis is a proof-of-concept for how machine learning can drive new types of music software.

Sound, Audio and Speech Processing
