Search Results for author: Arnaud Joly

Found 13 papers, 3 papers with code

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

no code implementations • 12 Feb 2024 • Mateusz Łajszczak, Guillermo Cámbara, Yang Li, Fatih Beyhan, Arent van Korlaar, Fan Yang, Arnaud Joly, Álvaro Martín-Cortinas, Ammar Abbas, Adam Michalski, Alexis Moinet, Sri Karlapati, Ewa Muszyńska, Haohan Guo, Bartosz Putrycz, Soledad López Gambino, Kayeon Yoo, Elena Sokolova, Thomas Drugman

Echoing the widely-reported "emergent abilities" of large language models when trained on increasing volume of data, we show that BASE TTS variants built with 10K+ hours and 500M+ parameters begin to demonstrate natural prosody on textually complex sentences.

Disentanglement

Paper
Add Code

Controllable Emphasis with zero data for text-to-speech

no code implementations • 13 Jul 2023 • Arnaud Joly, Marco Nicolis, Ekaterina Peterova, Alessandro Lombardi, Ammar Abbas, Arent van Korlaar, Aman Hussain, Parul Sharma, Alexis Moinet, Mateusz Lajszczak, Penny Karanasou, Antonio Bonafonte, Thomas Drugman, Elena Sokolova

We show that this technique significantly closes the gap to methods that require explicit recordings.

Sentence

Paper
Add Code

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody

no code implementations • 29 Jun 2022 • Peter Makarov, Ammar Abbas, Mateusz Łajszczak, Arnaud Joly, Sri Karlapati, Alexis Moinet, Thomas Drugman, Penny Karanasou

In this paper, we examine simple extensions to a Transformer-based FastSpeech-like system, with the goal of improving prosody for multi-sentence TTS.

Language Modelling Sentence

Paper
Add Code

Distribution augmentation for low-resource expressive text-to-speech

no code implementations • 13 Feb 2022 • Mateusz Lajszczak, Animesh Prasad, Arent van Korlaar, Bajibabu Bollepalli, Antonio Bonafonte, Arnaud Joly, Marco Nicolis, Alexis Moinet, Thomas Drugman, Trevor Wood, Elena Sokolova

This paper presents a novel data augmentation technique for text-to-speech (TTS), that allows to generate new (text, audio) training examples without requiring any additional data.

Data Augmentation

Paper
Add Code

Multi-Scale Spectrogram Modelling for Neural Text-to-Speech

no code implementations • 29 Jun 2021 • Ammar Abbas, Bajibabu Bollepalli, Alexis Moinet, Arnaud Joly, Penny Karanasou, Peter Makarov, Simon Slangens, Sri Karlapati, Thomas Drugman

We propose a novel Multi-Scale Spectrogram (MSS) modelling approach to synthesise speech with an improved coarse and fine-grained prosody.

Sentence

Paper
Add Code

A learned conditional prior for the VAE acoustic space of a TTS system

no code implementations • 14 Jun 2021 • Penny Karanasou, Sri Karlapati, Alexis Moinet, Arnaud Joly, Ammar Abbas, Simon Slangen, Jaime Lorenzo Trueba, Thomas Drugman

Many factors influence speech yielding different renditions of a given sentence.

Sentence

Paper
Add Code

Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech

no code implementations • 4 Nov 2020 • Sri Karlapati, Ammar Abbas, Zack Hodari, Alexis Moinet, Arnaud Joly, Penny Karanasou, Thomas Drugman

In Stage II, we propose a novel method to sample from this learnt prosodic distribution using the contextual information available in text.

Graph Attention Representation Learning +2

Paper
Add Code

Gradient tree boosting with random output projections for multi-label classification and multi-output regression

no code implementations • 18 May 2019 • Arnaud Joly, Louis Wehenkel, Pierre Geurts

We consider several extensions of gradient boosting to address such problems.

General Classification Multi-Label Classification +1

Paper
Add Code

Globally Induced Forest: A Prepruning Compression Scheme

1 code implementation • ICML 2017 • Jean-Michel Begon, Arnaud Joly, Pierre Geurts

Tree-based ensemble models are heavy memory-wise.

Paper
Code

Exploiting random projections and sparsity with random forests and gradient boosting methods -- Application to multi-label and multi-output learning, random forest model compression and leveraging input sparsity

no code implementations • 26 Apr 2017 • Arnaud Joly

Within machine learning, the supervised learning field aims at modeling the input-output relationship of a system, from past observations of its behavior.

Model Compression

Paper
Add Code

Simple connectome inference from partial correlation statistics in calcium imaging

1 code implementation • 30 Jun 2014 • Antonio Sutera, Arnaud Joly, Vincent François-Lavet, Zixiao Aaron Qiu, Gilles Louppe, Damien Ernst, Pierre Geurts

In this work, we propose a simple yet effective solution to the problem of connectome inference in calcium imaging data.

Paper
Code

Random forests with random projections of the output space for high dimensional multi-label classification

no code implementations • 14 Apr 2014 • Arnaud Joly, Pierre Geurts, Louis Wehenkel

We adapt the idea of random projections applied to the output space, so as to enhance tree-based ensemble methods in the context of multi-label classification.

General Classification Multi-Label Classification

Paper
Add Code

API design for machine learning software: experiences from the scikit-learn project

4 code implementations • 1 Sep 2013 • Lars Buitinck, Gilles Louppe, Mathieu Blondel, Fabian Pedregosa, Andreas Mueller, Olivier Grisel, Vlad Niculae, Peter Prettenhofer, Alexandre Gramfort, Jaques Grobler, Robert Layton, Jake Vanderplas, Arnaud Joly, Brian Holt, Gaël Varoquaux

Scikit-learn is an increasingly popular machine learning li- brary.

BIG-bench Machine Learning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.