Search Results for author: Fadi Biadsy

Found 7 papers, 2 papers with code

Streaming Parrotron for on-device speech-to-speech conversion

no code implementations25 Oct 2022 Oleg Rybakov, Fadi Biadsy, Xia Zhang, Liyang Jiang, Phoenix Meadowlark, Shivani Agrawal

We present a streaming-based approach to produce an acceptable delay, with minimal loss in speech conversion quality, when compared to a reference state of the art non-streaming approach.

Quantization STS

A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization

no code implementations23 Mar 2022 Fadi Biadsy, Youzheng Chen, Xia Zhang, Oleg Rybakov, Andrew Rosenberg, Pedro J. Moreno

We also show that learning a speaker-embedding space can scale further and reduce the amount of personalization training data required per speaker.

Real time spectrogram inversion on mobile phone

1 code implementation1 Mar 2022 Oleg Rybakov, Marco Tagliasacchi, Yunpeng Li, Liyang Jiang, Xia Zhang, Fadi Biadsy

We present two methods of real time magnitude spectrogram inversion: streaming Griffin Lim(GL) and streaming MelGAN.

Direct speech-to-speech translation with a sequence-to-sequence model

1 code implementation12 Apr 2019 Ye Jia, Ron J. Weiss, Fadi Biadsy, Wolfgang Macherey, Melvin Johnson, Zhifeng Chen, Yonghui Wu

We present an attention-based sequence-to-sequence neural network which can directly translate speech from one language into speech in another language, without relying on an intermediate text representation.

Speech Synthesis Speech-to-Speech Translation +3

Cannot find the paper you are looking for? You can Submit a new open access paper.