Search Results for author: Luka Chkhetiani

Found 3 papers, 0 papers with code

Anatomy of Industrial Scale Multilingual ASR

no code implementations15 Apr 2024 Francis McCann Ramirez, Luka Chkhetiani, Andrew Ehrenberg, Robert McHardy, Rami Botros, Yash Khare, Andrea Vanzo, Taufiquzzaman Peyash, Gabriel Oexle, Michael Liang, Ilya Sklyar, Enver Fakhan, Ahmed Etefy, Daniel McCrystal, Sam Flamini, Domenic Donato, Takuya Yoshioka

This paper describes AssemblyAI's industrial-scale automatic speech recognition (ASR) system, designed to meet the requirements of large-scale, multilingual ASR serving various application needs.

Anatomy Automatic Speech Recognition +3

Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

no code implementations10 Apr 2024 Kevin Zhang, Luka Chkhetiani, Francis McCann Ramirez, Yash Khare, Andrea Vanzo, Michael Liang, Sergio Ramirez Martin, Gabriel Oexle, Ruben Bousbib, Taufiquzzaman Peyash, Michael Nguyen, Dillon Pulliam, Domenic Donato

This paper presents Conformer-1, an end-to-end Automatic Speech Recognition (ASR) model trained on an extensive dataset of 570k hours of speech audio data, 91% of which was acquired from publicly available sources.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

SE-MelGAN -- Speaker Agnostic Rapid Speech Enhancement

no code implementations13 Jun 2020 Luka Chkhetiani, Levan Bejanidze

Recent advancement in Generative Adversarial Networks in speech synthesis domain[3],[2] have shown, that it's possible to train GANs [8] in a reliable manner for high quality coherent waveform generation from mel-spectograms.

Speech Enhancement Speech Synthesis

Cannot find the paper you are looking for? You can Submit a new open access paper.