no code implementations • NoDaLiDa 2021 • Tuomas Kaseva, Hemant Kumar Kathania, Aku Rouhe, Mikko Kurimo
For children, the system trained on a large corpus of adult speakers performed worse than a system trained on a much smaller corpus of children’s speech.
no code implementations • NAACL (SIGMORPHON) 2022 • Aku Rouhe, Stig-Arne Grönroos, Sami Virpioja, Mathias Creutz, Mikko Kurimo
Our approach is to pre-segment the input data for a neural sequence-to-sequence model with the unsupervised method.
1 code implementation • 28 Mar 2022 • Anja Virkkunen, Aku Rouhe, Nhan Phan, Mikko Kurimo
We set benchmarks on the official test sets, as well as multiple other recently used test sets.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
no code implementations • 24 Mar 2022 • Anssi Moisio, Dejan Porjazovski, Aku Rouhe, Yaroslav Getman, Anja Virkkunen, Tamás Grósz, Krister Lindén, Mikko Kurimo
The Donate Speech campaign has so far succeeded in gathering approximately 3600 hours of ordinary, colloquial Finnish speech into the Lahjoita puhetta (Donate Speech) corpus.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
3 code implementations • 8 Jun 2021 • Mirco Ravanelli, Titouan Parcollet, Peter Plantinga, Aku Rouhe, Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, Ju-chieh Chou, Sung-Lin Yeh, Szu-Wei Fu, Chien-Feng Liao, Elena Rastorgueva, François Grondin, William Aris, Hwidong Na, Yan Gao, Renato de Mori, Yoshua Bengio
SpeechBrain is an open-source and all-in-one speech toolkit.
no code implementations • 28 Nov 2019 • Umut Sulubacak, Ozan Caglayan, Stig-Arne Grönroos, Aku Rouhe, Desmond Elliott, Lucia Specia, Jörg Tiedemann
Multimodal machine translation involves drawing information from more than one modality, based on the assumption that the additional modalities will contain useful alternative views of the input data.
Ranked #3 on
Multimodal Machine Translation
on Multi30K
no code implementations • IWSLT (EMNLP) 2018 • Umut Sulubacak, Jörg Tiedemann, Aku Rouhe, Stig-Arne Grönroos, Mikko Kurimo
In this paper, we also describe the experiments leading up to our final systems.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+4