Search Results for author: Aku Rouhe

Found 8 papers, 2 papers with code

Speaker Verification Experiments for Adults and Children Using Shared Embedding Spaces

no code implementations NoDaLiDa 2021 Tuomas Kaseva, Hemant Kumar Kathania, Aku Rouhe, Mikko Kurimo

For children, the system trained on a large corpus of adult speakers performed worse than a system trained on a much smaller corpus of children’s speech.

Speaker Verification

Lahjoita puhetta -- a large-scale corpus of spoken Finnish with some benchmarks

no code implementations24 Mar 2022 Anssi Moisio, Dejan Porjazovski, Aku Rouhe, Yaroslav Getman, Anja Virkkunen, Tamás Grósz, Krister Lindén, Mikko Kurimo

The Donate Speech campaign has so far succeeded in gathering approximately 3600 hours of ordinary, colloquial Finnish speech into the Lahjoita puhetta (Donate Speech) corpus.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Multimodal Machine Translation through Visuals and Speech

no code implementations28 Nov 2019 Umut Sulubacak, Ozan Caglayan, Stig-Arne Grönroos, Aku Rouhe, Desmond Elliott, Lucia Specia, Jörg Tiedemann

Multimodal machine translation involves drawing information from more than one modality, based on the assumption that the additional modalities will contain useful alternative views of the input data.

Image Captioning Multimodal Machine Translation +4

Cannot find the paper you are looking for? You can Submit a new open access paper.