no code implementations • NoDaLiDa 2021 • Tuomas Kaseva, Hemant Kumar Kathania, Aku Rouhe, Mikko Kurimo
For children, the system trained on a large corpus of adult speakers performed worse than a system trained on a much smaller corpus of children’s speech.
no code implementations • COLING (TextGraphs) 2020 • Ragheb Al-Ghezi, Mikko Kurimo
We propose a simple and efficient framework to learn syntactic embeddings based on information derived from constituency parse trees.
no code implementations • EMNLP (WNUT) 2020 • Luca Molteni, Mittul Singh, Juho Leinonen, Katri Leino, Mikko Kurimo, Emanuele Della Valle
In this article, we compare two crowdsourcing sources on a dialogue paraphrasing task revolving around a chatbot service.
1 code implementation • NoDaLiDa 2021 • Juho Leinonen, Sami Virpioja, Mikko Kurimo
Forced alignment is an effective process to speed up linguistic research.
no code implementations • NoDaLiDa 2021 • Hemant Kumar Kathania, Sudarsana Reddy Kadiri, Paavo Alku, Mikko Kurimo
The proposed method is used to improve the speech intelligibility to enhance the children’s speech recognition using an acoustic model trained on adult speech.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
no code implementations • EURALI (LREC) 2022 • Juho Leinonen, Niko Partanen, Sami Virpioja, Mikko Kurimo
Cross-language forced alignment is a solution for linguists who create speech corpora for very low-resource languages.
no code implementations • NAACL (SIGMORPHON) 2022 • Aku Rouhe, Stig-Arne Grönroos, Sami Virpioja, Mathias Creutz, Mikko Kurimo
Our approach is to pre-segment the input data for a neural sequence-to-sequence model with the unsupervised method.
no code implementations • COLING 2022 • Khalid Alnajjar, Mika Hämäläinen, Jörg Tiedemann, Jorma Laaksonen, Mikko Kurimo
Our results show that the model is capable of correctly detecting whether an utterance is humorous 78% of the time and how long the audience's laughter reaction should last with a mean absolute error of 600 milliseconds.
no code implementations • 28 Oct 2022 • Tamás Grósz, Mittul Singh, Sudarsana Reddy Kadiri, Hemant Kathania, Mikko Kurimo
The current state-of-the-art methods proposed for these tasks are ensembles based on deep neural networks like ResNets in conjunction with feature engineering.
no code implementations • 10 Aug 2022 • Georgios Karakasidis, Tamás Grósz, Mikko Kurimo
We hypothesize that end-to-end models can achieve better performance when provided with an organized training set consisting of examples that exhibit an increasing level of difficulty (i. e. a curriculum).
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
1 code implementation • 28 Mar 2022 • Anja Virkkunen, Aku Rouhe, Nhan Phan, Mikko Kurimo
We set benchmarks on the official test sets, as well as multiple other recently used test sets.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
no code implementations • 24 Mar 2022 • Anssi Moisio, Dejan Porjazovski, Aku Rouhe, Yaroslav Getman, Anja Virkkunen, Tamás Grósz, Krister Lindén, Mikko Kurimo
The Donate Speech campaign has so far succeeded in gathering approximately 3600 hours of ordinary, colloquial Finnish speech into the Lahjoita puhetta (Donate Speech) corpus.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
1 code implementation • 29 Aug 2020 • Hemant Kathania, Mittul Singh, Tamás Grósz, Mikko Kurimo
Firstly, we apply the prosody-based data augmentation to supplement the audio data.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+3
1 code implementation • 19 Aug 2020 • Katri Leino, Juho Leinonen, Mittul Singh, Sami Virpioja, Mikko Kurimo
Using this corpus, we also construct a retrieval-based evaluation task for Finnish chatbot development.
no code implementations • 6 Aug 2020 • Tamás Grósz, Mittul Singh, Sudarsana Reddy Kadiri, Hemant Kathania, Mikko Kurimo
On ComParE 2020 tasks, we investigate applying an ensemble of E2E models for robust performance and developing task-specific modifications for each task.
no code implementations • LREC 2020 • Mittul Singh, Peter Smit, Sami Virpioja, Mikko Kurimo
We, however, show that for character-based NNLMs, only pretraining with a related language improves the ASR performance, and using an unrelated language may deteriorate it.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+3
1 code implementation • 28 May 2020 • Mittul Singh, Sami Virpioja, Peter Smit, Mikko Kurimo
On these tasks, interpolating the baseline RNNLM approximation and a conventional LM outperforms the conventional LM in terms of the Maximum Term Weighted Value for single-character subwords.
1 code implementation • 8 Apr 2020 • Stig-Arne Grönroos, Sami Virpioja, Mikko Kurimo
There are several approaches for improving neural machine translation for low-resource languages: Monolingual data can be exploited via pretraining or data augmentation; Parallel corpora on related language pairs can be used via parameter sharing or transfer learning in multilingual models; Subword segmentation and regularization techniques can be applied to ensure high coverage of the vocabulary.
no code implementations • 14 Mar 2020 • Abhilash Jain, Aku Ruohe, Stig-Arne Grönroos, Mikko Kurimo
Transformers have recently taken the center stage in language modeling after LSTM's were considered the dominant model architecture for a long time.
1 code implementation • LREC 2020 • Stig-Arne Grönroos, Sami Virpioja, Mikko Kurimo
Using English, Finnish, North Sami, and Turkish data sets, we show that this approach is able to find better solutions to the optimization problem defined by the Morfessor Baseline model than its original recursive training algorithm.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+3
no code implementations • WS 2019 • Anja Virkkunen, Juri Lukkarila, Kalle Palom{\"a}ki, Mikko Kurimo
In the mobile device, augmented reality (AR) was used to help the hearing impaired observe gestures and lip movements of the speaker simultaneously with the transcriptions.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • IWSLT (EMNLP) 2018 • Umut Sulubacak, Jörg Tiedemann, Aku Rouhe, Stig-Arne Grönroos, Mikko Kurimo
In this paper, we also describe the experiments leading up to our final systems.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+4
no code implementations • WS 2018 • Stig-Arne Grönroos, Sami Virpioja, Mikko Kurimo
This article describes the Aalto University entry to the WMT18 News Translation Shared Task.
no code implementations • WS 2018 • Stig-Arne Grönroos, Benoit Huet, Mikko Kurimo, Jorma Laaksonen, Bernard Merialdo, Phu Pham, Mats Sjöberg, Umut Sulubacak, Jörg Tiedemann, Raphael Troncy, Raúl Vázquez
Our experiments show that the effect of the visual features in our system is small.
no code implementations • 13 Jul 2017 • Seppo Enarvi, Peter Smit, Sami Virpioja, Mikko Kurimo
Today, the vocabulary size for language models in large vocabulary speech recognition is typically several hundreds of thousands of words.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
no code implementations • 3 May 2016 • Seppo Enarvi, Mikko Kurimo
We present a new tool for training neural network language models (NNLMs), scoring sentences, and generating text.
English Conversational Speech Recognition
Language Modelling
+1
1 code implementation • LREC 2014 • Matti Varjokallio, Mikko Kurimo
String segmentation is an important and recurring problem in natural language processing and other domains.