no code implementations • 1 Feb 2021 • Lasse Borgholt, Tycho Max Sylvester Tax, Jakob Drachmann Havtorn, Lars Maaløe, Christian Igel
We explore the performance of such systems without fine-tuning by training a state-of-the-art speech recognizer on the fixed representations from the computationally demanding wav2vec 2.0 framework.
1 code implementation • 1 Dec 2017 • Tycho Max Sylvester Tax, Jose Luis Diez Antich, Hendrik Purwins, Lars Maaløe
End-to-end neural network based approaches to audio modelling are generally outperformed by models trained on high-level data representations.
Environmental Sound Classification General Classification +1
no code implementations • 28 Nov 2017 • Marius Paraschiv, Lasse Borgholt, Tycho Max Sylvester Tax, Marco Singh, Lars Maaløe
Nontrivial connectivity has allowed the training of very deep networks by addressing the problem of vanishing gradients and offering a more efficient method of reusing parameters.
Automatic Speech Recognition (ASR) +3