no code implementations • 23 Feb 2024 • Ismael Agchar, Ilja Baumann, Franziska Braun, Paula Andrea Perez-Toro, Korbinian Riedhammer, Sebastian Trump, Martin Ullrich
In recent years, machine learning, and in particular generative adversarial neural networks (GANs) and attention-based neural networks (transformers), have been successfully used to compose and generate music, both melodies and polyphonic pieces.
no code implementations • 30 May 2023 • Sebastian P. Bayerl, Dominik Wagner, Ilja Baumann, Florian Hönig, Tobias Bocklet, Elmar Nöth, Korbinian Riedhammer
Most stuttering detection and classification research has viewed stuttering as a multi-class classification problem or a binary detection task for each dysfluency type; however, this does not match the nature of stuttering, in which one dysfluency seldom comes alone but rather co-occurs with others.
no code implementations • 28 Oct 2022 • Ilja Baumann, Dominik Wagner, Franziska Braun, Sebastian P. Bayerl, Elmar Nöth, Korbinian Riedhammer, Tobias Bocklet
Recent findings show that pre-trained wav2vec 2. 0 models are reliable feature extractors for various speaker characteristics classification tasks.
no code implementations • 27 Oct 2022 • Dominik Wagner, Ilja Baumann, Franziska Braun, Sebastian P. Bayerl, Elmar Nöth, Korbinian Riedhammer, Tobias Bocklet
The detection of pathologies from speech features is usually defined as a binary classification task with one class representing a specific pathology and the other class representing healthy speech.
no code implementations • 16 Jun 2022 • Ilja Baumann, Dominik Wagner, Sebastian Bayerl, Tobias Bocklet
In this work, the task is to determine whether spoken nonwords have been uttered correctly.
no code implementations • 7 Apr 2022 • Sebastian P. Bayerl, Dominik Wagner, Ilja Baumann, Korbinian Riedhammer, Tobias Bocklet
Vocal fatigue refers to the feeling of tiredness and weakness of voice due to extended utilization.