no code implementations • 3 Nov 2022 • Sofoklis Kakouros, Themos Stafylakis, Ladislav Mosner, Lukas Burget
When recognizing emotions from speech, we encounter two common problems: how to optimally capture emotion-relevant information from the speech signal and how to best quantify or categorize the noisy subjective emotion labels.
no code implementations • 15 Oct 2022 • Themos Stafylakis, Ladislav Mosner, Sofoklis Kakouros, Oldrich Plchot, Lukas Burget, Jan Cernocky
Self-supervised learning of speech representations from large amounts of unlabeled data has enabled state-of-the-art results in several speech processing tasks.
no code implementations • 3 Oct 2022 • Junyi Peng, Oldrich Plchot, Themos Stafylakis, Ladislav Mosner, Lukas Burget, Jan Cernocky
In recent years, self-supervised learning paradigm has received extensive attention due to its great success in various down-stream tasks.
no code implementations • 19 Mar 2022 • Anna Silnova, Themos Stafylakis, Ladislav Mosner, Oldrich Plchot, Johan Rohdin, Pavel Matejka, Lukas Burget, Ondrej Glembek, Niko Brummer
In this paper, we analyze the behavior and performance of speaker embeddings and the back-end scoring model under domain and language mismatch.