1 code implementation • 27 Jun 2023 • Janek Ebbers, Reinhold Haeb-Umbach, Romain Serizel
It summarizes the system performance over a range of operating modes resulting from varying the decision threshold that is used to translate the system output scores into a binary detection output.
no code implementations • 5 Sep 2022 • Michael Kuhlmann, Fritz Seebauer, Janek Ebbers, Petra Wagner, Reinhold Haeb-Umbach
Disentangling speaker and content attributes of a speech signal into separate latent representations followed by decoding the content with an exchanged speaker representation is a popular approach for voice conversion, which can be trained with non-parallel and unlabeled speech data.
1 code implementation • 31 Jan 2022 • Janek Ebbers, Romain Serizel, Reinhold Haeb-Umbach
Performing an adequate evaluation of sound event detection (SED) systems is far from trivial and is still subject to ongoing research.
no code implementations • 4 May 2021 • Thomas Glarner, Janek Ebbers, Reinhold Häb-Umbach
Discovering speaker independent acoustic units purely from spoken input is known to be a hard problem.
1 code implementation • 11 Mar 2021 • Janek Ebbers, Reinhold Haeb-Umbach
It is trained to predict strong labels while using (predicted) tags, i. e., weak labels, as additional input.