Search Results for author: Lukas Burget

Found 20 papers, 4 papers with code

DiaCorrect: Error Correction Back-end For Speaker Diarization

1 code implementation15 Sep 2023 Jiangyu Han, Federico Landini, Johan Rohdin, Mireia Diez, Lukas Burget, Yuhang Cao, Heng Lu, Jan Cernocky

In this work, we propose an error correction framework, named DiaCorrect, to refine the output of a diarization system in a simple yet effective way.

Automatic Speech Recognition speaker-diarization +3

Stabilized training of joint energy-based models and their practical applications

no code implementations7 Mar 2023 Martin Sustek, Samik Sadhu, Lukas Burget, Hynek Hermansky, Jesus Villalba, Laureano Moro-Velazquez, Najim Dehak

The JEM training relies on "positive examples" (i. e. examples from the training data set) as well as on "negative examples", which are samples from the modeled distribution $p(x)$ generated by means of Stochastic Gradient Langevin Dynamics (SGLD).

Speech-based emotion recognition with self-supervised models using attentive channel-wise correlations and label smoothing

no code implementations3 Nov 2022 Sofoklis Kakouros, Themos Stafylakis, Ladislav Mosner, Lukas Burget

When recognizing emotions from speech, we encounter two common problems: how to optimally capture emotion-relevant information from the speech signal and how to best quantify or categorize the noisy subjective emotion labels.

Emotion Recognition

Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations

no code implementations15 Oct 2022 Themos Stafylakis, Ladislav Mosner, Sofoklis Kakouros, Oldrich Plchot, Lukas Burget, Jan Cernocky

Self-supervised learning of speech representations from large amounts of unlabeled data has enabled state-of-the-art results in several speech processing tasks.

Descriptive Self-Supervised Learning

DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction

1 code implementation27 Dec 2021 Jiangyu Han, Yanhua Long, Lukas Burget, Jan Cernocky

Particularly, we find that the Mixture-Remix fine-tuning with DPCCN significantly outperforms the TD-SpeakerBeam for unsupervised cross-domain TSE, with around 3. 5 dB SISNR improvement on target domain test set, without any source domain performance degradation.

Speech Extraction

Speaker embeddings by modeling channel-wise correlations

1 code implementation6 Apr 2021 Themos Stafylakis, Johan Rohdin, Lukas Burget

Speaker embeddings extracted with deep 2D convolutional neural networks are typically modeled as projections of first and second order statistics of channel-frequency pairs onto a linear layer, using either average or attentive pooling along the time axis.

Speaker Recognition Style Transfer

A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery

no code implementations4 Nov 2020 Bolaji Yusuf, Lucas Ondel, Lukas Burget, Jan Cernocky, Murat Saraclar

In the target language, we infer both the language and unit embeddings in an unsupervised manner, and in so doing, we simultaneously learn a subspace of units specific to that language and the units that dwell on it.

Acoustic Unit Discovery Clustering

A Technical Report: BUT Speech Translation Systems

no code implementations22 Oct 2020 Hari Krishna Vydana, Lukas Burget, Jan Cernocky

To reduce this performance degradation, we have jointly-trained ASR and MT modules with ASR objective as an auxiliary loss.

Translation

Self-supervised speaker embeddings

no code implementations6 Apr 2019 Themos Stafylakis, Johan Rohdin, Oldrich Plchot, Petr Mizera, Lukas Burget

Contrary to i-vectors, speaker embeddings such as x-vectors are incapable of leveraging unlabelled utterances, due to the classification loss over training speakers.

General Classification

How to Improve Your Speaker Embeddings Extractor in Generic Toolkits

no code implementations5 Nov 2018 Hossein Zeinali, Lukas Burget, Johan Rohdin, Themos Stafylakis, Jan Cernocky

Recently, speaker embeddings extracted with deep neural networks became the state-of-the-art method for speaker verification.

Speaker Verification

Spoken Pass-Phrase Verification in the i-vector Space

no code implementations28 Sep 2018 Hossein Zeinali, Lukas Burget, Hossein Sameti, Jan Cernocky

The task of spoken pass-phrase verification is to decide whether a test utterance contains the same phrase as given enrollment utterances.

Text-Dependent Speaker Verification

Residual Memory Networks: Feed-forward approach to learn long temporal dependencies

no code implementations6 Aug 2018 Murali Karthick Baskar, Martin Karafiat, Lukas Burget, Karel Vesely, Frantisek Grezl, Jan Honza Cernocky

In this paper we propose a residual memory neural network (RMN) architecture to model short-time dependencies using deep feed-forward layers having residual and time delayed connections.

Fast variational Bayes for heavy-tailed PLDA applied to i-vectors and x-vectors

1 code implementation24 Mar 2018 Anna Silnova, Niko Brummer, Daniel Garcia-Romero, David Snyder, Lukas Burget

We have recently introduced a fast scoring algorithm for a discriminatively trained HT-PLDA backend.

Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model

no code implementations27 Feb 2018 Niko Brummer, Anna Silnova, Lukas Burget, Themos Stafylakis

Embeddings in machine learning are low-dimensional representations of complex input patterns, with the property that simple geometric operations like Euclidean distances and dot products can be used for classification and comparison tasks.

Speaker Recognition

An Empirical Evaluation of Zero Resource Acoustic Unit Discovery

no code implementations5 Feb 2017 Chunxi Liu, Jinyi Yang, Ming Sun, Santosh Kesiraju, Alena Rott, Lucas Ondel, Pegah Ghahremani, Najim Dehak, Lukas Burget, Sanjeev Khudanpur

Acoustic unit discovery (AUD) is a process of automatically identifying a categorical acoustic unit inventory from speech and producing corresponding acoustic unit tokenizations.

Acoustic Unit Discovery

Cannot find the paper you are looking for? You can Submit a new open access paper.