Search Results for author: Mathieu Fontaine

Found 9 papers, 3 papers with code

GLA-Grad: A Griffin-Lim Extended Waveform Generation Diffusion Model

no code implementations9 Feb 2024 Haocheng Liu, Teysir Baoueb, Mathieu Fontaine, Jonathan Le Roux, Gael Richard

Diffusion models are receiving a growing interest for a variety of signal generation tasks such as speech or music synthesis.

Online speaker diarization of meetings guided by speech separation

1 code implementation30 Jan 2024 Elio Gruttadauria, Mathieu Fontaine, Slim Essid

The results show that our system improves the state-of-the-art on the AMI headset mix, using no oracle information and under full evaluation (no collar and including overlapped speech).

Action Detection Activity Detection +3

DNN-Free Low-Latency Adaptive Speech Enhancement Based on Frame-Online Beamforming Powered by Block-Online FastMNMF

no code implementations22 Jul 2022 Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, Kazuyoshi Yoshii

Our DNN-free system leverages the posteriors of the latest source spectrograms given by block-online FastMNMF to derive the current source covariance matrices for frame-online beamforming.

blind source separation Speech Enhancement

Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments

1 code implementation15 Jul 2022 Kouhei Sekiguchi, Aditya Arie Nugraha, Yicheng Du, Yoshiaki Bando, Mathieu Fontaine, Kazuyoshi Yoshii

This paper describes the practical response- and performance-aware development of online speech enhancement for an augmented reality (AR) headset that helps a user understand conversations made in real noisy echoic environments (e. g., cocktail party).

blind source separation Speech Enhancement

Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation

no code implementations11 May 2022 Mathieu Fontaine, Kouhei Sekiguchi, Aditya Nugraha, Yoshiaki Bando, Kazuyoshi Yoshii

This paper describes heavy-tailed extensions of a state-of-the-art versatile blind source separation method called fast multichannel nonnegative matrix factorization (FastMNMF) from a unified point of view.

blind source separation Speech Enhancement

Cannot find the paper you are looking for? You can Submit a new open access paper.