Search Results for author: Shoji Makino

Found 6 papers, 1 papers with code

Unrestricted Global Phase Bias-Aware Single-channel Speech Enhancement with Conformer-based Metric GAN

no code implementations13 Feb 2024 Shiqi Zhang, Zheng Qiu, Daiki Takeuchi, Noboru Harada, Shoji Makino

With the rapid development of neural networks in recent years, the ability of various networks to enhance the magnitude spectrum of noisy speech in the single-channel speech enhancement domain has become exceptionally outstanding.

Speech Enhancement

A computationally efficient semi-blind source separation based approach for nonlinear echo cancellation based on an element-wise iterative source steering

no code implementations14 Dec 2023 Kunxing Lu, Xianrui Wang, Tetsuya Ueda, Shoji Makino, Jingdong Chen

While the semi-blind source separation-based acoustic echo cancellation (SBSS-AEC) has received much research attention due to its promising performance during double-talk compared to the traditional adaptive algorithms, it suffers from system latency and nonlinear distortions.

Acoustic echo cancellation blind source separation

Speech Emotion Recognition Based on Self-Attention Weight Correction for Acoustic and Text Features

no code implementations IEEE Access 2022 JENNIFER SANTOSO, Takeshi Yamada, Kenkichi Ishizuka, Taiichi Hashimoto, Shoji Makino

Although there is a method to improve ASR performance in the presence of emotional speech, it requires the fine-tuning of ASR, which has a high computational cost and leads to the loss of cues important for determining the presence of emotion in speech segments, which can be helpful in SER.

Multimodal Emotion Recognition Speech Emotion Recognition +2

Fast MVAE: Joint separation and classification of mixed sources based on multichannel variational autoencoder with auxiliary classifier

no code implementations16 Dec 2018 Li Li, Hirokazu Kameoka, Shoji Makino

While MVAE is notable in its impressive source separation performance, the convergence-guaranteed optimization algorithm and that it allows us to estimate source-class labels simultaneously with source separation, there are still two major drawbacks, i. e., the high computational complexity and unsatisfactory source classification accuracy.

Classification General Classification

Semi-blind source separation with multichannel variational autoencoder

1 code implementation2 Aug 2018 Hirokazu Kameoka, Li Li, Shota Inoue, Shoji Makino

This paper proposes a multichannel source separation technique called the multichannel variational autoencoder (MVAE) method, which uses a conditional VAE (CVAE) to model and estimate the power spectrograms of the sources in a mixture.

blind source separation

Cannot find the paper you are looking for? You can Submit a new open access paper.