Search Results for author: Haolin Chen

Found 8 papers, 2 papers with code

Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting

no code implementations • 19 Feb 2024 • Haolin Chen, Philip N. Garner

Our results demonstrate that catastrophic forgetting can be overcome by our methods without degrading the fine-tuning performance, and using the Kronecker factored approximations produces a better preservation of the pre-training knowledge than the diagonal ones.

Language Modelling, Speech Synthesis, +1 more

Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes

no code implementations • 29 Nov 2023 • Pavel Korshunov, Haolin Chen, Philip N. Garner, Sebastien Marcel

From the publicly available speech dataset LibriTTS, we also created a separate database of only audio deepfakes LibriTTS-DF using several latest text to speech methods: YourTTS, Adaspeech, and TorToiSe.

Face Recognition, Face Swapping, +2 more

An investigation into the adaptability of a diffusion-based TTS model

no code implementations • 3 Mar 2023 • Haolin Chen, Philip N. Garner

Given the recent success of diffusion in producing natural-sounding synthetic speech, we investigate how diffusion can be used in speaker adaptive TTS.

HyperMixer: An MLP-based Low Cost Alternative to Transformers

3 code implementations • 7 Mar 2022 • Florian Mai, Arnaud Pannatier, Fabio Fehr, Haolin Chen, Francois Marelli, Francois Fleuret, James Henderson

We find that existing architectures such as MLPMixer, which achieves token mixing through a static MLP applied to each feature independently, are too detached from the inductive biases required for natural language understanding.

Natural Language Understanding

Stable and Compact Face Recognition via Unlabeled Data Driven Sparse Representation-Based Classification

no code implementations • 4 Nov 2021 • XiaoHui Yang, Zheng Wang, Huan Wu, Licheng Jiao, Yiming Xu, Haolin Chen

The proposed model aims to mine the hidden semantic information and intrinsic structure information of all available data, which is suitable for few labeled samples and proportion imbalance between labeled samples and unlabeled samples problems in frontal face recognition.

Face Recognition, Sparse Representation-based Classification

Can We Trust Deep Speech Prior?

no code implementations • 4 Nov 2020 • Ying Shi, Haolin Chen, Zhiyuan Tang, Lantian Li, Dong Wang, Jiqing Han

Recently, speech enhancement (SE) based on deep speech prior has attracted much attention, such as the variational auto-encoder with non-negative matrix factorization (VAE-NMF) architecture.

Speech Enhancement

Overcomplete order-3 tensor decomposition, blind deconvolution and Gaussian mixture models

no code implementations • 16 Jul 2020 • Haolin Chen, Luis Rademacher

We propose a new algorithm for tensor decomposition, based on Jennrich's algorithm, and apply our new algorithmic ideas to blind deconvolution and Gaussian mixture models.

Tensor Decomposition

CN-CELEB: a challenging Chinese speaker recognition dataset

2 code implementations • 31 Oct 2019 • Yue Fan, Jiawen Kang, Lantian Li, Kaicheng Li, Haolin Chen, Sitong Cheng, Pengyuan Zhang, Ziya Zhou, Yunqi Cai, Dong Wang

These datasets tend to deliver over optimistic performance and do not meet the request of research on speaker recognition in unconstrained conditions.

Speaker Recognition
