Search Results for author: Hexin Liu

Found 14 papers, 8 papers with code

Aligning Speech to Languages to Enhance Code-switching Speech Recognition

no code implementations • 9 Mar 2024 • Hexin Liu, Xiangyu Zhang, Leibny Paola Garcia, Andy W. H. Khong, Eng Siong Chng, Shinji Watanabe

Performance evaluation using large language models reveals the advantage of the linguistic hint by achieving 14.1% and 5.5% relative improvement on test sets of the ASRU and SEAME datasets, respectively.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +3

When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection

no code implementations • 17 Feb 2024 • Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps

In addition, this approach is not only valuable for the detection of depression but also represents a new perspective in enhancing the ability of LLMs to comprehend and process speech signals.

Depression Detection

Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model

no code implementations • 16 Feb 2024 • Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola Garcia, Eng Siong Chng, Lina Yao

Recently, Denoising Diffusion Probabilistic Models (DDPMs) have attained leading performances across a diverse range of generative tasks.

Denoising, Speech Enhancement, +1

A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors

1 code implementation • 27 Nov 2023 • Shuyue Stella Li, Beining Xu, Xiangyu Zhang, Hexin Liu, WenHan Chao, Leibny Paola Garcia

There is a positive correlation between PSR scores and ASR performance, suggesting that phonetic information extracted by monolingual SSL models can be used for downstream tasks in cross-lingual settings.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +3

Generative error correction for code-switching speech recognition using large language models

no code implementations • 17 Oct 2023 • Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Hexin Liu, Sabato Marco Siniscalchi, Eng Siong Chng

In this work, we propose to leverage large language models (LLMs) and lists of hypotheses generated by an ASR to address the CS problem.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +3

Enhancing Code-switching Speech Recognition with Interactive Language Biases

no code implementations • 29 Sep 2023 • Hexin Liu, Leibny Paola Garcia, Xiangyu Zhang, Andy W. H. Khong, Sanjeev Khudanpur

Languages usually switch within a multilingual speech signal, especially in a bilingual society.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +2

Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex

1 code implementation • 26 Sep 2023 • Ruixing Liang, Xiangyu Zhang, Qiong Li, Lai Wei, Hexin Liu, Avisha Kumar, Kelley M. Kempski Leadingham, Joshua Punnoose, Leibny Paola Garcia, Amir Manbachi

While significant advancements in artificial intelligence (AI) have catalyzed progress across various domains, its full potential in understanding visual perception remains underexplored.

Brain Computer Interface

MERLIon CCS Challenge Evaluation Plan

1 code implementation • 31 May 2023 • Leibny Paola Garcia Perera, Y. H. Victoria Chua, Hexin Liu, Fei Ting Woon, Andy W. H. Khong, Justin Dauwels, Sanjeev Khudanpur, Suzy J. Styles

This paper introduces the inaugural Multilingual Everyday Recordings- Language Identification on Code-Switched Child-Directed Speech (MERLIon CCS) Challenge, focused on developing robust language identification and language diarization systems that are reliable for non-standard, accented, spontaneous code-switched, child-directed speech collected via Zoom.

Language Identification, Task 2

Investigating model performance in language identification: beyond simple error statistics

1 code implementation • 30 May 2023 • Suzy J. Styles, Victoria Y. H. Chua, Fei Ting Woon, Hexin Liu, Leibny Paola Garcia Perera, Sanjeev Khudanpur, Andy W. H. Khong, Justin Dauwels

These overview metrics do not provide information about model performance at the level of individual speakers, recordings, or units of speech with different linguistic characteristics.

Language Identification

MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization

1 code implementation • 30 May 2023 • Victoria Y. H. Chua, Hexin Liu, Leibny Paola Garcia Perera, Fei Ting Woon, Jinyi Wong, Xiangyu Zhang, Sanjeev Khudanpur, Andy W. H. Khong, Justin Dauwels, Suzy J. Styles

To enhance the reliability and robustness of language identification (LID) and language diarization (LD) systems for heterogeneous populations and scenarios, there is a need for speech processing models to be trained on datasets that feature diverse language registers and speech patterns.

Language Identification

Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization

1 code implementation • 26 Oct 2022 • Hexin Liu, HaiHua Xu, Leibny Paola Garcia, Andy W. H. Khong, Yi He, Sanjeev Khudanpur

The comparison of the proposed methods indicates that incorporating language information is more effective than disentangling for reducing language confusion in CS speech.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +1

PQLM -- Multilingual Decentralized Portable Quantum Language Model for Privacy Protection

no code implementations • 6 Oct 2022 • Shuyue Stella Li, Xiangyu Zhang, Shu Zhou, Hongchao Shu, Ruixing Liang, Hexin Liu, Leibny Paola Garcia

In this work, we propose a highly Portable Quantum Language Model (PQLM) that can easily transmit information to downstream tasks on classical machines.

Language Modelling, Sentence Embedding, +3

PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification

1 code implementation • 23 Mar 2022 • Hexin Liu, Leibny Paola Garcia Perera, Andy W. H. Khong, Suzy J. Styles, Sanjeev Khudanpur

We propose a novel model to hierarchically incorporate phoneme and phonotactic information for language identification (LID) without requiring phoneme annotations for training.

Language Identification

Enhance Language Identification using Dual-mode Model with Knowledge Distillation

1 code implementation • 7 Mar 2022 • Hexin Liu, Leibny Paola Garcia Perera, Andy W. H. Khong, Justin Dauwels, Suzy J. Styles, Sanjeev Khudanpur

In this paper, we propose to employ a dual-mode framework on the x-vector self-attention (XSA-LID) model with knowledge distillation (KD) to enhance its language identification (LID) performance for both long and short utterances.

Knowledge Distillation, Language Identification
