no code implementations • 17 Jun 2025 • Tuan Nguyen, Huy-Dat Tran
Developing code-switched ASR systems is challenging due to language ambiguity and limited exposure to multilingual, code-switched data, while collecting such speech is costly.
no code implementations • 17 Jun 2025 • Long-Vu Hoang, Tuan Nguyen, Tran Huy Dat
This paper presents a novel non-invasive object classification approach using acoustic scattering, demonstrated through a case study on hair assessment.
no code implementations • 17 Jun 2025 • Tuan Nguyen, Huy-Dat Tran
Code-switching (CS), common in multilingual settings, presents challenges for ASR due to scarce and costly transcribed data caused by linguistic complexity.
no code implementations • 16 Jun 2025 • Tuan Nguyen, Long-Vu Hoang, Huy-Dat Tran
This paper presents our system for the MLC-SLM Challenge 2025, focusing on multilingual speech recognition and language modeling with large language models (LLMs).
1 code implementation • 23 May 2025 • Naseem Khan, Tuan Nguyen, Amine Bermak, Issa Khalil
The proliferation of sophisticated AI-generated deepfakes poses critical challenges for digital media authentication and societal security.
no code implementations • 16 Apr 2025 • Tingyang Sun, Tuan Nguyen, Ting He
Decentralized federated learning (DFL) is a promising machine learning paradigm for bringing artificial intelligence (AI) capabilities to the network edge.
no code implementations • 27 Jan 2025 • Long Nguyen, Huy Nguyen, Bao Khuu, Huy Luu, Huy Le, Tuan Nguyen, Tho Quan
Retrieving events from videos using text queries has become increasingly challenging due to the rapid growth of multimedia content.
no code implementations • 10 Oct 2024 • Tuan Nguyen, Corinne Fredouille, Alain Ghio, Mathieu Balaguer, Virginie Woisard
With the rise of SSL and ASR technologies, the Wav2Vec2 ASR-based model has been fine-tuned for automated speech disorder quality assessment tasks, yielding impressive results and setting a new baseline for Head and Neck Cancer speech contexts.
no code implementations • 8 Aug 2024 • Khanh Doan, Long Tung Vuong, Tuan Nguyen, Anh Tuan Bui, Quyen Tran, Thanh-Toan Do, Dinh Phung, Trung Le
Diffusion models (DM) have become fundamental components of generative models, excelling across various domains such as image creation, audio generation, and complex data interpolation.
no code implementations • 5 Aug 2024 • Cho-Chun Chiu, Tuan Nguyen, Ting He, Shiqiang Wang, Beom-Su Kim, Ki-Il Kim
These challenges make our problem fundamentally different from classical active learning, where unlabeled samples are free and labels can be queried in real time.
no code implementations • 5 Jul 2024 • Tuan Nguyen, Dung Thuy Nguyen, Khoa D Doan, Kok-Seng Wong
While our focus is on empirical analysis, we believe it can guide backdoor research toward more realistic settings, highlighting the crucial role of FL in building robust defenses against diverse backdoor threats.
1 code implementation • 25 Jun 2024 • Duc-Tuan Truong, Ruijie Tao, Tuan Nguyen, Hieu-Thi Luong, Kong Aik Lee, Eng Siong Chng
Recent synthetic speech detectors leveraging the Transformer model have superior performance compared to the convolutional neural network counterparts.
Ranked #4 on
Audio Deepfake Detection
on ASVspoof 2021
no code implementations • 6 May 2024 • Abhinav Agarwalla, Abhay Gupta, Alexandre Marques, Shubhra Pandit, Michael Goin, Eldar Kurtic, Kevin Leong, Tuan Nguyen, Mahmoud Salem, Dan Alistarh, Sean Lie, Mark Kurtz
We achieve this for the LLaMA-2 7B model by combining the SparseGPT one-shot pruning method and sparse pretraining of those models on a subset of the SlimPajama dataset mixed with a Python subset of The Stack dataset.
no code implementations • 29 Mar 2024 • Tuan Nguyen, Corinne Fredouille, Alain Ghio, Mathieu Balaguer, Virginie Woisard
Automatic speech quality assessment has raised more attention as an alternative or support to traditional perceptual clinical evaluation.
no code implementations • 29 Jan 2024 • Tuan Nguyen, Van Nguyen, Trung Le, He Zhao, Quan Hung Tran, Dinh Phung
Additionally, we propose minimizing class-aware Higher-order Moment Matching (HMM) to align the corresponding class regions on the source and target domains.
1 code implementation • 8 Jan 2024 • Ngoc-Hieu Nguyen, Tuan-Anh Nguyen, Tuan Nguyen, Vu Tien Hoang, Dung D. Le, Kok-Seng Wong
Federated Recommendation (FedRec) systems have emerged as a solution to safeguard users' data in response to growing regulatory concerns.
no code implementations • 1 Jan 2024 • Parul Gupta, Tuan Nguyen, Abhinav Dhall, Munawar Hayat, Trung Le, Thanh-Toan Do
The task of Visual Relationship Recognition (VRR) aims to identify relationships between two interacting objects in an image and is particularly challenging due to the widely-spread and highly imbalanced distribution of <subject, relation, object> triplets.
no code implementations • 10 Dec 2023 • Khanh Doan, Quyen Tran, Tung Lam Tran, Tuan Nguyen, Dinh Phung, Trung Le
To address this, we propose the Gradient Projection Class-Prototype Conditional Diffusion Model (GPPDM), a GR-based approach for continual learning that enhances image quality in generators and thus reduces the CF in classifiers.
no code implementations • 6 Nov 2023 • Tuan Nguyen, Hirotada Honda, Takashi Sano, Vinh Nguyen, Shugo Nakamura, Tan M. Nguyen
We propose the Kuramoto Graph Neural Network (KuramotoGNN), a novel class of continuous-depth graph neural networks (GNNs) that employs the Kuramoto model to mitigate the over-smoothing phenomenon, in which node features in GNNs become indistinguishable as the number of layers increases.
no code implementations • 6 Nov 2023 • Tuan Nguyen, Tam Nguyen, Vinh Nguyen, Tan M. Nguyen
$p$-Laplacian regularization, rooted in graph and image signal processing, introduces a parameter $p$ to control the regularization effect on these data.
no code implementations • 3 Mar 2023 • Thuy Dung Nguyen, Tuan Nguyen, Phi Le Nguyen, Hieu H. Pham, Khoa Doan, Kok-Seng Wong
Federated learning (FL) is a machine learning (ML) approach that allows the use of distributed data without compromising personal privacy.
2 code implementations • 20 Feb 2023 • Tuan Nguyen, Salima Mdhaffar, Natalia Tomashenko, Jean-François Bonastre, Yannick Estève
This paper presents a study on the use of federated learning to train an ASR model based on a wav2vec 2. 0 model pre-trained by self supervision.
no code implementations • 25 May 2022 • Daniel Campos, Alexandre Marques, Tuan Nguyen, Mark Kurtz, ChengXiang Zhai
Our experimentation shows that models that are pruned during pretraining using general domain masked language models can transfer to novel domains and tasks without extensive hyperparameter exploration or specialized approaches.
1 code implementation • 14 Mar 2022 • Eldar Kurtic, Daniel Campos, Tuan Nguyen, Elias Frantar, Mark Kurtz, Benjamin Fineran, Michael Goin, Dan Alistarh
We perform an in-depth study of the accuracy-compression trade-off for unstructured weight pruning of BERT models.
no code implementations • 29 Oct 2021 • Trung Le, Dat Do, Tuan Nguyen, Huy Nguyen, Hung Bui, Nhat Ho, Dinh Phung
We study the label shift problem between the source and target domains in general domain adaptation (DA) settings.
2 code implementations • 10 Oct 2021 • Tuan Nguyen, Hanh Pham, Truong Bui, Tan Nguyen, Duc Luong, Phong Nguyen
Both automatic and human evaluation demonstrated that our approach can generate poems that have better cohesion without losing the quality due to additional loss.
1 code implementation • 1 Oct 2021 • Van-Anh Nguyen, Tuan Nguyen, Trung Le, Quan Hung Tran, Dinh Phung
To address the second challenge, we propose to bridge the gap between the target domain and the mixture of source domains in the latent space via a generator or feature extractor.
1 code implementation • UAI 2021 • Tuan Nguyen, Trung Le, He Zhao, Quan Hung Tran, Truyen Nguyen, Dinh Phung
To this end, we propose in this paper a novel model for multi-source DA using the theory of optimal transport and imitation learning.
Imitation Learning
Multi-Source Unsupervised Domain Adaptation
+1
1 code implementation • ICCV 2021 • Van-Anh Nguyen, Tuan Nguyen, Trung Le, Quan Hung Tran, Dinh Phung
To address the second challenge, we propose to bridge the gap between the target domain and the mixture of source domains in the latent space via a generator or feature extractor.
Multi-Source Unsupervised Domain Adaptation
Unsupervised Domain Adaptation
no code implementations • ICLR 2019 • Tue Le, Tuan Nguyen, Trung Le, Dinh Phung, Paul Montague, Olivier De Vel, Lizhen Qu
Due to the sharp increase in the severity of the threat imposed by software vulnerabilities, the detection of vulnerabilities in binary code has become an important concern in the software industry, such as the embedded systems industry, and in the field of computer security.
no code implementations • 25 Apr 2019 • Kyongsik Yun, Luan Nguyen, Tuan Nguyen, Doyoung Kim, Sarah Eldin, Alexander Huyen, Thomas Lu, Edward Chow
We compared the performance between the auto-detection system and the human eye.