Search Results for author: Rohan Kumar Das

Found 14 papers, 4 papers with code

A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds

no code implementations • 18 May 2023 • Tanmay Khandelwal, Rohan Kumar Das

Sound event detection (SED) entails identifying the type of sound and estimating its temporal boundaries from acoustic signals.

Event Detection, Multi-Task Learning

Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs

no code implementations • 27 Oct 2022 • Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li

We study a novel speaker-encoder architecture and its training strategies for speaker recognition without using any identity labels.

Contrastive Learning, Self-Supervised Learning

MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances

no code implementations • 3 Feb 2022 • Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li

The time delay neural network (TDNN) represents one of the state-of-the-art neural solutions to text-independent speaker verification.

Text-Independent Speaker Verification

HLT-NUS SUBMISSION FOR 2020 NIST Conversational Telephone Speech SRE

3 code implementations • 12 Nov 2021 • Rohan Kumar Das, Ruijie Tao, Haizhou Li

This work briefly describes the system submitted by the Human Language Technology (HLT) Laboratory, National University of Singapore (NUS), to the 2020 NIST conversational telephone speech (CTS) speaker recognition evaluation (SRE).

Domain Adaptation, Speaker Recognition

Significance of Data Augmentation for Improving Cleft Lip and Palate Speech Recognition

no code implementations • 2 Oct 2021 • Protima Nomo Sudro, Rohan Kumar Das, Rohit Sinha, S. R. Mahadeva Prasanna

The automatic recognition of pathological speech, particularly from children with any articulatory impairment, is a challenging task due to various reasons.

Data Augmentation, speech-recognition

Speaker-Utterance Dual Attention for Speaker and Utterance Verification

no code implementations • 20 Aug 2020 • Tianchi Liu, Rohan Kumar Das, Maulik Madhavi, ShengMei Shen, Haizhou Li

The proposed SUDA features an attention mask mechanism to learn the interaction between the speaker and utterance information streams.

Speaker Verification

Generative x-vectors for text-independent speaker verification

no code implementations • 17 Sep 2018 • Longting Xu, Rohan Kumar Das, Emre Yilmaz, Jichen Yang, Haizhou Li

Speaker verification (SV) systems using deep neural network embeddings, the so-called x-vector systems, are becoming popular because their performance is superior to that of i-vector systems.

Test, Text-Independent Speaker Verification
