Search Results for author: Vidhyasaharan Sethu

Found 5 papers, 1 paper with code

Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio

no code implementations • 17 Oct 2023 • Antoni Dimitriadis, Siqi Pan, Vidhyasaharan Sethu, Beena Ahmed

Spatial HuBERT learns representations that outperform state-of-the-art single-channel speech representations on a variety of spatial downstream tasks, particularly in reverberant and noisy environments.

Representation Learning, Self-Supervised Learning

Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling

no code implementations • 21 Sep 2023 • Zheng Nan, Ting Dang, Vidhyasaharan Sethu, Beena Ahmed

Connectionist temporal classification (CTC) is commonly adopted for sequence modeling tasks like speech recognition, where it is necessary to preserve order between the input and target sequences.

Classification, speech-recognition, +1

A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information

no code implementations • 10 Aug 2021 • Jingyao Wu, Ting Dang, Vidhyasaharan Sethu, Eliathamby Ambikairajah

We propose a Markovian framework, referred to as the Dynamic Ordinal Markov Model (DOMM), that makes use of both absolute and relative ordinal information to improve speech-based ordinal emotion prediction.

The Ambiguous World of Emotion Representation

no code implementations • 1 Sep 2019 • Vidhyasaharan Sethu, Emily Mower Provost, Julien Epps, Carlos Busso, Nicholas Cummins, Shrikanth Narayanan

A key reason for this is the lack of a common mathematical framework to describe all the relevant elements of emotion representations.

Face Recognition, Speaker Verification, +2
