Search Results for author: Eliathamby Ambikairajah

Found 10 papers, 3 papers with code

Should Audio Front-ends be Adaptive? Comparing Learnable and Adaptive Front-ends

no code implementations5 Feb 2025 Qiquan Zhang, Buddhi Wickramasinghe, Eliathamby Ambikairajah, Vidhyasaharan Sethu, Haizhou Li

Hand-crafted features, such as Mel-filterbanks, have traditionally been the choice for many audio processing applications.

Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features

no code implementations5 Nov 2024 Hanyu Meng, Jeroen Breebaart, Jeremy Stoddard, Vidhyasaharan Sethu, Eliathamby Ambikairajah

Additionally, we introduce FOA-Conv3D, a novel back-end network for effectively utilising the SSCV feature with a 3D convolutional encoder.

Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction

1 code implementation31 Jul 2024 Jingyao Wu, Ting Dang, Vidhyasaharan Sethu, Eliathamby Ambikairajah

There has been a significant focus on modelling emotion ambiguity in recent years, with advancements made in representing emotions as distributions to capture ambiguity.

Time Series

Binaural Selective Attention Model for Target Speaker Extraction

no code implementations18 Jun 2024 Hanyu Meng, Qiquan Zhang, Xiangyu Zhang, Vidhyasaharan Sethu, Eliathamby Ambikairajah

The remarkable ability of humans to selectively focus on a target speaker in cocktail party scenarios is facilitated by binaural audio processing.

model Target Speaker Extraction

An Exploration of Length Generalization in Transformer-Based Speech Enhancement

no code implementations17 Jun 2024 Qiquan Zhang, Hongxu Zhu, Xinyuan Qian, Eliathamby Ambikairajah, Haizhou Li

In this paper, we conduct comprehensive experiments to explore the length generalization problem in speech enhancement with Transformer.

Position Speech Enhancement

Mamba in Speech: Towards an Alternative to Self-Attention

1 code implementation21 May 2024 Xiangyu Zhang, Qiquan Zhang, Hexin Liu, Tianyi Xiao, Xinyuan Qian, Beena Ahmed, Eliathamby Ambikairajah, Haizhou Li, Julien Epps

Moreover, experiments demonstrate the effectiveness of BiMamba as an alternative to the self-attention module in Transformer and its derivates, particularly for the semantic-aware task.

Mamba Speech Enhancement +3

A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information

no code implementations10 Aug 2021 Jingyao Wu, Ting Dang, Vidhyasaharan Sethu, Eliathamby Ambikairajah

We propose a Markovian framework referred to as Dynamic Ordinal Markov Model (DOMM) that makes use of both absolute and relative ordinal information, to improve speech based ordinal emotion prediction.

Prediction

An efficient and perceptually motivated auditory neural encoding and decoding algorithm for spiking neural networks

no code implementations3 Sep 2019 Zihan Pan, Yansong Chua, Jibin Wu, Malu Zhang, Haizhou Li, Eliathamby Ambikairajah

The neural encoding scheme, that we call Biologically plausible Auditory Encoding (BAE), emulates the functions of the perceptual components of the human auditory system, that include the cochlear filter bank, the inner hair cells, auditory masking effects from psychoacoustic models, and the spike neural encoding by the auditory nerve.

Benchmarking speech-recognition +1

Cannot find the paper you are looking for? You can Submit a new open access paper.