1 code implementation • 10 Apr 2024 • Hanyu Meng, Vidhyasaharan Sethu, Eliathamby Ambikairajah
There is increasing interest in the use of the LEArnable Front-end (LEAF) in a variety of speech processing systems.
no code implementations • 18 Jan 2024 • Qiquan Zhang, Meng Ge, Hongxu Zhu, Eliathamby Ambikairajah, Qi Song, Zhaoheng Ni, Haizhou Li
The Transformer architecture has enabled recent progress in speech enhancement.
no code implementations • 10 Aug 2021 • Jingyao Wu, Ting Dang, Vidhyasaharan Sethu, Eliathamby Ambikairajah
We propose a Markovian framework, referred to as the Dynamic Ordinal Markov Model (DOMM), that makes use of both absolute and relative ordinal information to improve speech-based ordinal emotion prediction.
no code implementations • 3 Sep 2019 • Zihan Pan, Yansong Chua, Jibin Wu, Malu Zhang, Haizhou Li, Eliathamby Ambikairajah
The neural encoding scheme, which we call Biologically plausible Auditory Encoding (BAE), emulates the functions of the perceptual components of the human auditory system, which include the cochlear filter bank, the inner hair cells, auditory masking effects from psychoacoustic models, and neural spike encoding by the auditory nerve.