no code implementations • NoDaLiDa 2021 • Tuomas Kaseva, Hemant Kumar Kathania, Aku Rouhe, Mikko Kurimo
For children, the system trained on a large corpus of adult speakers performed worse than a system trained on a much smaller corpus of children’s speech.
no code implementations • NoDaLiDa 2021 • Hemant Kumar Kathania, Sudarsana Reddy Kadiri, Paavo Alku, Mikko Kurimo
The proposed method is used to improve the speech intelligibility to enhance the children’s speech recognition using an acoustic model trained on adult speech.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
1 code implementation • 16 Nov 2024 • Arnab Kumar Roy, Hemant Kumar Kathania, Adhitiya Sharma
In this paper, we tackle the issue of data imbalance by incorporating synthetic data augmentation and leveraging the ResEmoteNet model to enhance the overall performance on facial emotion recognition task.
1 code implementation • 1 Sep 2024 • Arnab Kumar Roy, Hemant Kumar Kathania, Adhitiya Sharma, Abhishek Dey, Md. Sarfaraj Alam Ansari
In this work, we propose ResEmoteNet, a novel deep learning architecture for facial emotion recognition designed with the combination of Convolutional, Squeeze-Excitation (SE) and Residual Networks.
Ranked #1 on
Facial Expression Recognition (FER)
on RAF-DB
(using extra training data)
Facial Emotion Recognition
Facial Expression Recognition (FER)