2 code implementations • 17 Nov 2022 • Gokul Karthik Kumar, Praveen S V, Pratyush Kumar, Mitesh M. Khapra, Karthik Nandakumar
We open-source all models on the Bhashini platform.
Ranked #1 on Speech Synthesis - Rajasthani on IndicTTS
1 code implementation • 12 Oct 2022 • Gokul Karthik Kumar, Karthik Nandakumar
A simple classifier based on the FIM representation is able to achieve state-of-the-art performance on the Hateful Memes Challenge (HMC) dataset with an AUROC of 85. 8, which even surpasses the human performance of 82. 65.
Ranked #1 on Meme Classification on Tamil Memes
2 code implementations • 11 May 2022 • Gokul Karthik Kumar, Sahal Shaji Mullappilly, Abhishek Singh Gehlot
However, the CNN feature maps still maintain the spatial relationship and we utilize this property to design self-supervised learning approaches to train the encoder of object detection transformers in pretraining and multi-task learning settings.
1 code implementation • DravidianLangTech (ACL) 2022 • Gokul Karthik Kumar, Abhishek Singh Gehlot, Sahal Shaji Mullappilly, Karthik Nandakumar
These models are pre-trained in a self-supervised fashion with a large English text corpus and further fine-tuned with a massive English QA dataset (e. g., SQuAD).