Search Results for author: Charangan Vasantharajan

Found 5 papers, 2 papers with code

Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts

1 code implementation • 24 Aug 2021 • Charangan Vasantharajan, Uthayasanker Thayasivam

The experimental results showed that ULMFiT is the best model for this task.

Language Identification Transfer Learning +2

Paper
Code

Adapting the Tesseract Open-Source OCR Engine for Tamil and Sinhala Legacy Fonts and Creating a Parallel Corpus for Tamil-Sinhala-English

1 code implementation • 13 Sep 2021 • Charangan Vasantharajan, Laksika Tharmalingam, Uthayasanker Thayasivam

Since Tamil and Sinhala are Low-Resource Languages, we improved the performance of Tesseract by employing LSTM-based training on more than 20 legacy fonts to recognize printed characters in these languages.

Optical Character Recognition (OCR)

Paper
Code

Findings of the Sentiment Analysis of Dravidian Languages in Code-Mixed Text

no code implementations • 18 Nov 2021 • Bharathi Raja Chakravarthi, Ruba Priyadharshini, Sajeetha Thavareesan, Dhivya Chinnappa, Durairaj Thenmozhi, Elizabeth Sherly, John P. McCrae, Adeep Hande, Rahul Ponnusamy, Shubhanker Banerjee, Charangan Vasantharajan

We received 22 systems for Tamil-English, 15 systems for Malayalam-English, and 15 for Kannada-English.

Sentiment Analysis

Paper
Add Code

TamilEmo: Finegrained Emotion Detection Dataset for Tamil

no code implementations • 9 Feb 2022 • Charangan Vasantharajan, Sean Benhur, Prasanna Kumar Kumarasen, Rahul Ponnusamy, Sathiyaraj Thangasamy, Ruba Priyadharshini, Thenmozhi Durairaj, Kanchana Sivanraju, Anbukkarasi Sampath, Bharathi Raja Chakravarthi, John Phillip McCrae

Our MURIL-base model has achieved a 0. 60 macro average F1-score across our 3-class group dataset.

Emotion Recognition

Paper
Add Code

Hypers@DravidianLangTech-EACL2021: Offensive language identification in Dravidian code-mixed YouTube Comments and Posts

no code implementations • EACL (DravidianLangTech) 2021 • Charangan Vasantharajan, Uthayasanker Thayasivam

Code-Mixed Offensive contents are used pervasively in social media posts in the last few years.

Language Identification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.