Search Results for author: Charangan Vasantharajan

Found 5 papers, 2 papers with code

Adapting the Tesseract Open-Source OCR Engine for Tamil and Sinhala Legacy Fonts and Creating a Parallel Corpus for Tamil-Sinhala-English

1 code implementation • 13 Sep 2021 • Charangan Vasantharajan, Laksika Tharmalingam, Uthayasanker Thayasivam

Since Tamil and Sinhala are Low-Resource Languages, we improved the performance of Tesseract by employing LSTM-based training on more than 20 legacy fonts to recognize printed characters in these languages.

Optical Character Recognition (OCR)

Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts

1 code implementation • 24 Aug 2021 • Charangan Vasantharajan, Uthayasanker Thayasivam

The experimental results showed that ULMFiT is the best model for this task.

Language Identification • Transfer Learning

Hypers@DravidianLangTech-EACL2021: Offensive language identification in Dravidian code-mixed YouTube Comments and Posts

no code implementations • EACL (DravidianLangTech) 2021 • Charangan Vasantharajan, Uthayasanker Thayasivam

Code-mixed offensive content has been used pervasively in social media posts over the last few years.

Language Identification

Findings of the Sentiment Analysis of Dravidian Languages in Code-Mixed Text

no code implementations • 18 Nov 2021 • Bharathi Raja Chakravarthi, Ruba Priyadharshini, Sajeetha Thavareesan, Dhivya Chinnappa, Durairaj Thenmozhi, Elizabeth Sherly, John P. McCrae, Adeep Hande, Rahul Ponnusamy, Shubhanker Banerjee, Charangan Vasantharajan

We received 22 systems for Tamil-English, 15 systems for Malayalam-English, and 15 for Kannada-English.

Sentiment Analysis

TamilEmo: Finegrained Emotion Detection Dataset for Tamil

no code implementations • 9 Feb 2022 • Charangan Vasantharajan, Sean Benhur, Prasanna Kumar Kumarasen, Rahul Ponnusamy, Sathiyaraj Thangasamy, Ruba Priyadharshini, Thenmozhi Durairaj, Kanchana Sivanraju, Anbukkarasi Sampath, Bharathi Raja Chakravarthi, John Phillip McCrae

Our MURIL-base model has achieved a 0.60 macro average F1-score across our 3-class group dataset.

Emotion Recognition
