no code implementations • 30 Oct 2021 • Sourya Dipta Das, Ayan Basak, Soumil Mandal, Dipankar Das
Research on adversarial attacks are becoming widely popular in the recent years.
no code implementations • 21 May 2020 • Sourya Dipta Das, Soumil Mandal
In this article, we describe the system that we used for the memotion analysis challenge, which is Task 8 of SemEval-2020.
no code implementations • 9 Nov 2019 • Sainik Kumar Mahata, Soumil Mandal, Dipankar Das, Sivaji Bandyopadhyay
The use of multilingualism in the new generation is widespread in the form of code-mixed data on social media, and therefore a robust translation system is required for catering to the monolingual users, as well as for easier comprehension by language processing models.
no code implementations • 12 Dec 2018 • Sainik Kumar Mahata, Soumil Mandal, Dipankar Das, Sivaji Bandyopadhyay
All of the systems use English-Hindi and English-Bengali language pairs containing simple sentences as well as sentences of other complexity.
no code implementations • 16 Oct 2018 • Soumil Mandal, Sankalp Sanand
In recent years, substantial work has been done on language tagging of code-mixed data, but most of them use large amounts of data to build their models.
no code implementations • WS 2018 • Soumil Mandal, Anil Kumar Singh
An accurate language identification tool is an absolute necessity for building complex NLP systems to be used on code-mixed data.
no code implementations • WS 2018 • Soumil Mandal, Karthick Nanmaran
Building tools for code-mixed data is rapidly gaining popularity in the NLP research community as such data is exponentially rising on social media.
no code implementations • 11 Mar 2018 • Soumil Mandal, Sainik Kumar Mahata, Dipankar Das
To gather attention and encourage researchers to work on this crisis, we prepared gold standard Bengali-English code-mixed data with language and polarity tag for sentiment analysis purposes.
no code implementations • 10 Mar 2018 • Soumil Mandal, Sourya Dipta Das, Dipankar Das
Language identification of social media text still remains a challenging task due to properties like code-mixing and inconsistent phonetic transliterations.
no code implementations • 8 Jan 2018 • Soumil Mandal, Dipankar Das
We have also tested various models trained on code-mixed data, as well as English features and the highest accuracy of 72. 50% was obtained by a Support Vector Machine (SVM) model.