Search Results for author: Soumil Mandal

Found 10 papers, 0 papers with code

Code-Mixed to Monolingual Translation Framework

no code implementations • 9 Nov 2019 • Sainik Kumar Mahata, Soumil Mandal, Dipankar Das, Sivaji Bandyopadhyay

The use of multilingualism in the new generation is widespread in the form of code-mixed data on social media, and therefore a robust translation system is required for catering to the monolingual users, as well as for easier comprehension by language processing models.

Language Modelling, Translation +1

SMT vs NMT: A Comparison over Hindi & Bengali Simple Sentences

no code implementations • 12 Dec 2018 • Sainik Kumar Mahata, Soumil Mandal, Dipankar Das, Sivaji Bandyopadhyay

All of the systems use English-Hindi and English-Bengali language pairs containing simple sentences as well as sentences of other complexity.

Machine Translation, NMT +2

Strategies for Language Identification in Code-Mixed Low Resource Languages

no code implementations • 16 Oct 2018 • Soumil Mandal, Sankalp Sanand

In recent years, substantial work has been done on language tagging of code-mixed data, but most approaches use large amounts of data to build their models.

Language Identification

Language Identification in Code-Mixed Data using Multichannel Neural Networks and Context Capture

no code implementations • WS 2018 • Soumil Mandal, Anil Kumar Singh

An accurate language identification tool is an absolute necessity for building complex NLP systems to be used on code-mixed data.

Language Identification

Normalization of Transliterated Words in Code-Mixed Data Using Seq2Seq Model & Levenshtein Distance

no code implementations • WS 2018 • Soumil Mandal, Karthick Nanmaran

Building tools for code-mixed data is rapidly gaining popularity in the NLP research community as such data is exponentially rising on social media.

Transliteration

Preparing Bengali-English Code-Mixed Corpus for Sentiment Analysis of Indian Languages

no code implementations • 11 Mar 2018 • Soumil Mandal, Sainik Kumar Mahata, Dipankar Das

To gather attention and encourage researchers to work on this crisis, we prepared gold standard Bengali-English code-mixed data with language and polarity tags for sentiment analysis purposes.

Sentiment Analysis, TAG

Language Identification of Bengali-English Code-Mixed data using Character & Phonetic based LSTM Models

no code implementations • 10 Mar 2018 • Soumil Mandal, Sourya Dipta Das, Dipankar Das

Language identification of social media text still remains a challenging task due to properties like code-mixing and inconsistent phonetic transliterations.

Language Identification

Analyzing Roles of Classifiers and Code-Mixed factors for Sentiment Identification

no code implementations • 8 Jan 2018 • Soumil Mandal, Dipankar Das

We have also tested various models trained on code-mixed data, as well as English features, and the highest accuracy of 72.50% was obtained by a Support Vector Machine (SVM) model.
