Search Results for author: Tanvir Ahmad

Found 8 papers, 1 papers with code

Language Identification of Hindi-English tweets using code-mixed BERT

no code implementations2 Jul 2021 Mohd Zeeshan Ansari, M M Sufyan Beg, Tanvir Ahmad, Mohd Jazib Khan, Ghazali Wasim

Recently, models such as BERT have shown that using a large amount of unlabeled data, the pretrained language models are even more beneficial for learning common language representations.

Language Identification Transfer Learning

Language Lexicons for Hindi-English Multilingual Text Processing

no code implementations29 Jun 2021 Mohd Zeeshan Ansari, Tanvir Ahmad, Noaima Bari

Language Identification in textual documents is the process of automatically detecting the language contained in a document based on its content.

Language Identification

A Simple and Efficient Probabilistic Language model for Code-Mixed Text

no code implementations29 Jun 2021 M Zeeshan Ansari, Tanvir Ahmad, M M Sufyan Beg, Asma Ikram

The conventional natural language processing approaches are not accustomed to the social media text due to colloquial discourse and non-homogeneous characteristics.

Information Retrieval Language Identification +7

Feature Selection on Noisy Twitter Short Text Messages for Language Identification

no code implementations11 Jul 2020 Mohd Zeeshan Ansari, Tanvir Ahmad, Ana Fatima

We apply different feature selection algorithms across various learning algorithms in order to analyze the effect of the algorithm as well as the number of features on the performance of the task.

feature selection Language Identification

Cross Script Hindi English NER Corpus from Wikipedia

no code implementations8 Oct 2018 Mohd Zeeshan Ansari, Tanvir Ahmad, Md Arshad Ali

Such corpora may be of mixed lingual nature in which text is written using multiple languages predominantly using a single script only.

named-entity-recognition Named Entity Recognition +1

Cannot find the paper you are looking for? You can Submit a new open access paper.