Search Results for author: Anil Kumar Singh

Found 39 papers, 5 papers with code

Transformer-based Neural Machine Translation System for Hindi – Marathi: WMT20 Shared Task

no code implementations WMT (EMNLP) 2020 Amit Kumar, Rupjyoti Baruah, Rajesh Kumar Mundotiya, Anil Kumar Singh

This paper reports the results for the Machine Translation (MT) system submitted by the NLPRL team for the Hindi – Marathi Similar Translation Task at WMT 2020.

Machine Translation NMT +1

Unsupervised Approach for Zero-Shot Experiments: Bhojpuri–Hindi and Magahi–Hindi@LoResMT 2020

no code implementations loresmt (AACL) 2020 Amit Kumar, Rajesh Kumar Mundotiya, Anil Kumar Singh

This paper reports a Machine Translation (MT) system submitted by the NLPRL team for the Bhojpuri–Hindi and Magahi–Hindi language pairs at LoResMT 2020 shared task.

Machine Translation Translation +1

Improving Abstractive Summarization with Commonsense Knowledge

no code implementations RANLP 2021 Pranav Nair, Anil Kumar Singh

Large scale pretrained models have demonstrated strong performances on several natural language generation and understanding benchmarks.

Abstractive Text Summarization Text Generation

Parsing Indian English News Headlines

no code implementations ICON 2020 Samapika Roy, Sukhada Sukhada, Anil Kumar Singh

While the creativity seen in news headlines (NHs) is fascinating for language researchers, it poses a computational challenge for Natural Language Processing researchers.

Emotion Classification World Knowledge

Machine Translation by Projecting Text into the Same Phonetic-Orthographic Space Using a Common Encoding

no code implementations 21 May 2023 Amit Kumar, Shantipriya Parida, Ajay Pratap, Anil Kumar Singh

One reason for this is the relative morphological richness of Indian languages, while another is that most of them fall into the extremely low resource or zero-shot categories.

Machine Translation NMT +1

Exploiting Language Relatedness in Machine Translation Through Domain Adaptation Techniques

no code implementations 3 Mar 2023 Amit Kumar, Rupjyoti Baruah, Ajay Pratap, Mayank Swarnkar, Anil Kumar Singh

If the evaluation is as rigorous as resource-rich languages, both Neural Machine Translation (NMT) and Statistical Machine Translation (SMT) can produce good results with such large amounts of data.

Domain Adaptation Language Modelling +4

Linguistic Resources for Bhojpuri, Magahi and Maithili: Statistics about them, their Similarity Estimates, and Baselines for Three Applications

no code implementations 29 Apr 2020 Rajesh Kumar Mundotiya, Manish Kumar Singh, Rahul Kapur, Swasti Mishra, Anil Kumar Singh

Corpus preparation for low-resource languages and for development of human language technology to analyze or computationally process them is a laborious task, primarily due to the unavailability of expert linguists who are native speakers of these languages and also due to the time and resources required.

Chunking POS +1

Learning cross-lingual phonological and orthagraphic adaptations: a case study in improving neural machine translation between low-resource languages

1 code implementation 21 Nov 2018 Saurav Jha, Akhilesh Sudhakar, Anil Kumar Singh

Out-of-vocabulary (OOV) words can pose serious challenges for machine translation (MT) tasks, and in particular, for low-resource language (LRL) pairs, i.e., language pairs for which few or no parallel corpora exist.

Machine Translation NMT +1

Multi Task Deep Morphological Analyzer: Context Aware Joint Morphological Tagging and Lemma Prediction

1 code implementation 21 Nov 2018 Saurav Jha, Akhilesh Sudhakar, Anil Kumar Singh

The ambiguities introduced by the recombination of morphemes constructing several possible inflections for a word make the prediction of syntactic traits in Morphologically Rich Languages (MRLs) a notoriously complicated task.

Dependency Parsing LEMMA +5

Language Identification in Code-Mixed Data using Multichannel Neural Networks and Context Capture

no code implementations WS 2018 Soumil Mandal, Anil Kumar Singh

An accurate language identification tool is an absolute necessity for building complex NLP systems to be used on code-mixed data.

Language Identification

How emotional are you? Neural Architectures for Emotion Intensity Prediction in Microblogs

1 code implementation COLING 2018 Devang Kulshreshtha, Pranav Goel, Anil Kumar Singh

Social media based micro-blogging sites like Twitter have become a common source of real-time information (impacting organizations and their strategies), and are used for expressing emotions and opinions.

Multi-Task Learning

IIT (BHU) System for Indo-Aryan Language Identification (ILI) at VarDial 2018

no code implementations COLING 2018 Divyanshu Gupta, Gourav Dhakad, Jayprakash Gupta, Anil Kumar Singh

Text language Identification is a Natural Language Processing task of identifying and recognizing a given language out of many different languages from a piece of text.

Language Identification Machine Translation +2

IIT (BHU) Submission for the ACL Shared Task on Named Entity Recognition on Code-switched Data

no code implementations WS 2018 Shashwat Trivedi, Harsh Rangwani, Anil Kumar Singh

This paper describes the best performing system for the shared task on Named Entity Recognition (NER) on code-switched data for the language pair Spanish-English (ENG-SPA).

named-entity-recognition Named Entity Recognition +2

Di-LSTM Contrast : A Deep Neural Network for Metaphor Detection

no code implementations WS 2018 Krishnkant Swarnkar, Anil Kumar Singh

The contrast between the contextual and general meaning of a word serves as an important clue for detecting its metaphoricity.

POS Topic Models +1

Ethical Questions in NLP Research: The (Mis)-Use of Forensic Linguistics

no code implementations 20 Dec 2017 Anil Kumar Singh, Akhilesh Sudhakar

Ideas from forensic linguistics are now being used frequently in Natural Language Processing (NLP), using machine learning techniques.

BIG-bench Machine Learning

IIT (BHU): System Description for LSDSem'17 Shared Task

no code implementations WS 2017 Pranav Goel, Anil Kumar Singh

This paper describes an ensemble system submitted as part of the LSDSem Shared Task 2017 - the Story Cloze Test.

Cloze Test Common Sense Reasoning +3

A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation

no code implementations LREC 2012 Anil Kumar Singh

The usefulness of annotated corpora is greatly increased if there is an associated tool that can allow various kinds of operations to be performed in a simple way.

Chunking Part-Of-Speech Tagging
