no code implementations • ICON 2020 • Hema Ala, Dipti Sharma
In this paper we present two Machine Learning algorithms namely Stochastic Gradient Descent and Multi Layer Perceptron to Identify the technical domain of given text as such text provides information about the specific domain.
no code implementations • RANLP 2021 • Sourav Kumar, Salil Aggarwal, Dipti Sharma
India is known as the land of many tongues and dialects.
no code implementations • RANLP 2021 • Hema Ala, Vandan Mujadia, Dipti Sharma
In this paper, we present a novel approachfor domain adaptation in Neural MachineTranslation which aims to improve thetranslation quality over a new domain. Adapting new domains is a highly challeng-ing task for Neural Machine Translation onlimited data, it becomes even more diffi-cult for technical domains such as Chem-istry and Artificial Intelligence due to spe-cific terminology, etc.
no code implementations • ICON 2020 • Hema Ala, Dipti Sharma
Adapting new domain is highly challenging task for Neural Machine Translation (NMT).
no code implementations • ICON 2020 • Hema Ala, Dipti Sharma
We present a Graph Based Approach to automatically extract domain specific terms from technical domains like Biochemistry, Communication, Computer Science and Law.
no code implementations • WMT (EMNLP) 2021 • Vandan Mujadia, Dipti Sharma
This paper describes the participation of team oneNLP (LTRC, IIIT-Hyderabad) for the WMT 2021 task, similar language translation.
no code implementations • ICON 2021 • Saujas Vaduguru, Partho Sarthi, Monojit Choudhury, Dipti Sharma
Learning linguistic generalizations from only a few examples is a challenging task.
1 code implementation • LREC 2022 • Harshita Sharma, Pruthwik Mishra, Dipti Sharma
Recently there has been a surge in this area of word problem solving in Chinese with the creation of large benchmark datastes.
no code implementations • LREC 2022 • Vandan Mujadia, Dipti Sharma
We present the Hindi-Telugu Parallel Corpus of different technical domains such as Natural Science, Computer Science, Law and Healthcare along with the General domain.
no code implementations • EMNLP (DISRPT) 2021 • Sahil Bakshi, Dipti Sharma
In this paper, we present a transformer based approach towards the automated identification of discourse unit segments and connectives.
no code implementations • WMT (EMNLP) 2020 • Vandan Mujadia, Dipti Sharma
This paper describes the participation of team F1toF6 (LTRC, IIIT-Hyderabad) for the WMT 2020 task, similar language translation.
no code implementations • ACL (WAT) 2021 • Sourav Kumar, Salil Aggarwal, Dipti Sharma
For the scope of this task, we have built multilingual systems for 20 translation directions namely English-Indic (one-to- many) and Indic-English (many-to-one).
no code implementations • ICON 2020 • Ashutosh Ranjan, Dipti Sharma, Radhika Krishnan
This paper is an attempt to study polarisation on social media data.
no code implementations • ICON 2019 • Niyati Bafna, Dipti Sharma
English-Hindi machine translation systems have difficulty interpreting verb phrase ellipsis (VPE) in English, and commit errors in translating sentences with VPE.
no code implementations • ICON 2019 • Pruthwik Mishra, Ayush Joshi, Dipti Sharma
In this paper we identify all aspects related to the gadget which are present on the reviews given online on various websites.
no code implementations • 1 Nov 2022 • Anusha Prakash, Arun Kumar, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, K V Vikram, Mano Ranjith Kumar M, Metilda Sagaya Mary, Mohammad Wajahat, Mohana N, Mudit Batra, Navina K, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda Sukhadia, Dipti Sharma, Hema Murthy, Pushpak Bhattacharya, S Umesh, Rajeev Sangal
Cross-lingual dubbing of lecture videos requires the transcription of the original audio, correction and removal of disfluencies, domain term discovery, text-to-text translation into the target language, chunking of text using target language rhythm, text-to-speech synthesis followed by isochronous lipsyncing to the original video.
no code implementations • ACL 2020 • Sneha Nallani, Manish Shrivastava, Dipti Sharma
We present a simple and effective dependency parser for Telugu, a morphologically rich, free word order language.
no code implementations • LREC 2020 • Sneha Nallani, Manish Shrivastava, Dipti Sharma
The available Paninian dependency treebank(s) for Telugu is annotated only with inter-chunk dependency relations and not all words of a sentence are part of the parse tree.
no code implementations • WS 2019 • Riya Pal, Dipti Sharma
We present a system for automating Semantic Role Labelling of Hindi-English code-mixed tweets.
no code implementations • WS 2019 • Riya Pal, Dipti Sharma
We present a data set of 1460 Hindi-English code-mixed tweets consisting of 20, 949 tokens labelled with Proposition Bank labels marking their semantic roles.
no code implementations • IJCNLP 2017 • Purvanshi Mehta, Pruthwik Mishra, Vinayak Athavale, Manish Shrivastava, Dipti Sharma
The worldstate and the query are processed separately in two different networks and finally, the networks are merged to predict the final operation.
no code implementations • WS 2017 • Riyaz A. Bhat, Irshad Bhat, Dipti Sharma
We investigate the problem of parsing conversational data of morphologically-rich languages such as Hindi where argument scrambling occurs frequently.
no code implementations • WS 2016 • Sharada Mohanty, Nehal J Wani, Manish Srivastava, Dipti Sharma
It models NLP systems an edge-labeled Directed Acyclic MultiGraph, and lets the user use publicly co-created modules in their own NLP applications irrespective of their technical proficiency in Natural Language Processing.
no code implementations • LREC 2016 • Maaz Anwar, Dipti Sharma
Our system achieves an accuracy of 86{\%} in identifying the arguments of a verb for Hindi and 75{\%} for Urdu.
no code implementations • LREC 2016 • Maaz Anwar, Riyaz Ahmad Bhat, Dipti Sharma, Ashwini Vaidya, Martha Palmer, Tafseer Ahmed Khan
The present size of this Propbank is around 180, 000 tokens which is double-propbanked by the two annotators for simple predicates.
no code implementations • LREC 2016 • Raveesh Motlani, Francis Tyers, Dipti Sharma
Morphological analysis is a fundamental task in natural-language processing, which is used in other NLP applications such as part-of-speech tagging, syntactic parsing, information retrieval, machine translation, etc.
no code implementations • LREC 2016 • Rohit Jain, Himanshu Sharma, Dipti Sharma
We report that the novel dependency features introduced have a higher impact on precision, in comparison to the syntactic features previously used for this task.
no code implementations • LREC 2014 • Kunal Sachdeva, Rishabh Srivastava, Sambhav Jain, Dipti Sharma
In this paper, we describe a Hindi to English statistical machine translation system and improve over the baseline using multiple translation models.