1 code implementation • EMNLP 2021 • Laiba Mehnaz, Debanjan Mahata, Rakesh Gosangi, Uma Sushmitha Gunturi, Riya Jain, Gauri Gupta, Amardeep Kumar, Isabelle G. Lee, Anish Acharya, Rajiv Ratn Shah
Code-switching is the communication phenomenon where the speakers switch between different languages during a conversation.
no code implementations • EMNLP 2020 • Avinash Swaminathan, Haimin Zhang, Debanjan Mahata, Rakesh Gosangi, Rajiv Ratn Shah, Amanda Stent
We observed that our model achieves state-of-the-art performance in the generation of abstractive keyphrases and is comparable to the best performing extractive techniques.
no code implementations • 29 Mar 2022 • Debanjan Mahata, Navneet Agarwal, Dibya Gautam, Amardeep Kumar, Swapnil Parekh, Yaman Kumar Singla, Anish Acharya, Rajiv Ratn Shah
Identifying keyphrases (KPs) from text documents is a fundamental task in natural language processing and information retrieval.
1 code implementation • 9 Mar 2022 • Jishnu Ray Chowdhury, Debanjan Mahata, Cornelia Caragea
Second, we compare different strategies to utilize a pre-trained seq2seq model to generate and select a set of questions related to a given paragraph.
1 code implementation • Findings (NAACL) 2022 • Mayank Kulkarni, Debanjan Mahata, Ravneet Arora, Rajarshi Bhowmik
In the discriminative setting, we introduce a new pre-training objective - Keyphrase Boundary Infilling with Replacement (KBIR), showing large gains in performance (upto 8. 16 points in F1) over SOTA, when the LM pre-trained using KBIR is fine-tuned for the task of keyphrase extraction.
no code implementations • NAACL 2021 • Rakesh Gosangi, Ravneet Arora, Mohsen Gheisarieha, Debanjan Mahata, Haimin Zhang
In this paper, we study the importance of context in predicting the citation worthiness of sentences in scholarly articles.
no code implementations • 17 Apr 2021 • Laiba Mehnaz, Debanjan Mahata, Rakesh Gosangi, Uma Sushmitha Gunturi, Riya Jain, Gauri Gupta, Amardeep Kumar, Isabelle Lee, Anish Acharya, Rajiv Ratn Shah
Towards this objective, we introduce abstractive summarization of Hindi-English code-switched conversations and develop the first code-switched conversation summarization dataset - GupShup, which contains over 6, 831 conversations in Hindi-English and their corresponding human-annotated summaries in English and Hindi-English.
no code implementations • 21 Dec 2020 • Yaman Kumar, Swati Aggarwal, Debanjan Mahata, Rajiv Ratn Shah, Ponnurangam Kumaraguru, Roger Zimmermann
In this paper, we present a fast, scalable, and accurate approach towards automated Short Answer Scoring (SAS).
2 code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Shagun Uppal, Vivek Gupta, Avinash Swaminathan, Haimin Zhang, Debanjan Mahata, Rakesh Gosangi, Rajiv Ratn Shah, Amanda Stent
We further improve the performance by using a joint-objective for classification and textual entailment.
no code implementations • SEMEVAL 2020 • Sarthak Anand, Pradyumna Gupta, Hemant Yadav, Debanjan Mahata, Rakesh Gosangi, Haimin Zhang, Rajiv Ratn Shah
This paper presents our submission to the SemEval 2020 - Task 10 on emphasis selection in written text.
no code implementations • WS 2020 • Akash Gautam, Debanjan Mahata, Rakesh Gosangi, Rajiv Ratn Shah
These indicators are then used to expand the dataset.
no code implementations • LREC 2020 • Swapnil Dhanwal, Hritwik Dutta, Hitesh Nankani, Nilay Shrivastava, Yaman Kumar, Junyi Jessy Li, Debanjan Mahata, Rakesh Gosangi, Haimin Zhang, Rajiv Ratn Shah, Am Stent, a
In this paper, we present a new corpus consisting of sentences from Hindi short stories annotated for five different discourse modes argumentative, narrative, descriptive, dialogic and informative.
no code implementations • 24 Jan 2020 • Gyanesh Anand, Akash Gautam, Puneet Mathur, Debanjan Mahata, Rajiv Ratn Shah, Ramit Sawhney
Twitter is a social media platform where users express opinions over a variety of issues.
no code implementations • 14 Dec 2019 • Akash Gautam, Puneet Mathur, Rakesh Gosangi, Debanjan Mahata, Ramit Sawhney, Rajiv Ratn Shah
In this paper, we present a dataset containing 9, 973 tweets related to the MeToo movement that were manually annotated for five different linguistic aspects: relevance, stance, hate speech, sarcasm, and dialogue acts.
no code implementations • 19 Oct 2019 • Dhruva Sahrawat, Debanjan Mahata, Mayank Kulkarni, Haimin Zhang, Rakesh Gosangi, Amanda Stent, Agniv Sharma, Yaman Kumar, Rajiv Ratn Shah, Roger Zimmermann
In this paper, we formulate keyphrase extraction from scholarly articles as a sequence labeling task solved using a BiLSTM-CRF, where the words in the input text are represented using deep contextualized embeddings.
1 code implementation • 9 Oct 2019 • Yaman Kumar, Debanjan Mahata, Sagar Aggarwal, Anmol Chugh, Rajat Maheshwari, Rajiv Ratn Shah
In this paper, we introduce the first and largest Hindi text corpus, named BHAAV, which means emotions in Hindi, for analyzing emotions that a writer expresses through his characters in a story, as perceived by a narrator/reader.
1 code implementation • 24 Sep 2019 • Avinash Swaminathan, Raj Kuwar Gupta, Haimin Zhang, Debanjan Mahata, Rakesh Gosangi, Rajiv Ratn Shah
In this paper, we present a keyphrase generation approach using conditional Generative Adversarial Networks (GAN).
1 code implementation • ACL 2019 • Arijit Ghosh Chowdhury, Ramit Sawhney, Rajiv Ratn Shah, Debanjan Mahata
The availability of large-scale online social data, coupled with computational methods can help us answer fundamental questions relat- ing to our social lives, particularly our health and well-being.
no code implementations • NAACL 2019 • Rohan Mishra, Pradyumn Prakhar Sinha, Ramit Sawhney, Debanjan Mahata, Puneet Mathur, Rajiv Ratn Shah
Suicide is a leading cause of death among youth and the use of social media to detect suicidal ideation is an active line of research.
no code implementations • NAACL 2019 • Arijit Ghosh Chowdhury, Ramit Sawhney, Puneet Mathur, Debanjan Mahata, Rajiv Ratn Shah
The {\#}MeToo movement is an ongoing prevalent phenomenon on social media aiming to demonstrate the frequency and widespread of sexual harassment by providing a platform to speak narrate personal experiences of such harassment.
no code implementations • 10 May 2019 • Nilay Shrivastava, Astitwa Saxena, Yaman Kumar, Rajiv Ratn Shah, Debanjan Mahata, Amanda Stent
Visual speech recognition (VSR) is the task of recognizing spoken language from video input only, without any audio.
no code implementations • 19 Apr 2019 • Haimin Zhang, Debanjan Mahata, Simra Shahid, Laiba Mehnaz, Sarthak Anand, Yaman Singla, Rajiv Ratn Shah, Karan Uppal
In this paper we present our approach and the system description for Sub-task A and Sub Task B of SemEval 2019 Task 6: Identifying and Categorizing Offensive Language in Social Media.
1 code implementation • 19 Apr 2019 • Sarthak Anand, Debanjan Mahata, Kartik Aggarwal, Laiba Mehnaz, Simra Shahid, Haimin Zhang, Yaman Kumar, Rajiv Ratn Shah, Karan Uppal
In this paper we present our approach and the system description for Sub Task A of SemEval 2019 Task 9: Suggestion Mining from Online Reviews and Forums.
1 code implementation • 29 Jan 2019 • Yaman Kumar, Dhruva Sahrawat, Shubham Maheshwari, Debanjan Mahata, Amanda Stent, Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann
To solve this problem, we present a novel approach to zero-shot learning by generating new classes using Generative Adversarial Networks (GANs), and show how the addition of unseen class samples increases the accuracy of a VSR system by a significant margin of 27% and allows it to handle speaker-independent out-of-vocabulary phrases.
no code implementations • 2 Dec 2018 • Nupur Baghel, Yaman Kumar, Paavini Nanda, Rajiv Ratn Shah, Debanjan Mahata, Roger Zimmermann
There has been upsurge in the number of people participating in challenges made popular through social media channels.
no code implementations • 3 Aug 2018 • Debanjan Mahata, Jasper Friedrichs, Rajiv Ratn Shah, Jing Jiang
We believe that the developed classifier has direct uses in the areas of psychology, health informatics, pharmacovigilance and affective computing for tracking moods, emotions and sentiments of patients expressing intake of medicine in social media.
no code implementations • 16 Jul 2018 • Mayank Meghawat, Satyendra Yadav, Debanjan Mahata, Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann
In this work, we propose a multimodal dataset consisiting of content, context, and social information for popularity prediction.
no code implementations • 16 Jul 2018 • Debanjan Mahata, John Kuriakose, Rajiv Ratn Shah, Roger Zimmermann, John R. Talburt
Keyword extraction is a fundamental task in natural language processing that facilitates mapping of documents to a concise set of representative single and multi-word phrases.
no code implementations • WS 2018 • Puneet Mathur, Rajiv Shah, Ramit Sawhney, Debanjan Mahata
The paper focuses on the classification of offensive tweets written in Hinglish language, which is a portmanteau of the Indic language Hindi with the Roman script.
no code implementations • NAACL 2018 • Debanjan Mahata, John Kuriakose, Rajiv Ratn Shah, Roger Zimmermann
Keyphrase extraction is a fundamental task in natural language processing that facilitates mapping of documents to a set of representative phrases.
no code implementations • 16 May 2018 • Debanjan Mahata, Jasper Friedrichs, Hitkul, Rajiv Ratn Shah
Mining social media messages for health and drug related information has received significant interest in pharmacovigilance research.
no code implementations • 21 Mar 2018 • Jasper Friedrichs, Debanjan Mahata, Shubham Gupta
This paper describes Infosys's participation in the "2nd Social Media Mining for Health Applications Shared Task at AMIA, 2017, Task 2".