Search Results for author: Debanjan Mahata

Found 33 papers, 10 papers with code

GupShup: Summarizing Open-Domain Code-Switched Conversations

1 code implementation • EMNLP 2021 • Laiba Mehnaz, Debanjan Mahata, Rakesh Gosangi, Uma Sushmitha Gunturi, Riya Jain, Gauri Gupta, Amardeep Kumar, Isabelle G. Lee, Anish Acharya, Rajiv Ratn Shah

Code-switching is the communication phenomenon where the speakers switch between different languages during a conversation.

Abstractive Text Summarization

Paper
Code

A Preliminary Exploration of GANs for Keyphrase Generation

no code implementations • EMNLP 2020 • Avinash Swaminathan, Haimin Zhang, Debanjan Mahata, Rakesh Gosangi, Rajiv Ratn Shah, Amanda Stent

We observed that our model achieves state-of-the-art performance in the generation of abstractive keyphrases and is comparable to the best performing extractive techniques.

Keyphrase Generation

Paper
Add Code

Enhancing Keyphrase Extraction from Long Scientific Documents using Graph Embeddings

1 code implementation • 16 May 2023 • Roberto Martínez-Cruz, Debanjan Mahata, Alvaro J. López-López, José Portela

In this study, we investigate using graph neural network (GNN) representations to enhance contextualized representations of pre-trained language models (PLMs) for keyphrase extraction from lengthy documents.

Keyphrase Extraction

Paper
Code

LDKP: A Dataset for Identifying Keyphrases from Long Scientific Documents

no code implementations • 29 Mar 2022 • Debanjan Mahata, Navneet Agarwal, Dibya Gautam, Amardeep Kumar, Swapnil Parekh, Yaman Kumar Singla, Anish Acharya, Rajiv Ratn Shah

Identifying keyphrases (KPs) from text documents is a fundamental task in natural language processing and information retrieval.

Information Retrieval Keyphrase Extraction +2

Paper
Add Code

On the Evaluation of Answer-Agnostic Paragraph-level Multi-Question Generation

1 code implementation • 9 Mar 2022 • Jishnu Ray Chowdhury, Debanjan Mahata, Cornelia Caragea

Second, we compare different strategies to utilize a pre-trained seq2seq model to generate and select a set of questions related to a given paragraph.

Question Generation Question-Generation

Paper
Code

Learning Rich Representation of Keyphrases from Text

1 code implementation • Findings (NAACL) 2022 • Mayank Kulkarni, Debanjan Mahata, Ravneet Arora, Rajarshi Bhowmik

In the discriminative setting, we introduce a new pre-training objective - Keyphrase Boundary Infilling with Replacement (KBIR), showing large gains in performance (upto 8. 16 points in F1) over SOTA, when the LM pre-trained using KBIR is fine-tuned for the task of keyphrase extraction.

Abstractive Text Summarization Keyphrase Extraction +6

Paper
Code

On the Use of Context for Predicting Citation Worthiness of Sentences in Scholarly Articles

no code implementations • NAACL 2021 • Rakesh Gosangi, Ravneet Arora, Mohsen Gheisarieha, Debanjan Mahata, Haimin Zhang

In this paper, we study the importance of context in predicting the citation worthiness of sentences in scholarly articles.

Sentence

Paper
Add Code

GupShup: An Annotated Corpus for Abstractive Summarization of Open-Domain Code-Switched Conversations

no code implementations • 17 Apr 2021 • Laiba Mehnaz, Debanjan Mahata, Rakesh Gosangi, Uma Sushmitha Gunturi, Riya Jain, Gauri Gupta, Amardeep Kumar, Isabelle Lee, Anish Acharya, Rajiv Ratn Shah

Towards this objective, we introduce abstractive summarization of Hindi-English code-switched conversations and develop the first code-switched conversation summarization dataset - GupShup, which contains over 6, 831 conversations in Hindi-English and their corresponding human-annotated summaries in English and Hindi-English.

Abstractive Text Summarization

Paper
Add Code

Get It Scored Using AutoSAS -- An Automated System for Scoring Short Answers

no code implementations • 21 Dec 2020 • Yaman Kumar, Swati Aggarwal, Debanjan Mahata, Rajiv Ratn Shah, Ponnurangam Kumaraguru, Roger Zimmermann

In this paper, we present a fast, scalable, and accurate approach towards automated Short Answer Scoring (SAS).

Paper
Add Code

Two-Step Classification using Recasted Data for Low Resource Settings

2 code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Shagun Uppal, Vivek Gupta, Avinash Swaminathan, Haimin Zhang, Debanjan Mahata, Rakesh Gosangi, Rajiv Ratn Shah, Amanda Stent

We further improve the performance by using a joint-objective for classification and textual entailment.

Natural Language Inference text-classification +2

Paper
Code

MIDAS at SemEval-2020 Task 10: Emphasis Selection using Label Distribution Learning and Contextual Embeddings

no code implementations • SEMEVAL 2020 • Sarthak Anand, Pradyumna Gupta, Hemant Yadav, Debanjan Mahata, Rakesh Gosangi, Haimin Zhang, Rajiv Ratn Shah

This paper presents our submission to the SemEval 2020 - Task 10 on emphasis selection in written text.

Sentence

Paper
Add Code

Semi-Supervised Iterative Approach for Domain-Specific Complaint Detection in Social Media

no code implementations • WS 2020 • Akash Gautam, Debanjan Mahata, Rakesh Gosangi, Rajiv Ratn Shah

These indicators are then used to expand the dataset.

Paper
Add Code

An Annotated Dataset of Discourse Modes in Hindi Stories

no code implementations • LREC 2020 • Swapnil Dhanwal, Hritwik Dutta, Hitesh Nankani, Nilay Shrivastava, Yaman Kumar, Junyi Jessy Li, Debanjan Mahata, Rakesh Gosangi, Haimin Zhang, Rajiv Ratn Shah, Am Stent, a

In this paper, we present a new corpus consisting of sentences from Hindi short stories annotated for five different discourse modes argumentative, narrative, descriptive, dialogic and informative.

Descriptive Sentence

Paper
Add Code

An Iterative Approach for Identifying Complaint Based Tweets in Social Media Platforms

no code implementations • 24 Jan 2020 • Gyanesh Anand, Akash Gautam, Puneet Mathur, Debanjan Mahata, Rajiv Ratn Shah, Ramit Sawhney

Twitter is a social media platform where users express opinions over a variety of issues.

Paper
Add Code

#MeTooMA: Multi-Aspect Annotations of Tweets Related to the MeToo Movement

no code implementations • 14 Dec 2019 • Akash Gautam, Puneet Mathur, Rakesh Gosangi, Debanjan Mahata, Ramit Sawhney, Rajiv Ratn Shah

In this paper, we present a dataset containing 9, 973 tweets related to the MeToo movement that were manually annotated for five different linguistic aspects: relevance, stance, hate speech, sarcasm, and dialogue acts.

Paper
Add Code

Keyphrase Extraction from Scholarly Articles as Sequence Labeling using Contextualized Embeddings

no code implementations • 19 Oct 2019 • Dhruva Sahrawat, Debanjan Mahata, Mayank Kulkarni, Haimin Zhang, Rakesh Gosangi, Amanda Stent, Agniv Sharma, Yaman Kumar, Rajiv Ratn Shah, Roger Zimmermann

In this paper, we formulate keyphrase extraction from scholarly articles as a sequence labeling task solved using a BiLSTM-CRF, where the words in the input text are represented using deep contextualized embeddings.

Keyphrase Extraction Word Embeddings

Paper
Add Code

BHAAV- A Text Corpus for Emotion Analysis from Hindi Stories

1 code implementation • 9 Oct 2019 • Yaman Kumar, Debanjan Mahata, Sagar Aggarwal, Anmol Chugh, Rajat Maheshwari, Rajiv Ratn Shah

In this paper, we introduce the first and largest Hindi text corpus, named BHAAV, which means emotions in Hindi, for analyzing emotions that a writer expresses through his characters in a story, as perceived by a narrator/reader.

Emotion Recognition Sentence

Paper
Code

Keyphrase Generation for Scientific Articles using GANs

1 code implementation • 24 Sep 2019 • Avinash Swaminathan, Raj Kuwar Gupta, Haimin Zhang, Debanjan Mahata, Rakesh Gosangi, Rajiv Ratn Shah

In this paper, we present a keyphrase generation approach using conditional Generative Adversarial Networks (GAN).

Keyphrase Generation

Paper
Code

\#YouToo? Detection of Personal Recollections of Sexual Harassment on Social Media

1 code implementation • ACL 2019 • Arijit Ghosh Chowdhury, Ramit Sawhney, Rajiv Ratn Shah, Debanjan Mahata

The availability of large-scale online social data, coupled with computational methods can help us answer fundamental questions relat- ing to our social lives, particularly our health and well-being.

Paper
Code

Speak up, Fight Back! Detection of Social Media Disclosures of Sexual Harassment

no code implementations • NAACL 2019 • Arijit Ghosh Chowdhury, Ramit Sawhney, Puneet Mathur, Debanjan Mahata, Rajiv Ratn Shah

The {\#}MeToo movement is an ongoing prevalent phenomenon on social media aiming to demonstrate the frequency and widespread of sexual harassment by providing a platform to speak narrate personal experiences of such harassment.

Classification General Classification +4

Paper
Add Code

SNAP-BATNET: Cascading Author Profiling and Social Network Graphs for Suicide Ideation Detection on Social Media

no code implementations • NAACL 2019 • Rohan Mishra, Pradyumn Prakhar Sinha, Ramit Sawhney, Debanjan Mahata, Puneet Mathur, Rajiv Ratn Shah

Suicide is a leading cause of death among youth and the use of social media to detect suicidal ideation is an active line of research.

Paper
Add Code

MobiVSR: A Visual Speech Recognition Solution for Mobile Devices

no code implementations • 10 May 2019 • Nilay Shrivastava, Astitwa Saxena, Yaman Kumar, Rajiv Ratn Shah, Debanjan Mahata, Amanda Stent

Visual speech recognition (VSR) is the task of recognizing spoken language from video input only, without any audio.

Lip Reading Quantization +2

Paper
Add Code

Identifying Offensive Posts and Targeted Offense from Twitter

no code implementations • 19 Apr 2019 • Haimin Zhang, Debanjan Mahata, Simra Shahid, Laiba Mehnaz, Sarthak Anand, Yaman Singla, Rajiv Ratn Shah, Karan Uppal

In this paper we present our approach and the system description for Sub-task A and Sub Task B of SemEval 2019 Task 6: Identifying and Categorizing Offensive Language in Social Media.

Paper
Add Code

Suggestion Mining from Online Reviews using ULMFiT

1 code implementation • 19 Apr 2019 • Sarthak Anand, Debanjan Mahata, Kartik Aggarwal, Laiba Mehnaz, Simra Shahid, Haimin Zhang, Yaman Kumar, Rajiv Ratn Shah, Karan Uppal

In this paper we present our approach and the system description for Sub Task A of SemEval 2019 Task 9: Suggestion Mining from Online Reviews and Forums.

General Classification Language Modelling +4

Paper
Code

Harnessing GANs for Zero-shot Learning of New Classes in Visual Speech Recognition

1 code implementation • 29 Jan 2019 • Yaman Kumar, Dhruva Sahrawat, Shubham Maheshwari, Debanjan Mahata, Amanda Stent, Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann

To solve this problem, we present a novel approach to zero-shot learning by generating new classes using Generative Adversarial Networks (GANs), and show how the addition of unseen class samples increases the accuracy of a VSR system by a significant margin of 27% and allows it to handle speaker-independent out-of-vocabulary phrases.

speech-recognition Visual Speech Recognition +1

Paper
Code

Kiki Kills: Identifying Dangerous Challenge Videos from Social Media

no code implementations • 2 Dec 2018 • Nupur Baghel, Yaman Kumar, Paavini Nanda, Rajiv Ratn Shah, Debanjan Mahata, Roger Zimmermann

There has been upsurge in the number of people participating in challenges made popular through social media channels.

Paper
Add Code

Did you take the pill? - Detecting Personal Intake of Medicine from Twitter

no code implementations • 3 Aug 2018 • Debanjan Mahata, Jasper Friedrichs, Rajiv Ratn Shah, Jing Jiang

We believe that the developed classifier has direct uses in the areas of psychology, health informatics, pharmacovigilance and affective computing for tracking moods, emotions and sentiments of patients expressing intake of medicine in social media.

Paper
Add Code

A Multimodal Approach to Predict Social Media Popularity

no code implementations • 16 Jul 2018 • Mayank Meghawat, Satyendra Yadav, Debanjan Mahata, Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann

In this work, we propose a multimodal dataset consisiting of content, context, and social information for popularity prediction.

Paper
Add Code

Theme-weighted Ranking of Keywords from Text Documents using Phrase Embeddings

no code implementations • 16 Jul 2018 • Debanjan Mahata, John Kuriakose, Rajiv Ratn Shah, Roger Zimmermann, John R. Talburt

Keyword extraction is a fundamental task in natural language processing that facilitates mapping of documents to a concise set of representative single and multi-word phrases.

Keyword Extraction

Paper
Add Code

Detecting Offensive Tweets in Hindi-English Code-Switched Language

no code implementations • WS 2018 • Puneet Mathur, Rajiv Shah, Ramit Sawhney, Debanjan Mahata

The paper focuses on the classification of offensive tweets written in Hinglish language, which is a portmanteau of the Indic language Hindi with the Roman script.

General Classification Hate Speech Detection +1

Paper
Add Code

Key2Vec: Automatic Ranked Keyphrase Extraction from Scientific Articles using Phrase Embeddings

no code implementations • NAACL 2018 • Debanjan Mahata, John Kuriakose, Rajiv Ratn Shah, Roger Zimmermann

Keyphrase extraction is a fundamental task in natural language processing that facilitates mapping of documents to a set of representative phrases.

Chunking Keyphrase Extraction +4

Paper
Add Code

#phramacovigilance - Exploring Deep Learning Techniques for Identifying Mentions of Medication Intake from Twitter

no code implementations • 16 May 2018 • Debanjan Mahata, Jasper Friedrichs, Hitkul, Rajiv Ratn Shah

Mining social media messages for health and drug related information has received significant interest in pharmacovigilance research.

Paper
Add Code

InfyNLP at SMM4H Task 2: Stacked Ensemble of Shallow Convolutional Neural Networks for Identifying Personal Medication Intake from Twitter

no code implementations • 21 Mar 2018 • Jasper Friedrichs, Debanjan Mahata, Shubham Gupta

This paper describes Infosys's participation in the "2nd Social Media Mining for Health Applications Shared Task at AMIA, 2017, Task 2".

General Classification Task 2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.