Search Results for author: Pushpak Bhattacharyya

Found 369 papers, 39 papers with code

pyiwn: A Python based API to access Indian Language WordNets

no code implementations GWC 2018 Ritesh Panjwani, Diptesh Kanojia, Pushpak Bhattacharyya

Indian language WordNets have their individual web-based browsing interfaces along with a common interface for IndoWordNet.

Speech Synthesis

A Multi-task Model for Multilingual Trigger Detection and Classification

no code implementations ICON 2019 Sovan Kumar Sahoo, Saumajit Saha, Asif Ekbal, Pushpak Bhattacharyya

In this paper we present a deep multi-task learning framework for multilingual event and argument trigger detection and classification.

Classification Multi-Task Learning

A Deep Ensemble Framework for Fake News Detection and Multi-Class Classification of Short Political Statements

no code implementations ICON 2019 Arjun Roy, Kingshuk Basak, Asif Ekbal, Pushpak Bhattacharyya

Fake news, rumor, incorrect information, and misinformation detection are nowadays crucial issues as these might have serious consequences for our social fabrics.

Fake News Detection Misinformation +1

“So You Think You’re Funny?”: Rating the Humour Quotient in Standup Comedy

1 code implementation EMNLP 2021 Anirudh Mittal, Pranav Jeevan P, Prerak Gandhi, Diptesh Kanojia, Pushpak Bhattacharyya

The normalized duration (laughter duration divided by the clip duration) of laughter in each clip is used to compute this humour coefficient score on a five-point scale (0-4).

Modelling Source- and Target- Language Syntactic Information as Conditional Context in Interactive Neural Machine Translation

no code implementations EAMT 2020 Kamal Kumar Gupta, Rejwanul Haque, Asif Ekbal, Pushpak Bhattacharyya, Andy Way

In this study, we model source-language syntactic constituency parse and target-language syntactic descriptions in the form of supertags as conditional context for interactive prediction in neural MT (NMT).

Machine Translation NMT +1

Synthesizing Audio for Hindi WordNet

no code implementations GWC 2018 Diptesh Kanojia, Preethi Jyothi, Pushpak Bhattacharyya

We also develop voices using the existing implementations of the aforementioned systems, and (2) We use these voices to generate sample audios for randomly chosen words; manually evaluate the audio generated, and produce audio for all WordNet words using the winner voice model.

Speech Synthesis

Multiple Pivot Languages and Strategic Decoder Initialization Helps Neural Machine Translation

no code implementations loresmt (COLING) 2022 Shivam Mhaskar, Pushpak Bhattacharyya

In pivot-based transfer learning, the source to pivot and the pivot to target models are used to improve the performance of the source to target model.

Machine Translation Transfer Learning +1

Novelty Detection: A Perspective from Natural Language Processing

no code implementations CL (ACL) 2022 Tirthankar Ghosal, Tanik Saikh, Tameesh Biswas, Asif Ekbal, Pushpak Bhattacharyya

In this work, we build upon our earlier investigations for document-level novelty detection and present a comprehensive account of our efforts toward the problem.

Natural Language Inference Novelty Detection

Team IITP-AINLPML at WASSA 2022: Empathy Detection, Emotion Classification and Personality Detection

no code implementations WASSA (ACL) 2022 Soumitra Ghosh, Dhirendra Maurya, Asif Ekbal, Pushpak Bhattacharyya

Computational comprehension and identifying emotional components in language have been critical in enhancing human-computer connection in recent years.

Emotion Classification

BERT based Adverse Drug Effect Tweet Classification

no code implementations NAACL (SMM4H) 2021 Tanay Kayastha, Pranjal Gupta, Pushpak Bhattacharyya

This paper describes models developed for the Social Media Mining for Health (SMM4H) 2021 shared tasks.

Classification

Mapping it differently: A solution to the linking challenges

no code implementations GWC 2016 Meghna Singh, Rajita Shukla, Jaya Saraswati, Laxmi Kashyap, Diptesh Kanojia, Pushpak Bhattacharyya

This paper reports the work of creating bilingual mappings in English for certain synsets of Hindi wordnet, the need for doing this, the methods adopted and the tools created for the task.

Information Retrieval Retrieval +4

High, Medium or Low? Detecting Intensity Variation Among polar synonyms in WordNet

no code implementations GWC 2016 Raksha Sharma, Pushpak Bhattacharyya

For fine-grained sentiment analysis, we need to go beyond zero-one polarity and find a way to compare adjectives (synonyms) that share the same sense.

Sentiment Analysis

IndoWordNet Conversion to Web Ontology Language (OWL)

no code implementations GWC 2016 Apurva Nagvenkar, Jyoti Pawar, Pushpak Bhattacharyya

In this paper, we present a data representation of IndoWordNet in Web Ontology Language (OWL).

IIITBH-IITP@CL-SciSumm20, CL-LaySumm20, LongSumm20

no code implementations EMNLP (sdp) 2020 Saichethan Reddy, Naveen Saini, Sriparna Saha, Pushpak Bhattacharyya

In this paper, we present the IIIT Bhagalpur and IIT Patna team’s effort to solve the three shared tasks namely, CL-SciSumm 2020, CL-LaySumm 2020, LongSumm 2020 at SDP 2020.

SEPRG: Sentiment aware Emotion controlled Personalized Response Generation

no code implementations INLG (ACL) 2021 Mauajama Firdaus, Umang Jain, Asif Ekbal, Pushpak Bhattacharyya

We design a Transformer based Dialogue Generation framework, that generates responses that are sensitive to the emotion of the user and corresponds to the persona and sentiment as well.

Dialogue Generation Response Generation

HindiMD: A Multi-domain Corpora for Low-resource Sentiment Analysis

no code implementations LREC 2022 Mamta ., Asif Ekbal, Pushpak Bhattacharyya, Tista Saha, Alka Kumar, Shikha Srivastava

Social media platforms such as Twitter have evolved into a vast information sharing platform, allowing people from a variety of backgrounds and expertise to share their opinions on numerous events such as terrorism, narcotics and many other social issues.

Sentiment Analysis

Retrofitting of Pre-trained Emotion Words with VAD-dimensions and the Plutchik Emotions

no code implementations ICON 2021 Manasi Kulkarni, Pushpak Bhattacharyya

This is a lexicons based approach that uses the Valence, Arousal and Dominance (VAD) values, and the Plutchik’s emotions to incorporate the emotion information in pre-trained word embeddings using post-training processing.

Emotion Recognition Word Embeddings

Emotion Enriched Retrofitted Word Embeddings

no code implementations COLING 2022 Sapan Shah, Sreedhar Reddy, Pushpak Bhattacharyya

The retrofitted embeddings achieve better inter-cluster and intra-cluster distance for words having the same emotions, e. g., the joy cluster containing words like fun, happiness, etc., and the anger cluster with words like offence, rage, etc., as evaluated through different cluster quality metrics.

Sarcasm Detection Sentiment Analysis +1

Zero-shot Disfluency Detection for Indian Languages

no code implementations COLING 2022 Rohit Kundu, Preethi Jyothi, Pushpak Bhattacharyya

We present a detailed pipeline to synthetically generate disfluent text and create evaluation datasets for four Indian languages: Bengali, Hindi, Malayalam, and Marathi.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

A Scaled Encoder Decoder Network for Image Captioning in Hindi

no code implementations ICON 2021 Santosh Kumar Mishra, Sriparna Saha, Pushpak Bhattacharyya

The proposed method’s performance is compared with state-of-the-art methods in terms of BLEU scores and manual evaluation (in terms of adequacy and fluency).

Image Captioning

Product Description and QA Assisted Self-Supervised Opinion Summarization

no code implementations8 Apr 2024 Tejpalsingh Siledar, Rupasai Rangaraju, Sankara Sri Raghava Ravindra Muddu, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera, Swaprava Nath, Pushpak Bhattacharyya

For evaluation, due to the unavailability of test sets with additional sources, we extend the Amazon, Oposum+, and Flipkart test sets and leverage ChatGPT to annotate summaries.

Opinion Summarization

A Morphology-Based Investigation of Positional Encodings

no code implementations6 Apr 2024 Poulami Ghosh, Shikhar Vashishth, Raj Dabre, Pushpak Bhattacharyya

How does the importance of positional encoding in pre-trained language models (PLMs) vary across languages with different morphological complexity?

Dependency Parsing named-entity-recognition +3

Do Not Worry if You Do Not Have Data: Building Pretrained Language Models Using Translationese

no code implementations20 Mar 2024 Meet Doshi, Raj Dabre, Pushpak Bhattacharyya

In this paper, we explore the utility of Translationese as synthetic data created using machine translation for pre-training language models (LMs).

Machine Translation Natural Language Understanding

Material Microstructure Design Using VAE-Regression with Multimodal Prior

no code implementations27 Feb 2024 Avadhut Sardeshmukh, Sreedhar Reddy, BP Gautham, Pushpak Bhattacharyya

The resultant model can be used for both forward and inverse prediction i. e., for predicting the properties of a given microstructure as well as for predicting the microstructure required to obtain given properties.

Property Prediction regression

One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation

1 code implementation18 Feb 2024 Tejpalsingh Siledar, Swaroop Nath, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Swaprava Nath, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

Evaluation of opinion summaries using conventional reference-based metrics rarely provides a holistic evaluation and has been shown to have a relatively low correlation with human judgments.

nlg evaluation Opinion Summarization +1

Meme-ingful Analysis: Enhanced Understanding of Cyberbullying in Memes Through Multimodal Explanations

1 code implementation18 Jan 2024 Prince Jha, Krishanu Maity, Raghav Jain, Apoorv Verma, Sriparna Saha, Pushpak Bhattacharyya

A Contrastive Language-Image Pretraining (CLIP) projection-based multimodal shared-private multitask approach has been proposed for visual and textual explanation of a meme.

Explain Thyself Bully: Sentiment Aided Cyberbullying Detection with Explanation

1 code implementation17 Jan 2024 Krishanu Maity, Prince Jha, Raghav Jain, Sriparna Saha, Pushpak Bhattacharyya

While plenty of research is going on to develop better models for cyberbullying detection in monolingual language, there is very little research on the code-mixed languages and explainability aspect of cyberbullying.

Sentence Sentiment Analysis

PUB: A Pragmatics Understanding Benchmark for Assessing LLMs' Pragmatics Capabilities

no code implementations13 Jan 2024 Settaluri Lakshmi Sravanthi, Meet Doshi, Tankala Pavan Kalyan, Rudra Murthy, Pushpak Bhattacharyya, Raj Dabre

To demonstrate this fact, we release a Pragmatics Understanding Benchmark (PUB) dataset consisting of fourteen tasks in four pragmatics phenomena, namely, Implicature, Presupposition, Reference, and Deixis.

Instruction Following Multiple-choice

Yes, this is what I was looking for! Towards Multi-modal Medical Consultation Concern Summary Generation

1 code implementation10 Jan 2024 Abhisek Tiwari, Shreyangshu Bera, Sriparna Saha, Pushpak Bhattacharyya, Samrat Ghosh

Over the past few years, the use of the Internet for healthcare-related tasks has grown by leaps and bounds, posing a challenge in effectively managing and processing information to ensure its efficient utilization.

Intent Recognition

IndicIRSuite: Multilingual Dataset and Neural Information Models for Indian Languages

1 code implementation15 Dec 2023 Saiful Haq, Ashutosh Sharma, Pushpak Bhattacharyya

To the best of our knowledge, IndicIRSuite is the first attempt at building large-scale Neural Information Retrieval resources for a large number of Indian languages, and we hope that it will help accelerate research in Neural IR for Indian Languages.

Information Retrieval Machine Translation +1

Retrofitting Light-weight Language Models for Emotions using Supervised Contrastive Learning

no code implementations29 Oct 2023 Sapan Shah, Sreedhar Reddy, Pushpak Bhattacharyya

We present a novel retrofitting method to induce emotion aspects into pre-trained language models (PLMs) such as BERT and RoBERTa.

Contrastive Learning Few-Shot Learning +3

DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages

1 code implementation25 Oct 2023 Vineet Bhat, Preethi Jyothi, Pushpak Bhattacharyya

Towards the goal of multilingual disfluency correction, we present a high-quality human-annotated DC corpus covering four important Indo-European languages: English, Hindi, German and French.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection

no code implementations29 Sep 2023 Swapnil Bhosale, Abhra Chaudhuri, Alex Lee Robert Williams, Divyank Tiwari, Anjan Dutta, Xiatian Zhu, Pushpak Bhattacharyya, Diptesh Kanojia

The introduction of the MUStARD dataset, and its emotion recognition extension MUStARD++, have identified sarcasm to be a multi-modal phenomenon -- expressed not only in natural language text, but also through manners of speech (like tonality and intonation) and visual cues (facial expression).

Benchmarking Emotion Recognition +1

Experience and Evidence are the eyes of an excellent summarizer! Towards Knowledge Infused Multi-modal Clinical Conversation Summarization

1 code implementation27 Sep 2023 Abhisek Tiwari, Anisha Saha, Sriparna Saha, Pushpak Bhattacharyya, Minakshi Dhar

We propose a knowledge-infused, multi-modal, multi-tasking medical domain identification and clinical conversation summary generation (MM-CliConSummation) framework.

Hi Model, generating 'nice' instead of 'good' is not as bad as generating 'rice'! Towards Context and Semantic Infused Dialogue Generation Loss Function and Evaluation Metric

no code implementations11 Sep 2023 Abhisek Tiwari, Muhammed Sinan, Kaushik Roy, Amit Sheth, Sriparna Saha, Pushpak Bhattacharyya

These lexical-based metrics have the following key limitations: (a) word-to-word matching without semantic consideration: It assigns the same credit for failure to generate 'nice' and 'rice' for 'good'.

Attribute Dialogue Generation +1

"Beware of deception": Detecting Half-Truth and Debunking it through Controlled Claim Editing

no code implementations15 Aug 2023 Sandeep Singamsetty, Nishtha Madaan, Sameep Mehta, Varad Bhatnagar, Pushpak Bhattacharyya

To help combat this problem, we have created a comprehensive pipeline consisting of a half-truth detection model and a claim editing model.

KITLM: Domain-Specific Knowledge InTegration into Language Models for Question Answering

1 code implementation7 Aug 2023 Ankush Agarwal, Sakharam Gawade, Amar Prakash Azad, Pushpak Bhattacharyya

Our research contributes to advancing the field of domain-specific language understanding and showcases the potential of knowledge infusion techniques in improving the performance of language models on question-answering.

Language Modelling Multi-hop Question Answering +1

"Kurosawa": A Script Writer's Assistant

no code implementations6 Aug 2023 Prerak Gandhi, Vishal Pramanik, Pushpak Bhattacharyya

We fine-tune GPT-3 with the above datasets to generate plots and scenes.

Decision Knowledge Graphs: Construction of and Usage in Question Answering for Clinical Practice Guidelines

no code implementations6 Aug 2023 Vasudhan Varma Kandula, Pushpak Bhattacharyya

In this paper, we present a Decision Knowledge Graph (DKG) representation to store CPGs and to perform question-answering on CPGs.

Knowledge Graphs Question Answering

"We care": Improving Code Mixed Speech Emotion Recognition in Customer-Care Conversations

no code implementations6 Aug 2023 N V S Abhishek, Pushpak Bhattacharyya

Emotion recognition is essential in building robust conversational agents in domains such as law, healthcare, education, and customer support.

Speech Emotion Recognition

"A Little is Enough": Few-Shot Quality Estimation based Corpus Filtering improves Machine Translation

no code implementations6 Jun 2023 Akshay Batheja, Pushpak Bhattacharyya

To the best of our knowledge, this is a novel adaptation of the QE framework to extract quality parallel corpus from the pseudo-parallel corpus.

Machine Translation Sentence +2

"Let's not Quote out of Context": Unified Vision-Language Pretraining for Context Assisted Image Captioning

no code implementations1 Jun 2023 Abisek Rajakumar Kalarani, Pushpak Bhattacharyya, Niyati Chhaya, Sumit Shekhar

We exploit context by pretraining our model with datasets of three tasks: news image captioning where the news article is the context, contextual visual entailment, and keyword extraction from the context.

Image Captioning Keyword Extraction +2

A Match Made in Heaven: A Multi-task Framework for Hyperbole and Metaphor Detection

1 code implementation27 May 2023 Naveen Badathala, Abisek Rajakumar Kalarani, Tejpalsingh Siledar, Pushpak Bhattacharyya

Additionally, our multi-task learning (MTL) approach shows an improvement of up to 17% over single-task learning (STL) for both hyperbole and metaphor detection, supporting our hypothesis.

Multi-Task Learning

VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages

no code implementations21 May 2023 Shivam Mhaskar, Vineet Bhat, Akshay Batheja, Sourabh Deoghare, Paramveer Choudhary, Pushpak Bhattacharyya

In this work, we present our deployment-ready Speech-to-Speech Machine Translation (SSMT) system for English-Hindi, English-Marathi, and Hindi-Marathi language pairs.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Denoising-based UNMT is more robust to word-order divergence than MASS-based UNMT

no code implementations2 Mar 2023 Tamali Banerjee, Rudra Murthy V, Pushpak Bhattacharyya

We aim to investigate whether UNMT approaches with self-supervised pre-training are robust to word-order divergence between language pairs.

Denoising Translation

Improving Machine Translation with Phrase Pair Injection and Corpus Filtering

no code implementations19 Jan 2023 Akshay Batheja, Pushpak Bhattacharyya

In this paper, we show that the combination of Phrase Pair Injection and Corpus Filtering boosts the performance of Neural Machine Translation (NMT) systems.

Machine Translation NMT +1

Detecting Unintended Social Bias in Toxic Language Datasets

no code implementations21 Oct 2022 Nihar Sahoo, Himanshu Gupta, Pushpak Bhattacharyya

However, very little research has been done to detect unintended social bias from these toxic language datasets.

Knowledge Graph Construction and Its Application in Automatic Radiology Report Generation from Radiologist's Dictation

no code implementations13 Jun 2022 Kaveri Kale, Pushpak Bhattacharyya, Aditya Shetty, Milind Gune, Kush Shrivastava, Rustom Lawyer, Spriha Biswas

Then the transcriptionist prepares a preliminary formatted report referring to the notes, and finally, the radiologist reviews the report, corrects the errors, and signs off.

graph construction Semantic Similarity +1

Hollywood Identity Bias Dataset: A Context Oriented Bias Analysis of Movie Dialogues

no code implementations LREC 2022 Sandhya Singh, Prapti Roy, Nihar Sahoo, Niteesh Mallela, Himanshu Gupta, Pushpak Bhattacharyya, Milind Savagaonkar, Nidhi, Roshni Ramnani, Anutosh Maitra, Shubhashis Sengupta

Since AI solutions are data intensive and there exists no domain specific data to address the problem of biases in scripts, we introduce a new dataset of movie scripts that are annotated for identity bias.

Am I No Good? Towards Detecting Perceived Burdensomeness and Thwarted Belongingness from Suicide Notes

no code implementations20 May 2022 Soumitra Ghosh, Asif Ekbal, Pushpak Bhattacharyya

The World Health Organization (WHO) has emphasized the importance of significantly accelerating suicide prevention efforts to fulfill the United Nations' Sustainable Development Goal (SDG) objective of 2030.

HiNER: A Large Hindi Named Entity Recognition Dataset

1 code implementation LREC 2022 Rudra Murthy, Pallab Bhattacharjee, Rahul Sharnagat, Jyotsana Khatri, Diptesh Kanojia, Pushpak Bhattacharyya

We use different language models to perform the sequence labelling task for NER and show the efficacy of our data by performing a comparative evaluation with models trained on another dataset available for the Hindi NER task.

named-entity-recognition Named Entity Recognition +2

Indian Language Wordnets and their Linkages with Princeton WordNet

no code implementations LREC 2018 Diptesh Kanojia, Kevin Patel, Pushpak Bhattacharyya

Linked wordnets are extensions of wordnets, which link similar concepts in wordnets of different languages.

Strategies of Effective Digitization of Commentaries and Sub-commentaries: Towards the Construction of Textual History

no code implementations5 Jan 2022 Diptesh Kanojia, Malhar Kulkarni, Sayali Ghodekar, Eivind Kahrs, Pushpak Bhattacharyya

We use the text of the K\=a\'sik\=avrtti (KV) as a sample text, and with the help of philologists, we digitize the commentaries available to us.

Utilizing Wordnets for Cognate Detection among Indian Languages

no code implementations GWC 2019 Diptesh Kanojia, Kevin Patel, Pushpak Bhattacharyya, Malhar Kulkarni, Gholamreza Haffari

Automatic Cognate Detection (ACD) is a challenging task which has been utilized to help NLP applications like Machine Translation, Information Retrieval and Computational Phylogenetics.

Information Retrieval Machine Translation +1

Challenge Dataset of Cognates and False Friend Pairs from Indian Languages

1 code implementation LREC 2020 Diptesh Kanojia, Pushpak Bhattacharyya, Malhar Kulkarni, Gholamreza Haffari

In this paper, we describe the creation of two cognate datasets for twelve Indian languages, namely Sanskrit, Hindi, Assamese, Oriya, Kannada, Gujarati, Tamil, Telugu, Punjabi, Bengali, Marathi, and Malayalam.

Information Retrieval Machine Translation +2

Tapping BERT for Preposition Sense Disambiguation

no code implementations27 Nov 2021 Siddhesh Pawar, Shyam Thombre, Anirudh Mittal, Girishkumar Ponkiya, Pushpak Bhattacharyya

In this paper, we propose a novel methodology for preposition sense disambiguation (PSD), which does not use any linguistic tools.

Question Answering

"So You Think You're Funny?": Rating the Humour Quotient in Standup Comedy

1 code implementation25 Oct 2021 Anirudh Mittal, Pranav Jeevan, Prerak Gandhi, Diptesh Kanojia, Pushpak Bhattacharyya

We devise a novel scoring mechanism to annotate the training data with a humour quotient score using the audience's laughter.

COVIDRead: A Large-scale Question Answering Dataset on COVID-19

no code implementations5 Oct 2021 Tanik Saikh, Sovan Kumar Sahoo, Asif Ekbal, Pushpak Bhattacharyya

This dataset creates a new avenue of carrying out research on COVID-19 by providing a benchmark dataset and a baseline model.

Question Answering

M2H2: A Multimodal Multiparty Hindi Dataset For Humor Recognition in Conversations

1 code implementation3 Aug 2021 Dushyant Singh Chauhan, Gopendra Vikram Singh, Navonil Majumder, Amir Zadeh, Asif Ekbal, Pushpak Bhattacharyya, Louis-Philippe Morency, Soujanya Poria

We propose several strong multimodal baselines and show the importance of contextual and multimodal information for humor recognition in conversations.

Dialogue Understanding

Crosslingual Embeddings are Essential in UNMT for Distant Languages: An English to IndoAryan Case Study

no code implementations MTSummit 2021 Tamali Banerjee, Rudra Murthy V, Pushpak Bhattacharyya

In this paper, we show that initializing the embedding layer of UNMT models with cross-lingual embeddings shows significant improvements in BLEU score over existing approaches with embeddings randomly initialized.

Denoising Translation +1

Towards Sentiment and Emotion aided Multi-modal Speech Act Classification in Twitter

no code implementations NAACL 2021 Tulika Saha, Apoorva Upadhyaya, Sriparna Saha, Pushpak Bhattacharyya

Experimental results indicate that the proposed framework boosts the performance of the primary task, i. e., TA classification (TAC) by benefitting from the two secondary tasks, i. e., Sentiment and Emotion Analysis compared to its uni-modal and single task TAC (tweet act classification) variants.

Classification Emotion Recognition

Semantic Extractor-Paraphraser based Abstractive Summarization

no code implementations ICON 2020 Anubhav Jangra, Raghav Jain, Vaibhav Mavi, Sriparna Saha, Pushpak Bhattacharyya

The anthology of spoken languages today is inundated with textual information, necessitating the development of automatic summarization models.

Abstractive Text Summarization

Modelling Context Emotions using Multi-task Learning for Emotion Controlled Dialog Generation

no code implementations EACL 2021 Deeksha Varshney, Asif Ekbal, Pushpak Bhattacharyya

We employ multi-task learning to predict the emotion label and to generate a viable response for a given utterance using a common encoder with multiple decoders.

Multi-Task Learning Response Generation +1

Techniques for Jointly Extracting Entities and Relations: A Survey

no code implementations10 Mar 2021 Sachin Pawar, Pushpak Bhattacharyya, Girish K. Palshikar

Traditionally, relation extraction is carried out after entity extraction in a "pipeline" fashion, so that relation extraction only focuses on determining whether any semantic relation exists between a pair of extracted entity mentions.

Relation Relation Extraction

Knowledge-based Extraction of Cause-Effect Relations from Biomedical Text

no code implementations10 Mar 2021 Sachin Pawar, Ravina More, Girish K. Palshikar, Pushpak Bhattacharyya, Vasudeva Varma

We propose a knowledge-based approach for extraction of Cause-Effect (CE) relations from biomedical text.

Cognitively Aided Zero-Shot Automatic Essay Grading

no code implementations ICON 2020 Sandeep Mathias, Rudra Murthy, Diptesh Kanojia, Pushpak Bhattacharyya

Automatic essay grading (AEG) is a process in which machines assign a grade to an essay written in response to a topic, called the prompt.

EmotionGIF-IITP-AINLPML: Ensemble-based Automated Deep Neural System for predicting category(ies) of a GIF response

no code implementations23 Dec 2020 Soumitra Ghosh, Arkaprava Roy, Asif Ekbal, Pushpak Bhattacharyya

In this paper, we describe the systems submitted by our IITP-AINLPML team in the shared task of SocialNLP 2020, EmotionGIF 2020, on predicting the category(ies) of a GIF response for a given unlabelled tweet.

A Retrofitting Model for Incorporating Semantic Relations into Word Embeddings

no code implementations COLING 2020 Sapan Shah, Sreedhar Reddy, Pushpak Bhattacharyya

We present a novel retrofitting model that can leverage relational knowledge available in a knowledge resource to improve word embeddings.

Lexical Entailment Metric Learning +2

IITP-AINLPML at SemEval-2020 Task 12: Offensive Tweet Identification and Target Categorization in a Multitask Environment

no code implementations SEMEVAL 2020 Soumitra Ghosh, Asif Ekbal, Pushpak Bhattacharyya

In this paper, we describe the participation of IITP-AINLPML team in the SemEval-2020 SharedTask 12 on Offensive Language Identification and Target Categorization in English Twitter data.

Language Identification

EL-BERT at SemEval-2020 Task 10: A Multi-Embedding Ensemble Based Approach for Emphasis Selection in Visual Media

no code implementations SEMEVAL 2020 Chandresh Kanani, Sriparna Saha, Pushpak Bhattacharyya

Emphasis selection is the task of choosing candidate words for emphasis, it helps in automatically designing posters and other media contents with written text.

valid

Assessing the Severity of Health States based on Social Media Posts

no code implementations21 Sep 2020 Shweta Yadav, Joy Prakash Sain, Amit Sheth, Asif Ekbal, Sriparna Saha, Pushpak Bhattacharyya

The diverse NLU views demonstrate its effectiveness on both the tasks and as well as on the individual disease to assess a user's health.

Multiview Learning Natural Language Understanding

Can Neural Networks Automatically Score Essay Traits?

no code implementations WS 2020 S Mathias, eep, Pushpak Bhattacharyya

Essay traits are attributes of an essay that can help explain how well written (or badly written) the essay is.

BIG-bench Machine Learning Sentence

Sentiment and Emotion help Sarcasm? A Multi-task Learning Framework for Multi-Modal Sarcasm, Sentiment and Emotion Analysis

no code implementations ACL 2020 Dushyant Singh Chauhan, Dhanush S R, Asif Ekbal, Pushpak Bhattacharyya

In this paper, we hypothesize that sarcasm is closely related to sentiment and emotion, and thereby propose a multi-task deep learning framework to solve all these three problems simultaneously in a multi-modal conversational scenario.

Emotion Recognition Multi-Task Learning +3

Extracting N-ary Cross-sentence Relations using Constrained Subsequence Kernel

no code implementations15 Jun 2020 Sachin Pawar, Pushpak Bhattacharyya, Girish K. Palshikar

Most of the past work in relation extraction deals with relations occurring within a sentence and having only two entity arguments.

Relation Relation Extraction +2

Happy Are Those Who Grade without Seeing: A Multi-Task Learning Approach to Grade Essays Using Gaze Behaviour

1 code implementation Asian Chapter of the Association for Computational Linguistics 2020 Sandeep Mathias, Rudra Murthy, Diptesh Kanojia, Abhijit Mishra, Pushpak Bhattacharyya

To demonstrate the efficacy of this multi-task learning based approach to automatic essay grading, we collect gaze behaviour for 48 essays across 4 essay sets, and learn gaze behaviour for the rest of the essays, numbering over 7000 essays.

Multi-Task Learning Named Entity Recognition (NER) +1

A Deep Learning Approach for Automatic Detection of Fake News

1 code implementation ICON 2019 Tanik Saikh, Arkadipta De, Asif Ekbal, Pushpak Bhattacharyya

We evaluate our techniques on the two recently released datasets, namely FakeNews AMT and Celebrity for fake news detection.

Fake News Detection Feature Engineering

A Platform for Event Extraction in Hindi

no code implementations LREC 2020 Sovan Kumar Sahoo, Saumajit Saha, Asif Ekbal, Pushpak Bhattacharyya

In this paper, we present an Event Extraction framework for Hindi language by creating an annotated resource for benchmarking, and then developing deep learning based models to set as the baselines.

Benchmarking Classification +2

CEASE, a Corpus of Emotion Annotated Suicide notes in English

no code implementations LREC 2020 Soumitra Ghosh, Asif Ekbal, Pushpak Bhattacharyya

In this paper, we create a fine-grained emotion annotated corpus (CEASE) of suicide notes in English and develop various deep learning models to perform emotion detection on the curated dataset.

Sentence

Incorporating Politeness across Languages in Customer Care Responses: Towards building a Multi-lingual Empathetic Dialogue Agent

no code implementations LREC 2020 Mauajama Firdaus, Asif Ekbal, Pushpak Bhattacharyya

Our system is competent in generating responses in different languages (here, English and Hindi) depending on the customer{'}s preference and also is able to converse with humans in an empathetic manner to ensure customer satisfaction and retention.

ScholarlyRead: A New Dataset for Scientific Article Reading Comprehension

no code implementations LREC 2020 Tanik Saikh, Asif Ekbal, Pushpak Bhattacharyya

We present ScholarlyRead, span-of-word-based scholarly articles{'} Reading Comprehension (RC) dataset with approximately 10K manually checked passage-question-answer instances.

Question Answering Reading Comprehension

Recommendation Chart of Domains for Cross-Domain Sentiment Analysis: Findings of A 20 Domain Study

no code implementations LREC 2020 Akash Sheoran, Diptesh Kanojia, Aditya Joshi, Pushpak Bhattacharyya

Cross-domain sentiment analysis (CDSA) helps to address the problem of data scarcity in scenarios where labelled data for a domain (known as the target domain) is unavailable or insufficient.

Sentence Sentiment Analysis +1

Multi-domain Tweet Corpora for Sentiment Analysis: Resource Creation and Evaluation

no code implementations LREC 2020 Mamta ., Asif Ekbal, Pushpak Bhattacharyya, Shikha Srivastava, Alka Kumar, Tista Saha

Due to the phenomenal growth of online content in recent time, sentiment analysis has attracted attention of the researchers and developers.

Sentiment Analysis

Recommendation Chart of Domains for Cross-Domain Sentiment Analysis:Findings of A 20 Domain Study

no code implementations9 Apr 2020 Akash Sheoran, Diptesh Kanojia, Aditya Joshi, Pushpak Bhattacharyya

Cross-domain sentiment analysis (CDSA) helps to address the problem of data scarcity in scenarios where labelled data for a domain (known as the target domain) is unavailable or insufficient.

Sentence Sentiment Analysis +1

Reinforced Multi-task Approach for Multi-hop Question Generation

no code implementations COLING 2020 Deepak Gupta, Hardik Chauhan, Akella Ravi Tej, Asif Ekbal, Pushpak Bhattacharyya

Question generation (QG) attempts to solve the inverse of question answering (QA) problem by generating a natural language question given a document and an answer.

Multi-hop Question Answering Question Answering +3

Utilizing Language Relatedness to improve Machine Translation: A Case Study on Languages of the Indian Subcontinent

no code implementations19 Mar 2020 Anoop Kunchukuttan, Pushpak Bhattacharyya

To the best of our knowledge, this is the first large-scale study specifically devoted to utilizing language relatedness to improve translation between related languages.

Machine Translation Translation

Related Tasks can Share! A Multi-task Framework for Affective language

no code implementations6 Feb 2020 Kumar Shikhar Deep, Md. Shad Akhtar, Asif Ekbal, Pushpak Bhattacharyya

Expressing the polarity of sentiment as 'positive' and 'negative' usually have limited scope compared with the intensity/degree of polarity.

Multi-Task Learning Sentiment Analysis +1

A Deep Neural Framework for Contextual Affect Detection

no code implementations28 Jan 2020 Kumar Shikhar Deep, Asif Ekbal, Pushpak Bhattacharyya

A short and simple text carrying no emotion can represent some strong emotions when reading along with its context, i. e., the same sentence can express extreme anger as well as happiness depending on its context.

Sentence Word Embeddings

Machine Translation Evaluation using Bi-directional Entailment

no code implementations2 Nov 2019 Rakesh Khobragade, Heaven Patel, Anand Namdev, Anish Mishra, Pushpak Bhattacharyya

We apply our evaluation metric on WMT'14 and WMT'17 dataset to evaluate systems participating in the translation task and find that our metric has a better correlation with the human annotated score compared to the other traditional metrics at system level.

Machine Translation Semantic Similarity +2

Scrambled Translation Problem: A Problem of Denoising UNMT

no code implementations MTSummit 2021 Tamali Banerjee, Rudra Murthy V, Pushpak Bhattacharyya

We hypothesise that the reason behind \textit{scrambled translation problem} is 'shuffling noise' which is introduced in every input sentence as a denoising strategy.

Denoising Machine Translation +2

Tale of tails using rule augmented sequence labeling for event extraction

no code implementations19 Aug 2019 Ayush Maheshwari, Hrishikesh Patel, Nandan Rathod, Ritesh Kumar, Ganesh Ramakrishnan, Pushpak Bhattacharyya

The problem of event extraction is a relatively difficult task for low resource languages due to the non-availability of sufficient annotated data.

Event Extraction

Utilizing Monolingual Data in NMT for Similar Languages: Submission to Similar Language Translation Task

no code implementations WS 2019 Jyotsana Khatri, Pushpak Bhattacharyya

This paper describes our submission to Shared Task on Similar Language Translation in Fourth Conference on Machine Translation (WMT 2019).

Machine Translation NMT +1

Parallel Corpus Filtering Based on Fuzzy String Matching

no code implementations WS 2019 Sukanta Sen, Asif Ekbal, Pushpak Bhattacharyya

Based on the scores, we sub-sample two sets (having 1 million and 5 millions English tokens) of parallel sentences from each parallel corpus, and train SMT systems for development purpose only.

NMT Sentence

DeepSentiPeer: Harnessing Sentiment in Review Texts to Recommend Peer Review Decisions

1 code implementation ACL 2019 Tirthankar Ghosal, Rajeev Verma, Asif Ekbal, Pushpak Bhattacharyya

However, the peer review texts, which contains rich sentiment information of the reviewer, reflecting his/her overall attitude towards the research in the paper, could be a valuable entity to predict the acceptance or rejection of the manuscript under consideration.

Multilingual Unsupervised NMT using Shared Encoder and Language-Specific Decoders

no code implementations ACL 2019 Sukanta Sen, Kamal Kumar Gupta, Asif Ekbal, Pushpak Bhattacharyya

In this paper, we propose a multilingual unsupervised NMT scheme which jointly trains multiple languages with a shared encoder and multiple decoders.

Denoising NMT +1

Extraction of Message Sequence Charts from Narrative History Text

no code implementations WS 2019 Girish Palshikar, Sachin Pawar, Sangameshwar Patil, Swapnil Hingmire, Nitin Ramrakhiyani, Harsimran Bedi, Pushpak Bhattacharyya, Vasudeva Varma

In this paper, we advocate the use of Message Sequence Chart (MSC) as a knowledge representation to capture and visualize multi-actor interactions and their temporal ordering.

Dependency Parsing

Extraction of Message Sequence Charts from Software Use-Case Descriptions

no code implementations NAACL 2019 Girish Palshikar, Nitin Ramrakhiyani, Sangameshwar Patil, Sachin Pawar, Swapnil Hingmire, Vasudeva Varma, Pushpak Bhattacharyya

We apply this tool to extract MSCs from several real-life software use-case descriptions and show that it performs better than the existing techniques.

Courteously Yours: Inducing courteous behavior in Customer Care responses using Reinforced Pointer Generator Network

no code implementations NAACL 2019 Hitesh Golchha, Mauajama Firdaus, Asif Ekbal, Pushpak Bhattacharyya

We use real interactions on Twitter between customer care professionals and aggrieved customers to create a large conversational dataset having both forms of agent responses: {`}generic{'} and {`}courteous{'}.

Relation extraction between the clinical entities based on the shortest dependency path based LSTM

no code implementations24 Mar 2019 Dhanachandra Ningthoujam, Shweta Yadav, Pushpak Bhattacharyya, Asif Ekbal

In this paper, we present an efficient relation extraction system based on the shortest dependency path (SDP) generated from the dependency parsed tree of the sentence.

Relation Relation Extraction +1

A Deep Ensemble Framework for Fake News Detection and Classification

no code implementations12 Nov 2018 Arjun Roy, Kingshuk Basak, Asif Ekbal, Pushpak Bhattacharyya

Fake news, rumor, incorrect information, and misinformation detection are nowadays crucial issues as these might have serious consequences for our social fabrics.

Classification Fake News Detection +2

Helping each Other: A Framework for Customer-to-Customer Suggestion Mining using a Semi-supervised Deep Neural Network

no code implementations1 Nov 2018 Hitesh Golchha, Deepak Gupta, Asif Ekbal, Pushpak Bhattacharyya

We evaluate the performance of our proposed model on a benchmark customer review dataset, comprising of the reviews of Hotel and Electronics domains.

Sentiment Analysis Suggestion mining

Addressing word-order Divergence in Multilingual Neural Machine Translation for extremely Low Resource Languages

no code implementations NAACL 2019 Rudra Murthy V, Anoop Kunchukuttan, Pushpak Bhattacharyya

To bridge this divergence, We propose to pre-order the assisting language sentence to match the word order of the source language and train the parent model.

Machine Translation NMT +3

Is your Statement Purposeless? Predicting Computer Science Graduation Admission Acceptance based on Statement Of Purpose

no code implementations WS 2017 Diptesh Kanojia, Nikhil Wani, Pushpak Bhattacharyya

We present a quantitative, data-driven machine learning approach to mitigate the problem of unpredictability of Computer Science Graduate School Admissions.

Uncovering Code-Mixed Challenges: A Framework for Linguistically Driven Question Generation and Neural Based Question Answering

no code implementations CONLL 2018 Deepak Gupta, Pabitra Lenka, Asif Ekbal, Pushpak Bhattacharyya

In this paper, we propose a linguistically motivated technique for code-mixed question generation (CMQG) and a neural network based architecture for code-mixed question answering (CMQA).

Question Answering Question Generation +1

Combining Graph-based Dependency Features with Convolutional Neural Network for Answer Triggering

no code implementations5 Aug 2018 Deepak Gupta, Sarah Kohail, Pushpak Bhattacharyya

Answer triggering is the task of selecting the best-suited answer for a given question from a set of candidate answers if exists.

A Multi-task Ensemble Framework for Emotion, Sentiment and Intensity Prediction

no code implementations3 Aug 2018 Md. Shad Akhtar, Deepanway Ghosal, Asif Ekbal, Pushpak Bhattacharyya, Sadao Kurohashi

In this paper, through multi-task ensemble framework we address three problems of emotion and sentiment analysis i. e. "emotion classification & intensity", "valence, arousal & dominance for emotion" and "valence & arousal} for sentiment".

Emotion Classification General Classification +1

Leveraging Medical Sentiment to Understand Patients Health on Social Media

no code implementations30 Jul 2018 Shweta Yadav, Joy Sain, Amit Sheth, Asif Ekbal, Sriparna Saha, Pushpak Bhattacharyya

A large percentage of this population is actively engaged in health social networks to share health-related information.

Feature Assisted bi-directional LSTM Model for Protein-Protein Interaction Identification from Biomedical Texts

no code implementations5 Jul 2018 Shweta Yadav, Ankit Kumar, Asif Ekbal, Sriparna Saha, Pushpak Bhattacharyya

In this paper, we present a novel method based on deep bidirectional long short-term memory (B-LSTM) technique that exploits word sequences and dependency path related information to identify PPI information from text.

Identifying Transferable Information Across Domains for Cross-domain Sentiment Classification

no code implementations ACL 2018 Raksha Sharma, Pushpak Bhattacharyya, D, S apat, ipan, Himanshu Sharad Bhatt

In this paper, we propose that words that do not change their polarity and significance represent the transferable (usable) information across domains for cross-domain sentiment classification.

Classification Domain Adaptation +5

Multi-Task Learning Framework for Mining Crowd Intelligence towards Clinical Treatment

no code implementations NAACL 2018 Shweta Yadav, Asif Ekbal, Sriparna Saha, Pushpak Bhattacharyya, Amit Sheth

In this paper, we adopt a novel adversarial learning approach for our multi-task learning framework to learn the sentiment{'}s strengths expressed in a medical blog.

General Classification Multi-Task Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.