Search Results for author: Animesh Mukherjee

Found 43 papers, 11 papers with code

A Data Bootstrapping Recipe for Low-Resource Multilingual Relation Classification

no code implementations CoNLL (EMNLP) 2021 Arijit Nag, Bidisha Samanta, Animesh Mukherjee, Niloy Ganguly, Soumen Chakrabarti

Data collection is challenging for Indian languages, because they are syntactically and morphologically diverse, as well as different from resource-rich languages like English.

Classification Fine-tuning +1

Two-Face: Adversarial Audit of Commercial Face Recognition Systems

no code implementations 17 Nov 2021 Siddharth D Jaiswal, Karthikeya Duggirala, Abhisek Dash, Animesh Mukherjee

Computer vision applications like automated face detection are used for a variety of purposes ranging from unlocking smart devices to tracking potential persons of interest for surveillance.

Face Detection Face Recognition

Quality change: norm or exception? Measurement, Analysis and Detection of Quality Change in Wikipedia

1 code implementation 2 Nov 2021 Paramita Das, Bhanu Prakash Reddy Guda, Sasi Bhusan Seelaboyina, Soumya Sarkar, Animesh Mukherjee

To the best of our knowledge, this is the first work that rigorously explores English Wikipedia article quality life cycle from the perspective of quality indicators and provides a novel unsupervised page level approach to detect quality switch, which can help in automatic content monitoring in Wikipedia thus contributing significantly to the CSCW community.

Change Point Detection Time Series

A Data Bootstrapping Recipe for Low Resource Multilingual Relation Classification

no code implementations 18 Oct 2021 Arijit Nag, Bidisha Samanta, Animesh Mukherjee, Niloy Ganguly, Soumen Chakrabarti

Relation classification (sometimes called 'extraction') requires trustworthy datasets for fine-tuning large language models, as well as for evaluation.

Classification Fine-tuning +1

When expertise gone missing: Uncovering the loss of prolific contributors in Wikipedia

no code implementations 21 Sep 2021 Paramita Das, Bhanu Prakash Reddy Guda, Debajit Chakraborty, Soumya Sarkar, Animesh Mukherjee

Success of planetary-scale online collaborative platforms such as Wikipedia is hinged on active and continued participation of its voluntary contributors.

Information Retrieval

Debiasing Multilingual Word Embeddings: A Case Study of Three Indian Languages

1 code implementation 21 Jul 2021 Srijan Bansal, Vishal Garimella, Ayush Suhane, Animesh Mukherjee

In this paper, we advance the current state-of-the-art method for debiasing monolingual word embeddings so as to generalize well in a multilingual setting.

Multilingual Word Embeddings

"Short is the Road that Leads from Fear to Hate": Fear Speech in Indian WhatsApp Groups

2 code implementations 7 Feb 2021 Punyajoy Saha, Binny Mathew, Kiran Garimella, Animesh Mukherjee

We observe that users writing fear speech messages use various events and symbols to create the illusion of fear among the reader about a target community.

When the Umpire is also a Player: Bias in Private Label Product Recommendations on E-commerce Marketplaces

no code implementations 30 Jan 2021 Abhisek Dash, Abhijnan Chakraborty, Saptarshi Ghosh, Animesh Mukherjee, Krishna P. Gummadi

Along a number of our proposed bias measures, we find that the sponsored recommendations are significantly more biased toward Amazon private label products compared to organic recommendations.

Fairness

HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

4 code implementations 18 Dec 2020 Binny Mathew, Punyajoy Saha, Seid Muhie Yimam, Chris Biemann, Pawan Goyal, Animesh Mukherjee

We also observe that models, which utilize the human rationales for training, perform better in reducing unintended bias towards target communities.

Hate Speech Detection Text Classification

Gandhipedia: A one-stop AI-enabled portal for browsing Gandhian literature, life-events and his social network

no code implementations 5 Jun 2020 Sayantan Adak, Atharva Vyas, Animesh Mukherjee, Heer Ambavi, Pritam Kadasi, Mayank Singh, Shivam Patel

We introduce an AI-enabled portal that presents an excellent visualization of Mahatma Gandhi's life events by constructing temporal and spatial social networks from the Gandhian literature.

Aspect-based Sentiment Analysis of Scientific Reviews

1 code implementation 5 Jun 2020 Souvic Chakraborty, Pawan Goyal, Animesh Mukherjee

We also investigate the extent of disagreement between the reviewers and the chair and find that the inter-reviewer disagreement may have a link to the disagreement with the chair.

Active Learning Aspect-Based Sentiment Analysis

HateMonitors: Language Agnostic Abuse Detection in Social Media

1 code implementation 27 Sep 2019 Punyajoy Saha, Binny Mathew, Pawan Goyal, Animesh Mukherjee

In this paper, we present our machine learning model, HateMonitor, developed for Hate Speech and Offensive Content Identification in Indo-European Languages (HASOC), a shared task at FIRE 2019.

Abuse Detection Abusive Language

Competing Topic Naming Conventions in Quora: Predicting Appropriate Topic Merges and Winning Topics from Millions of Topic Pairs

no code implementations 10 Sep 2019 Binny Mathew, Suman Kalyan Maity, Pawan Goyal, Animesh Mukherjee

Our system is also able to predict ~25% of the correct case of merges within the first month of the merge and ~40% of the cases within a year.

Anomaly Detection

On the Compositionality Prediction of Noun Phrases using Poincaré Embeddings

no code implementations ACL 2019 Abhik Jana, Dima Puzyrev, Alexander Panchenko, Pawan Goyal, Chris Biemann, Animesh Mukherjee

In particular, we use hypernymy information of the multiword and its constituents encoded in the form of the recently introduced Poincaré embeddings in addition to the distributional information to detect compositionality for noun phrases.

StRE: Self Attentive Edit Quality Prediction in Wikipedia

1 code implementation ACL 2019 Soumya Sarkar, Bhanu Prakash Reddy, Sandipan Sikdar, Animesh Mukherjee

Wikipedia can easily be justified as a behemoth, considering the sheer volume of content that is added or removed every minute to its several projects.

On the Compositionality Prediction of Noun Phrases using Poincaré Embeddings

no code implementations 7 Jun 2019 Abhik Jana, Dmitry Puzyrev, Alexander Panchenko, Pawan Goyal, Chris Biemann, Animesh Mukherjee

In particular, we use hypernymy information of the multiword and its constituents encoded in the form of the recently introduced Poincaré embeddings in addition to the distributional information to detect compositionality for noun phrases.

KGPChamps at SemEval-2019 Task 3: A deep learning approach to detect emotions in the dialog utterances.

no code implementations SEMEVAL 2019 Jasabanta Patro, Nitin Choudhary, Kalpit Chittora, Animesh Mukherjee

We report the bidirectional LSTM model, along with the input word embedding as the concatenation of word embedding generated from bidirectional LSTM for word characters and conceptnet embedding, as the best performing model with a highest micro-F1 score of 0.7261.

DeepTagRec: A Content-cum-User based Tag Recommendation Framework for Stack Overflow

no code implementations 10 Mar 2019 Suman Kalyan Maity, Abhishek Panigrahi, Sayan Ghosh, Arundhati Banerjee, Pawan Goyal, Animesh Mukherjee

In this paper, we develop a content-cum-user based deep learning framework DeepTagRec to recommend appropriate question tags on Stack Overflow.

A Network-centric Framework for Auditing Recommendation Systems

no code implementations 7 Feb 2019 Abhisek Dash, Animesh Mukherjee, Saptarshi Ghosh

In this work, we propose a novel network-centric framework which is not only able to quantify various static properties of RSs, but also is able to quantify dynamic properties such as how likely RSs are to lead to polarization or segregation of information among their users.

Recommendation Systems

Detecting Reliable Novel Word Senses: A Network-Centric Approach

no code implementations 14 Dec 2018 Abhik Jana, Animesh Mukherjee, Pawan Goyal

The outlined method can therefore be used as a new post-hoc step to improve the precision of novel word sense detection in a robust and reliable way where the underlying framework uses a graph structure.

Analyzing the hate and counter speech accounts on Twitter

no code implementations 6 Dec 2018 Binny Mathew, Navish Kumar, Ravina, Pawan Goyal, Animesh Mukherjee

We also build a supervised model for classifying the hateful and counterspeech accounts on Twitter and obtain an F-score of 0.77.

Social and Information Networks

Spread of hate speech in online social media

no code implementations 4 Dec 2018 Binny Mathew, Ritam Dutt, Pawan Goyal, Animesh Mukherjee

The present online social media platform is afflicted with several issues, with hate speech being on the predominant forefront.

Social and Information Networks

Deep Dive into Anonymity: A Large Scale Analysis of Quora Questions

no code implementations 17 Nov 2018 Binny Mathew, Ritam Dutt, Suman Kalyan Maity, Pawan Goyal, Animesh Mukherjee

In particular, we observe that the choice to post the question as anonymous is dependent on the user's perception of anonymity and they often choose to speak about depression, anxiety, social ties and personal issues under the guise of anonymity.

Deep Learning for Social Media Health Text Classification

no code implementations WS 2018 Santosh Tokala, Vaibhav Gambhir, Animesh Mukherjee

This paper describes the systems developed for 1st and 2nd tasks of the 3rd Social Media Mining for Health Applications Shared Task at EMNLP 2018.

Classification General Classification +3

WikiRef: Wikilinks as a route to recommending appropriate references for scientific Wikipedia pages

no code implementations COLING 2018 Abhik Jana, Pranjal Kanojiya, Pawan Goyal, Animesh Mukherjee

In this paper, we propose a novel two step approach -- WikiRef -- that (i) leverages the wikilinks present in a scientific Wikipedia target page and, thereby, (ii) recommends highly relevant references to be included in that target page appropriately and automatically borrowed from the reference section of the wikilinks.

AppTechMiner: Mining Applications and Techniques from Scientific Articles

no code implementations 10 Sep 2017 Mayank Singh, Soham Dan, Sanyam Agarwal, Pawan Goyal, Animesh Mukherjee

We also categorize individual research articles based on their application areas and the techniques proposed/improved in the article.

Information Retrieval

Is this word borrowed? An automatic approach to quantify the likeliness of borrowing in social media

no code implementations 15 Mar 2017 Jasabanta Patro, Bidisha Samanta, Saurabh Singh, Prithwish Mukherjee, Monojit Choudhury, Animesh Mukherjee

We first propose a context-based clustering method to sample a set of candidate words from the social media data. Next, we propose three novel and similar metrics based on the usage of these words by the users in different tweets; these metrics were used to score and rank the candidate words indicating their borrowed likeliness.

Language Use Matters: Analysis of the Linguistic Structure of Question Texts Can Characterize Answerability in Quora

no code implementations 11 Mar 2017 Suman Kalyan Maity, Aman Kharb, Animesh Mukherjee

Notably, features representing the language use patterns of the users are most discriminative and alone account for an accuracy of 74.18%.

Which techniques does your application use?: An information extraction framework for scientific articles

no code implementations 23 Aug 2016 Soham Dan, Sanyam Agarwal, Mayank Singh, Pawan Goyal, Animesh Mukherjee

Every field of research consists of multiple application areas with various techniques routinely used to solve problems in this wide range of application areas.

Language Modelling

WASSUP? LOL : Characterizing Out-of-Vocabulary Words in Twitter

no code implementations 31 Jan 2016 Suman Kalyan Maity, Chaitanya Sarda, Anshit Chaudhary, Abhijeet Patil, Shraman Kumar, Akash Mondal, Animesh Mukherjee

Language in social media is mostly driven by new words and spellings that are constantly entering the lexicon thereby polluting it and resulting in high deviation from the formal written version.

General Classification

That's sick dude!: Automatic identification of word sense change across different timescales

no code implementations ACL 2014 Sunny Mitra, Ritwik Mitra, Martin Riedl, Chris Biemann, Animesh Mukherjee, Pawan Goyal

In this paper, we propose an unsupervised method to identify noun sense changes based on rigorous analysis of time-varying text data available in the form of millions of digitized books.

Word Sense Disambiguation
