Search Results for author: Ponnurangam Kumaraguru

Found 60 papers, 25 papers with code

HashSet - A Dataset For Hashtag Segmentation

1 code implementation LREC 2022 Prashant Kodali, Akshala Bhatnagar, Naman Ahuja, Manish Shrivastava, Ponnurangam Kumaraguru

HashSet dataset is sampled from a different set of tweets when compared to existing datasets and provides an alternate distribution of hashtags to build and validate hashtag segmentation models.

Segmentation Specificity

CoMeT: Towards Code-Mixed Translation Using Parallel Monolingual Sentences

1 code implementation NAACL (CALCS) 2021 Devansh Gautam, Prashant Kodali, Kshitij Gupta, Anmol Goel, Manish Shrivastava, Ponnurangam Kumaraguru

Code-mixed languages are very popular in multilingual societies around the world, yet the resources lag behind to enable robust systems on such languages.

Machine Translation Translation

Towards identifying Source credibility on Information Leakage in Digital Gadget Market

no code implementations7 Sep 2024 Neha Kumaru, Garvit Gupta, Shreyas Mongia, Shubham Singh, Ponnurangam Kumaraguru, Arun Balaji Buduru

With the digital gadget market becoming highly competitive and ever-evolving, the trend of an increasing number of sensitive posts leaking information on devices in social media is observed.

NER

Multilingual Non-Factoid Question Answering with Silver Answers

no code implementations20 Aug 2024 Ritwik Mishra, Sreeram Vennam, Rajiv Ratn Shah, Ponnurangam Kumaraguru

The APS model attained an accuracy of 80\% and 72\%, as well as a macro F1 of 72\% and 66\%, on the MuNfQuAD testset and the golden set, respectively.

Question Answering

Counter Turing Test ($CT^2$): Investigating AI-Generated Text Detection for Hindi -- Ranking LLMs based on Hindi AI Detectability Index ($ADI_{hi}$)

no code implementations22 Jul 2024 Ishan Kavathekar, Anku Rani, Ashmit Chamoli, Ponnurangam Kumaraguru, Amit Sheth, Amitava Das

The widespread adoption of large language models (LLMs) and awareness around multilingual LLMs have raised concerns regarding the potential risks and repercussions linked to the misapplication of AI-generated text, necessitating increased vigilance.

Text Detection

Higher Order Structures For Graph Explanations

no code implementations5 Jun 2024 Akshit Sinha, Sreeram Vennam, Charu Sharma, Ponnurangam Kumaraguru

Graph Neural Networks (GNNs) have emerged as powerful tools for learning representations of graph-structured data, demonstrating remarkable performance across various tasks.

Graph Learning

Benchmarks Underestimate the Readiness of Multi-lingual Dialogue Agents

no code implementations28 May 2024 Andrew H. Lee, Sina J. Semnani, Galo Castillo-López, Gäel de Chalendar, Monojit Choudhury, Ashna Dua, Kapil Rajesh Kavitha, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Alexis Lombard, Mehrad Moradshahi, Gihyun Park, Nasredine Semmar, Jiwon Seo, Tianhao Shen, Manish Shrivastava, Deyi Xiong, Monica S. Lam

However, after manual evaluation of the validation set, we find that by correcting gold label errors and improving dataset annotation schema, GPT-4 with our prompts can achieve (1) 89. 6%-96. 8% accuracy in DST, and (2) more than 99% correct response generation across different languages.

Dialogue State Tracking In-Context Learning +1

COBIAS: Contextual Reliability in Bias Assessment

1 code implementation22 Feb 2024 Priyanshul Govil, Hemang Jain, Vamshi Krishna Bonagiri, Aman Chadha, Ponnurangam Kumaraguru, Manas Gaur, Sanorita Dey

We develop the Context-Oriented Bias Indicator and Assessment Score (COBIAS) to measure a biased statement's reliability in detecting bias based on the variance in model behavior across different contexts.

SaGE: Evaluating Moral Consistency in Large Language Models

1 code implementation21 Feb 2024 Vamshi Krishna Bonagiri, Sreeram Vennam, Priyanshul Govil, Ponnurangam Kumaraguru, Manas Gaur

To this extent, we construct the Moral Consistency Corpus (MCC), containing 50K moral questions, responses to them by LLMs, and the RoTs that these models followed.

Decision Making HellaSwag +2

Multilingual Coreference Resolution in Low-resource South Asian Languages

1 code implementation21 Feb 2024 Ritwik Mishra, Pooja Desur, Rajiv Ratn Shah, Ponnurangam Kumaraguru

We introduce a Translated dataset for Multilingual Coreference Resolution (TransMuCoRes) in 31 South Asian languages using off-the-shelf tools for translation and word-alignment.

coreference-resolution Word Alignment

Corrective Machine Unlearning

1 code implementation21 Feb 2024 Shashwat Goel, Ameya Prabhu, Philip Torr, Ponnurangam Kumaraguru, Amartya Sanyal

We hope our work spurs research towards developing better methods for corrective unlearning and offers practitioners a new strategy to handle data integrity challenges arising from web-scale training.

Machine Unlearning

InSaAF: Incorporating Safety through Accuracy and Fairness | Are LLMs ready for the Indian Legal Domain?

no code implementations16 Feb 2024 Yogesh Tripathi, Raghav Donakanti, Sahil Girhepuje, Ishan Kavathekar, Bhaskara Hanuma Vedula, Gokul S Krishnan, Shreya Goyal, Anmol Goel, Balaraman Ravindran, Ponnurangam Kumaraguru

Task performance and fairness scores of LLaMA and LLaMA--2 models indicate that the proposed $LSS_{\beta}$ metric can effectively determine the readiness of a model for safe usage in the legal sector.

Fairness

Representation Surgery: Theory and Practice of Affine Steering

1 code implementation15 Feb 2024 Shashwat Singh, Shauli Ravfogel, Jonathan Herzig, Roee Aharoni, Ryan Cotterell, Ponnurangam Kumaraguru

In the case of neural language models, an encoding of the undesirable behavior is often present in the model's representations.

RanDumb: A Simple Approach that Questions the Efficacy of Continual Representation Learning

1 code implementation13 Feb 2024 Ameya Prabhu, Shiven Sinha, Ponnurangam Kumaraguru, Philip H. S. Torr, Ozan Sener, Puneet K. Dokania

However, little attention has been paid to the efficacy of continually learned representations, as representations are learned alongside classifiers throughout the learning process.

Continual Learning Representation Learning

Measuring Moral Inconsistencies in Large Language Models

no code implementations26 Jan 2024 Vamshi Krishna Bonagiri, Sreeram Vennam, Manas Gaur, Ponnurangam Kumaraguru

To address this issue, we propose a novel information-theoretic measure called Semantic Graph Entropy (SGE) to measure the consistency of an LLM in moral scenarios.

Decision Making Language Modelling +2

Towards Effective Paraphrasing for Information Disguise

1 code implementation8 Nov 2023 Anmol Agarwal, Shrey Gupta, Vamshi Bonagiri, Manas Gaur, Joseph Reagle, Ponnurangam Kumaraguru

Information Disguise (ID), a part of computational ethics in Natural Language Processing (NLP), is concerned with best practices of textual paraphrasing to prevent the non-consensual use of authors' posts on the Internet.

Ethics Sentence

Exploring Graph Neural Networks for Indian Legal Judgment Prediction

no code implementations19 Oct 2023 Mann Khatri, Mirza Yusuf, Yaman Kumar, Rajiv Ratn Shah, Ponnurangam Kumaraguru

We explored various embeddings as model features, while nodes such as time nodes and judicial acts were added and pruned to evaluate the model's performance.

Fairness Graph Neural Network +2

CAFIN: Centrality Aware Fairness inducing IN-processing for Unsupervised Representation Learning on Graphs

1 code implementation10 Apr 2023 Arvindh Arun, Aakash Aanegola, Amul Agrawal, Ramasuri Narayanam, Ponnurangam Kumaraguru

Unsupervised Representation Learning on graphs is gaining traction due to the increasing abundance of unlabelled network data and the compactness, richness, and usefulness of the representations generated.

Fairness Graph Learning +3

Are Models Trained on Indian Legal Data Fair?

no code implementations13 Mar 2023 Sahil Girhepuje, Anmol Goel, Gokul S Krishnan, Shreya Goyal, Satyendra Pandey, Ponnurangam Kumaraguru, Balaraman Ravindran

We highlight the propagation of learnt algorithmic biases in the bail prediction task for models trained on Hindi legal documents.

Fairness

Zero-shot Entity and Tweet Characterization with Designed Conditional Prompts and Contexts

no code implementations18 Apr 2022 Sharath Srivatsa, Tushar Mohan, Kumari Neha, Nishchay Malakar, Ponnurangam Kumaraguru, Srinath Srinivasa

In this work, we approach the problem of characterizing Named Entities and Tweets as an open-ended text classification and open-ended fact probing problem. We evaluate the zero-shot language model capabilities of Generative Pretrained Transformer 2 (GPT-2) to characterize Entities and Tweets subjectively with human psychology-inspired and logical conditional prefixes and contexts.

Language Modelling text-classification +1

GAME-ON: Graph Attention Network based Multimodal Fusion for Fake News Detection

1 code implementation25 Feb 2022 Mudit Dhawan, Shakshi Sharma, Aditya Kadam, Rajesh Sharma, Ponnurangam Kumaraguru

A plethora of previous multimodal-based work has tried to address the problem of modeling heterogeneous modalities in identifying fake content.

Fake News Detection Graph Attention +1

HashSet -- A Dataset For Hashtag Segmentation

2 code implementations18 Jan 2022 Prashant Kodali, Akshala Bhatnagar, Naman Ahuja, Manish Shrivastava, Ponnurangam Kumaraguru

HashSet dataset is sampled from a different set of tweets when compared to existing datasets and provides an alternate distribution of hashtags to build and validate hashtag segmentation models.

Segmentation Specificity

Towards Adversarial Evaluations for Inexact Machine Unlearning

3 code implementations17 Jan 2022 Shashwat Goel, Ameya Prabhu, Amartya Sanyal, Ser-Nam Lim, Philip Torr, Ponnurangam Kumaraguru

Machine Learning models face increased concerns regarding the storage of personal user data and adverse impacts of corrupted data like backdoors or systematic bias.

Machine Unlearning Memorization

Diagnosing Data from ICTs to Provide Focused Assistance in Agricultural Adoptions

no code implementations29 Oct 2021 Ashwin Singh, Mallika Subramanian, Anmol Agarwal, Pratyush Priyadarshi, Shrey Gupta, Kiran Garimella, Sanjeev Kumar, Ritesh Kumar, Lokesh Garg, Erica Arya, Ponnurangam Kumaraguru

Our classifier achieves accuracies ranging from 79% to 90% across the five states, demonstrating its potential for assisting future ethnographic investigations.

Specificity

Statistical Learning to Operationalize a Domain Agnostic Data Quality Scoring

no code implementations16 Aug 2021 Sezal Chug, Priya Kaushal, Ponnurangam Kumaraguru, Tavpritesh Sethi

The current empirical study was undertaken to formulate a concrete automated data quality platform to assess the quality of incoming dataset and generate a quality label, score and comprehensive report.

Decision Making

Factorization of Fact-Checks for Low Resource Indian Languages

no code implementations23 Feb 2021 Shivangi Singhal, Rajiv Ratn Shah, Ponnurangam Kumaraguru

The majority of studies on automatic fact-checking and fake news detection is restricted to English only.

Fact Checking Fake News Detection

(Un)Masked COVID-19 Trends from Social Media

no code implementations30 Oct 2020 Asmit Kumar Singh, Paras Mehan, Divyanshu Sharma, Rohan Pandey, Tavpritesh Sethi, Ponnurangam Kumaraguru

Wearing masks is a useful protection method against COVID-19, which has caused widespread economic and social impact worldwide.

Segmentation Semantic Segmentation

AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts

1 code implementation COLING 2020 Mohit Chandra, Ashwin Pathak, Eesha Dutta, Paryul Jain, Manish Gupta, Manish Shrivastava, Ponnurangam Kumaraguru

While extensive popularity of online social media platforms has made information dissemination faster, it has also resulted in widespread online abuse of different types like hate speech, offensive language, sexist and racist opinions, etc.

Abuse Detection severity prediction

imdpGAN: Generating Private and Specific Data with Generative Adversarial Networks

no code implementations29 Sep 2020 Saurabh Gupta, Arun Balaji Buduru, Ponnurangam Kumaraguru

With experiments on MNIST dataset, we show that imdpGAN preserves the privacy of the individual data point, and learns latent codes to control the specificity of the generated samples.

Binary Classification Generative Adversarial Network +1

Multi-objective Reinforcement Learning based approach for User-Centric Power Optimization in Smart Home Environments

no code implementations29 Sep 2020 Saurabh Gupta, Siddhant Bhambri, Karan Dhingra, Arun Balaji Buduru, Ponnurangam Kumaraguru

We experiment on real-world smart home data, and show that the multi-objective approaches: i) establish trade-off between the two objectives, ii) achieve better combined user satisfaction and power consumption than single-objective approaches.

Management Multi-Objective Reinforcement Learning

VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning

1 code implementation14 Sep 2020 Raghav Awasthi, Keerat Kaur Guliani, Saif Ahmad Khan, Aniket Vashishtha, Mehrab Singh Gill, Arshita Bhatt, Aditya Nagori, Aniket Gupta, Ponnurangam Kumaraguru, Tavpritesh Sethi

We approach this problem by proposing a novel pipeline VacSIM that dovetails Deep Reinforcement Learning models into a Contextual Bandits approach for optimizing the distribution of COVID-19 vaccine.

Multi-Armed Bandits OpenAI Gym +2

Psychometric Analysis and Coupling of Emotions Between State Bulletins and Twitter in India during COVID-19 Infodemic

no code implementations12 May 2020 Baani Leen Kaur Jolly, Palash Aggrawal, Amogh Gulati, Amarjit Singh Sethi, Ponnurangam Kumaraguru, Tavpritesh Sethi

In this study, we analyze the psychometric impact and coupling of the COVID-19 infodemic with the official bulletins related to COVID-19 at the national and state level in India.

Misinformation Time Series Analysis

Is change the only constant? Profile change perspective on #LokSabhaElections2019

no code implementations22 Sep 2019 Kumari Neha, Shashank Srikanth, Sonali Singhal, Shwetanshu Singh, Arun Balaji Buduru, Ponnurangam Kumaraguru

Users on Twitter are identified with the help of their profile attributes that consists of username, display name, profile image, to name a few.

Attribute

Hashtags are (not) judgemental: The untold story of Lok Sabha elections 2019

no code implementations16 Sep 2019 Saurabh Gupta, Asmit Kumar Singh, Arun Balaji Buduru, Ponnurangam Kumaraguru

In the political context, hashtags on Twitter are used by users to campaign for their parties, spread news, or to get followers and get a general idea by following a discussion built around a hashtag.

Semantic Similarity Semantic Textual Similarity

What sets Verified Users apart? Insights, Analysis and Prediction of Verified Users on Twitter

no code implementations12 Mar 2019 Indraneil Paul, Abhinav Khattar, Shaan Chopra, Ponnurangam Kumaraguru, Manish Gupta

The aim of the paper is two-fold: First, we test if discerning the verification status of a handle from profile metadata and content features is feasible.

Attentional Road Safety Networks

no code implementations12 Dec 2018 Sonu Gupta, Deepak Srivatsav, A. V. Subramanyam, Ponnurangam Kumaraguru

Road safety mapping using satellite images is a cost-effective but a challenging problem for smart city planning.

Domain Adaptation

Neural Machine Translation for English-Tamil

1 code implementation WS 2018 Himanshu Choudhary, Aditya Kumar Pathak, Rajiv Ratan Saha, Ponnurangam Kumaraguru

We propose a novel neural machine translation technique using word-embedding along with Byte-Pair-Encoding (BPE) to develop an efficient translation system that overcomes the OOV (Out Of Vocabulary) problem for languages which do not have much translations available online.

Machine Translation NMT +1

Language Identification and Named Entity Recognition in Hinglish Code Mixed Tweets

no code implementations ACL 2018 Kushagra Singh, Indira Sen, Ponnurangam Kumaraguru

While growing code-mixed content on Online Social Networks(OSN) provides a fertile ground for studying various aspects of code-mixing, the lack of automated text analysis tools render such studies challenging.

Abuse Detection Chunking +6

A Twitter Corpus for Hindi-English Code Mixed POS Tagging

no code implementations WS 2018 Kushagra Singh, Indira Sen, Ponnurangam Kumaraguru

Code-mixing is a linguistic phenomenon where multiple languages are used in the same occurrence that is increasingly common in multilingual societies.

POS POS Tagging

Hardening Deep Neural Networks via Adversarial Model Cascades

1 code implementation2 Feb 2018 Deepak Vijaykeerthy, Anshuman Suri, Sameep Mehta, Ponnurangam Kumaraguru

Deep neural networks (DNNs) are vulnerable to malicious inputs crafted by an adversary to produce erroneous outputs.

PicHunt: Social Media Image Retrieval for Improved Law Enforcement

no code implementations2 Aug 2016 Sonal Goel, Niharika Sachdeva, Ponnurangam Kumaraguru, A. V. Subramanyam, Divam Gupta

This results in the need for first responders to inspect the spread of such images and users propagating them on social media.

Image Retrieval Retrieval

Analyzing Social and Stylometric Features to Identify Spear phishing Emails

no code implementations14 Jun 2014 Prateek Dewan, Anand Kashyap, Ponnurangam Kumaraguru

To the best of our knowledge, this is one of the first attempts to make use of a combination of stylometric features extracted from emails, and social features extracted from an online social network to detect targeted spear phishing emails.

Cannot find the paper you are looking for? You can Submit a new open access paper.