Search Results for author: Amitava Das

Found 51 papers, 14 papers with code

Image2tweet: Datasets in Hindi and English for Generating Tweets from Images

1 code implementation ICON 2021 Rishabh Jha, Varshith Kaki, Varuna Kolla, Shubham Bhagat, Parth Patwa, Amitava Das, Santanu Pal

The aim is to generate a specialized text like a tweet, that is not a direct result of visual-linguistic grounding that is usually leveraged in similar tasks, but conveys a message that factors-in not only the visual content of the image, but also additional real world contextual information associated with the event described within the image as closely as possible.

Image Captioning World Knowledge

The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey

no code implementations15 Jan 2024 Saurav Pawar, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Vinija Jain, Aman Chadha, Amitava Das

The advent of Large Language Models (LLMs) represents a notable breakthrough in Natural Language Processing (NLP), contributing to substantial progress in both text comprehension and generation.

Reading Comprehension

A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models

1 code implementation2 Jan 2024 S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Vinija Jain, Anku Rani, Vipula Rawte, Aman Chadha, Amitava Das

As Large Language Models (LLMs) continue to advance in their ability to write human-like text, a key challenge remains around their tendency to hallucinate generating content that appears factual but is ungrounded.

Hallucination Retrieval +1

SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection

no code implementations1 Dec 2023 Anku Rani, Dwip Dalal, Shreya Gautam, Pankaj Gupta, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das

This research explores the problem of deception through the lens of psychology, employing a framework that categorizes deception into three forms: lies of omission, lies of commission, and lies of influence.

Deception Detection Multi-Task Learning

On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models

1 code implementation11 Oct 2023 Thilini Wijesiriwardene, Ruwan Wickramarachchi, Aishwarya Naresh Reganti, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das

Through our analysis, we find that LLMs' ability to identify sentence analogies is positively correlated with their ability to encode syntactic and semantic structures of sentences.

Language Modelling Sentence

The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations

no code implementations8 Oct 2023 Vipula Rawte, Swagata Chakraborty, Agnibh Pathak, Anubhav Sarkar, S. M Towhidul Islam Tonmoy, Aman Chadha, Amit P. Sheth, Amitava Das

Finally, to establish a method for quantifying and to offer a comparative spectrum that allows us to evaluate and rank LLMs based on their vulnerability to producing hallucinations, we propose Hallucination Vulnerability Index (HVI).

Hallucination

Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think -- Introducing AI Detectability Index

no code implementations8 Oct 2023 Megha Chakraborty, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Krish Sharma, Niyar R Barman, Chandan Gupta, Shreya Gautam, Tanay Kumar, Vinija Jain, Aman Chadha, Amit P. Sheth, Amitava Das

Given this cynosural spotlight on generative AI, AI-generated text detection (AGTD) has emerged as a topic that has already received immediate attention in research, with some initial methods having been proposed, soon followed by emergence of techniques to bypass detection.

Text Detection

A Survey of Hallucination in Large Foundation Models

1 code implementation12 Sep 2023 Vipula Rawte, Amit Sheth, Amitava Das

Hallucination in a foundation model (FM) refers to the generation of content that strays from factual reality or includes fabricated information.

Hallucination

RESTORE: Graph Embedding Assessment Through Reconstruction

no code implementations28 Aug 2023 Hong Yung Yip, Chidaksh Ravuru, Neelabha Banerjee, Shashwat Jha, Amit Sheth, Aman Chadha, Amitava Das

We analyze their effectiveness in preserving the (a) topological structure of node-level graph reconstruction with an increasing number of hops and (b) semantic information on various word semantic and analogy tests.

Graph Embedding Graph Reconstruction +1

Why Do We Need Neuro-symbolic AI to Model Pragmatic Analogies?

no code implementations2 Aug 2023 Thilini Wijesiriwardene, Amit Sheth, Valerie L. Shalin, Amitava Das

A hallmark of intelligence is the ability to use a familiar domain to make inferences about a less familiar domain, known as analogical reasoning.

FACTIFY3M: A Benchmark for Multimodal Fact Verification with Explainability through 5W Question-Answering

no code implementations22 May 2023 Megha Chakraborty, Khushbu Pahwa, Anku Rani, Shreyas Chatterjee, Dwip Dalal, Harshit Dave, Ritvik G, Preethi Gurumurthy, Adarsh Mahor, Samahriti Mukherjee, Aditya Pakala, Ishan Paul, Janvita Reddy, Arghya Sarkar, Kinjal Sensharma, Aman Chadha, Amit P. Sheth, Amitava Das

To address this gap, we introduce FACTIFY 3M, a dataset of 3 million samples that pushes the boundaries of the domain of fact verification via a multimodal fake news dataset, in addition to offering explainability through the concept of 5W question-answering.

Fact Verification Question Answering

ANALOGICAL -- A Novel Benchmark for Long Text Analogy Evaluation in Large Language Models

no code implementations8 May 2023 Thilini Wijesiriwardene, Ruwan Wickramarachchi, Bimal G. Gajera, Shreeyash Mukul Gowaikar, Chandan Gupta, Aman Chadha, Aishwarya Naresh Reganti, Amit Sheth, Amitava Das

Over the past decade, analogies, in the form of word-level analogies, have played a significant role as an intrinsic measure of evaluating the quality of word embedding methods such as word2vec.

Negation Sentence

FACTIFY-5WQA: 5W Aspect-based Fact Verification through Question Answering

no code implementations7 May 2023 Anku Rani, S. M Towhidul Islam Tonmoy, Dwip Dalal, Shreya Gautam, Megha Chakraborty, Aman Chadha, Amit Sheth, Amitava Das

Finally, we report a baseline QA system to automatically locate those answers from evidence documents, which can serve as a baseline for future research in the field.

Fact Checking Fact Verification +3

Factify 2: A Multimodal Fake News and Satire News Dataset

1 code implementation8 Apr 2023 S Suryavardan, Shreyash Mishra, Parth Patwa, Megha Chakraborty, Anku Rani, Aishwarya Reganti, Aman Chadha, Amitava Das, Amit Sheth, Manoj Chinnakotla, Asif Ekbal, Srijan Kumar

In this paper, we provide a multi-modal fact-checking dataset called FACTIFY 2, improving Factify 1 by using new data sources and adding satire articles.

Claim Verification Fact Checking +1

Memotion Analysis through the Lens of Joint Embedding

1 code implementation13 Nov 2021 Nethra Gunti, Sathyanarayanan Ramamoorthy, Parth Patwa, Amitava Das

Joint embedding (JE) is a way to encode multi-modal data into a vector space where text remains as the grounding key and other modalities like image are to be anchored with such keys.

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

1 code implementation12 Nov 2021 Mohsin Ali, Kandukuri Sai Teja, Sumanth Manduru, Parth Patwa, Amitava Das

NLP applications for code-mixed (CM) or mix-lingual text have gained a significant momentum recently, the main reason being the prevalence of language mixing in social media communications in multi-lingual societies like India, Mexico, Europe, parts of USA etc.

Position Word Embeddings

Hater-O-Genius Aggression Classification using Capsule Networks

1 code implementation ICON 2020 Parth Patwa, Srinivas PYKL, Amitava Das, Prerana Mukherjee, Viswanath Pulabaigari

In this paper, we propose an end-to-end ensemble-based architecture to automatically identify and classify aggressive tweets.

Classification

Hostility Detection Dataset in Hindi

1 code implementation6 Nov 2020 Mohit Bhardwaj, Md Shad Akhtar, Asif Ekbal, Amitava Das, Tanmoy Chakraborty

In this paper, we present a novel hostility detection dataset in Hindi language.

Hate is the New Infodemic: A Topic-aware Modeling of Hate Speech Diffusion on Twitter

1 code implementation9 Oct 2020 Sarah Masud, Subhabrata Dutta, Sakshi Makkar, Chhavi Jain, Vikram Goyal, Amitava Das, Tanmoy Chakraborty

Meanwhile, to predict the retweet dynamics on Twitter, we propose RETINA, a novel neural architecture that incorporates exogenous influence using scaled dot-product attention.

World Knowledge

SemEval-2020 Task 8: Memotion Analysis -- The Visuo-Lingual Metaphor!

1 code implementation9 Aug 2020 Chhavi Sharma, Deepesh Bhageria, William Scott, Srinivas PYKL, Amitava Das, Tanmoy Chakraborty, Viswanath Pulabaigari, Bjorn Gamback

The objective of this proposal is to bring the attention of the research community towards the automatic processing of Internet memes.

Emotion Recognition

Minority Positive Sampling for Switching Points - an Anecdote for the Code-Mixing Language Modeling

no code implementations LREC 2020 Arindam Chatterjere, Vineeth Guptha, Parul Chopra, Amitava Das

To better understand the problem of LM for CM, we initially experimented with several statistical language modeling techniques and consequently experimented with contemporary neural language models.

Language Modelling

Aggression and Misogyny Detection using BERT: A Multi-Task Approach

1 code implementation LREC 2020 Niloofar Safi Samghabadi, Parth Patwa, Srinivas PYKL, Prerana Mukherjee, Amitava Das, Thamar Solorio

In recent times, the focus of the NLP community has increased towards offensive language, aggression, and hate-speech detection. This paper presents our system for TRAC-2 shared task on {``}Aggression Identification{''} (sub-task A) and {``}Misogynistic Aggression Identification{''} (sub-task B).

Abusive Language Aggression Identification +3

NIT\_Agartala\_NLP\_Team at SemEval-2019 Task 6: An Ensemble Approach to Identifying and Categorizing Offensive Language in Twitter Social Media Corpora

no code implementations SEMEVAL 2019 Steve Durairaj Swamy, Anupam Jamatia, Bj{\"o}rn Gamb{\"a}ck, Amitava Das

The paper describes the systems submitted to OffensEval (SemEval 2019, Task 6) on {`}Identifying and Categorizing Offensive Language in Social Media{'} by the {`}NIT{\_}Agartala{\_}NLP{\_}Team{'}.

Sentiment Analysis of Code-Mixed Indian Languages: An Overview of SAIL_Code-Mixed Shared Task @ICON-2017

no code implementations18 Mar 2018 Braja Gopal Patra, Dipankar Das, Amitava Das

This paper presents overview of the shared task on sentiment analysis of code-mixed data pairs of Hindi-English and Bengali-English collected from the different social media platform.

Sentiment Analysis Stance Detection

A Societal Sentiment Analysis: Predicting the Values and Ethics of Individuals by Analysing Social Media Content

no code implementations EACL 2017 Tushar Maheshwari, Aishwarya N. Reganti, Samiksha Gupta, Anupam Jamatia, Upendra Kumar, Bj{\"o}rn Gamb{\"a}ck, Amitava Das

Several experiments were carried out on the corpora to classify the ethical values of users, incorporating Linguistic Inquiry Word Count analysis, n-grams, topic models, psycholinguistic lexica, speech-acts, and non-linguistic information, while applying a range of machine learners (Support Vector Machines, Logistic Regression, and Random Forests) to identify the best linguistic and non-linguistic features for automatic classification of values and ethics.

Ethics General Classification +3

Comparing the Level of Code-Switching in Corpora

no code implementations LREC 2016 Bj{\"o}rn Gamb{\"a}ck, Amitava Das

Social media texts are often fairly informal and conversational, and when produced by bilinguals tend to be written in several different languages simultaneously, in the same way as conversational speech.

Cannot find the paper you are looking for? You can Submit a new open access paper.