Search Results for author: Avinash Madasu

Found 19 papers, 7 papers with code

SocialCounterfactuals: Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples

1 code implementation30 Nov 2023 Phillip Howard, Avinash Madasu, Tiep Le, Gustavo Lujan Moreno, Anahita Bhiwandiwalla, Vasudev Lal

Our approach utilizes Stable Diffusion with cross attention control to produce sets of counterfactual image-text pairs that are highly similar in their depiction of a subject (e. g., a given occupation) while differing only in their depiction of intersectional social attributes (e. g., race & gender).

counterfactual

Analyzing Zero-Shot Abilities of Vision-Language Models on Video Understanding Tasks

1 code implementation7 Oct 2023 Avinash Madasu, Anahita Bhiwandiwalla, Vasudev Lal

We investigate 9 foundational image-text models on a diverse set of video tasks that include video action recognition (video AR), video retrieval (video RT), video question answering (video QA), video multiple choice (video MC) and video captioning (video CP).

Action Recognition Multiple-choice +6

Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples

no code implementations4 Oct 2023 Phillip Howard, Avinash Madasu, Tiep Le, Gustavo Lujan Moreno, Vasudev Lal

While vision-language models (VLMs) have achieved remarkable performance improvements recently, there is growing evidence that these models also posses harmful biases with respect to social attributes such as gender and race.

counterfactual

Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations

no code implementations30 Aug 2023 Kilichbek Haydarov, Xiaoqian Shen, Avinash Madasu, Mahmoud Salem, Li-Jia Li, Gamaleldin Elsayed, Mohamed Elhoseiny

We introduce Affective Visual Dialog, an emotion explanation and reasoning task as a testbed for research on understanding the formation of emotions in visually grounded conversations.

Explanation Generation Question Answering +1

ICSVR: Investigating Compositional and Syntactic Understanding in Video Retrieval Models

1 code implementation28 Jun 2023 Avinash Madasu, Vasudev Lal

The study is performed on two categories of video retrieval models: (i) which are pre-trained on video-text pairs and fine-tuned on downstream video retrieval datasets (Eg.

Retrieval Video Retrieval +1

Is Multimodal Vision Supervision Beneficial to Language?

1 code implementation10 Feb 2023 Avinash Madasu, Vasudev Lal

We compare the performance of language representations of stand-alone text encoders of these models to the language representations of text encoders learnt through vision supervision.

Image Retrieval Natural Language Understanding +4

What do Large Language Models Learn beyond Language?

1 code implementation21 Oct 2022 Avinash Madasu, Shashank Srivastava

Large language models (LMs) have rapidly become a mainstay in Natural Language Processing.

MuMUR : Multilingual Multimodal Universal Retrieval

no code implementations24 Aug 2022 Avinash Madasu, Estelle Aflalo, Gabriela Ben Melech Stan, Shachar Rosenman, Shao-Yen Tseng, Gedas Bertasius, Vasudev Lal

In this paper, we propose a framework MuMUR, that utilizes knowledge transfer from a multilingual model to boost the performance of multi-modal (image and video) retrieval.

Image Retrieval Machine Translation +3

A Syntax Aware BERT for Identifying Well-Formed Queries in a Curriculum Framework

no code implementations21 Aug 2022 Avinash Madasu, Anvesh Rao Vijjini

A well formed query is defined as a query which is formulated in the manner of an inquiry, and with correct interrogatives, spelling and grammar.

Language Modelling

A Unified Framework for Emotion Identification and Generation in Dialogues

no code implementations31 May 2022 Avinash Madasu, Mauajama Firdaus, Asif Eqbal

Social chatbots have gained immense popularity, and their appeal lies not just in their capacity to respond to the diverse requests from users, but also in the ability to develop an emotional connection with users.

Learning to Retrieve Videos by Asking Questions

1 code implementation11 May 2022 Avinash Madasu, Junier Oliva, Gedas Bertasius

To overcome this limitation, we propose a novel framework for Video Retrieval using Dialog (ViReD), which enables the user to interact with an AI agent via multiple rounds of dialog, where the user refines retrieved results by answering questions generated by an AI agent.

Retrieval Text to Video Retrieval +1

Sequential Domain Adaptation through Elastic Weight Consolidation for Sentiment Analysis

no code implementations2 Jul 2020 Avinash Madasu, Vijjini Anvesh Rao

SDA draws on EWC for training on successive source domains to move towards a general domain solution, thereby solving the problem of domain adaptation.

Domain Adaptation Sentiment Analysis

A Study of Feature Extraction techniques for Sentiment Analysis

no code implementations4 Jun 2019 Avinash Madasu, Sivasankar E

Sentiment Analysis refers to the study of systematically extracting the meaning of subjective text .

Sentiment Analysis

Gated Convolutional Neural Networks for Domain Adaptation

no code implementations16 May 2019 Avinash Madasu, Vijjini Anvesh Rao

In this paper, we show that Gated Convolutional Neural Networks (GCN) perform effectively at learning sentiment analysis in a manner where domain dependant knowledge is filtered out using its gates.

Domain Adaptation Sentiment Analysis

Effectiveness of Self Normalizing Neural Networks for Text Classification

no code implementations3 May 2019 Avinash Madasu, Vijjini Anvesh Rao

In this paper we aim to show the effectiveness of proposed, Self Normalizing Convolutional Neural Networks(SCNN) on text classification.

General Classification text-classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.