no code implementations • 14 Mar 2025 • Avinash Madasu, Vasudev Lal, Phillip Howard
We propose Concept Consistency Score (CCS), a novel interpretability metric that measures how consistently individual attention heads in CLIP models align with specific concepts.
no code implementations • 3 Oct 2024 • Sungduk Yu, Man Luo, Avinash Madasu, Vasudev Lal, Phillip Howard
To address this deficiency, we propose a new detection approach which surpasses existing methods in the identification of GPT-4o written peer reviews at low levels of false positive classifications.
no code implementations • 10 Sep 2024 • Avinash Madasu, Yossi Gandelsman, Vasudev Lal, Phillip Howard
However, little is known about the inner workings of CLIP.
1 code implementation • CVPR 2024 • Phillip Howard, Avinash Madasu, Tiep Le, Gustavo Lujan Moreno, Anahita Bhiwandiwalla, Vasudev Lal
Our approach utilizes Stable Diffusion with cross attention control to produce sets of counterfactual image-text pairs that are highly similar in their depiction of a subject (e. g., a given occupation) while differing only in their depiction of intersectional social attributes (e. g., race & gender).
1 code implementation • 7 Oct 2023 • Avinash Madasu, Anahita Bhiwandiwalla, Vasudev Lal
We investigate 9 foundational image-text models on a diverse set of video tasks that include video action recognition (video AR), video retrieval (video RT), video question answering (video QA), video multiple choice (video MC) and video captioning (video CP).
no code implementations • 4 Oct 2023 • Phillip Howard, Avinash Madasu, Tiep Le, Gustavo Lujan Moreno, Vasudev Lal
While vision-language models (VLMs) have achieved remarkable performance improvements recently, there is growing evidence that these models also posses harmful biases with respect to social attributes such as gender and race.
no code implementations • 30 Aug 2023 • Kilichbek Haydarov, Xiaoqian Shen, Avinash Madasu, Mahmoud Salem, Li-Jia Li, Gamaleldin Elsayed, Mohamed Elhoseiny
We introduce Affective Visual Dialog, an emotion explanation and reasoning task as a testbed for research on understanding the formation of emotions in visually grounded conversations.
2 code implementations • 28 Jun 2023 • Avinash Madasu, Vasudev Lal
The study is performed on two categories of video retrieval models: (i) which are pre-trained on video-text pairs and fine-tuned on downstream video retrieval datasets (Eg.
1 code implementation • 27 May 2023 • Mauajama Firdaus, Avinash Madasu, Asif Ekbal
Lastly, a decoder generates the corresponding response for the given dialogue context and the extracted slot values.
1 code implementation • 10 Feb 2023 • Avinash Madasu, Vasudev Lal
We compare the performance of language representations of stand-alone text encoders of these models to the language representations of text encoders learnt through vision supervision.
1 code implementation • 21 Oct 2022 • Avinash Madasu, Shashank Srivastava
Large language models (LMs) have rapidly become a mainstay in Natural Language Processing.
no code implementations • 24 Aug 2022 • Avinash Madasu, Estelle Aflalo, Gabriela Ben Melech Stan, Shachar Rosenman, Shao-Yen Tseng, Gedas Bertasius, Vasudev Lal
In this paper, we propose a framework MuMUR, that utilizes knowledge transfer from a multilingual model to boost the performance of multi-modal (image and video) retrieval.
no code implementations • 21 Aug 2022 • Avinash Madasu, Anvesh Rao Vijjini
A well formed query is defined as a query which is formulated in the manner of an inquiry, and with correct interrogatives, spelling and grammar.
no code implementations • 31 May 2022 • Avinash Madasu, Mauajama Firdaus, Asif Eqbal
Social chatbots have gained immense popularity, and their appeal lies not just in their capacity to respond to the diverse requests from users, but also in the ability to develop an emotional connection with users.
1 code implementation • 11 May 2022 • Avinash Madasu, Junier Oliva, Gedas Bertasius
To overcome this limitation, we propose a novel framework for Video Retrieval using Dialog (ViReD), which enables the user to interact with an AI agent via multiple rounds of dialog, where the user refines retrieved results by answering questions generated by an AI agent.
no code implementations • 2 Jul 2020 • Avinash Madasu, Vijjini Anvesh Rao
SDA draws on EWC for training on successive source domains to move towards a general domain solution, thereby solving the problem of domain adaptation.
no code implementations • 3 May 2020 • Avinash Madasu, Vijjini Anvesh Rao
Aspect Term Sentiment Analysis (ATSA) is a subtask of ABSA, in which aspect terms are contained within the given sentence.
Aspect-Based Sentiment Analysis
Aspect-Based Sentiment Analysis (ABSA)
+2
no code implementations • 1 Nov 2019 • Avinash Madasu, Sivasankar E
Feature selection is a crucial process in machine learning.
no code implementations • IJCNLP 2019 • Avinash Madasu, Vijjini Anvesh Rao
With the advent of deep learning, convolutional neural network (CNN) has been a popular solution to this task.
no code implementations • 4 Jun 2019 • Avinash Madasu, Sivasankar E
Sentiment Analysis refers to the study of systematically extracting the meaning of subjective text .
no code implementations • 16 May 2019 • Avinash Madasu, Vijjini Anvesh Rao
In this paper, we show that Gated Convolutional Neural Networks (GCN) perform effectively at learning sentiment analysis in a manner where domain dependant knowledge is filtered out using its gates.
no code implementations • 3 May 2019 • Avinash Madasu, Vijjini Anvesh Rao
In this paper we aim to show the effectiveness of proposed, Self Normalizing Convolutional Neural Networks(SCNN) on text classification.