Search Results for author: Adriana Kovashka

Found 37 papers, 7 papers with code

Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition

no code implementations3 Jan 2024 Kyle Buettner, Sina Malakouti, Xiang Lorraine Li, Adriana Kovashka

In the absence of training data from target geographies, we hypothesize that geographically diverse descriptive knowledge of categories can enhance robustness.

Descriptive Language Modelling +3

Semi-Supervised Domain Generalization for Object Detection via Language-Guided Feature Alignment

1 code implementation24 Sep 2023 Sina Malakouti, Adriana Kovashka

Existing domain adaptation (DA) and generalization (DG) methods in object detection enforce feature alignment in the visual space but face challenges like object appearance variability and scene complexity, which make it difficult to distinguish between objects and achieve accurate detection.

Descriptive Domain Generalization +4

Impact of Experiencing Misrecognition by Teachable Agents on Learning and Rapport

no code implementations11 Jun 2023 Yuya Asano, Diane Litman, Mingzhi Yu, Nikki Lobczowski, Timothy Nokes-Malach, Adriana Kovashka, Erin Walker

While speech-enabled teachable agents have some advantages over typing-based ones, they are vulnerable to errors stemming from misrecognition by automatic speech recognition (ASR).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Hypernymization of named entity-rich captions for grounding-based multi-modal pretraining

no code implementations25 Apr 2023 Giacomo Nebbia, Adriana Kovashka

In this work, we investigate hypernymization as a way to deal with named entities for pretraining grounding-based multi-modal models and for fine-tuning on open-vocabulary detection.

Language Modelling Retrieval +1

Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection

no code implementations17 Mar 2023 Kyle Buettner, Adriana Kovashka

Methods are mostly evaluated in terms of how well object class names are learned, but captions also contain rich attribute context that should be considered when learning object alignment.

Attribute Contrastive Learning +7

VEIL: Vetting Extracted Image Labels from In-the-Wild Captions for Weakly-Supervised Object Detection

no code implementations16 Mar 2023 Arushi Rai, Adriana Kovashka

The use of large-scale vision-language datasets is limited for object detection due to the negative impact of label noise on localization.

object-detection Weakly Supervised Object Detection

Weakly-Supervised HOI Detection from Interaction Labels Only and Language/Vision-Language Priors

no code implementations9 Mar 2023 Mesut Erhan Unal, Adriana Kovashka

In this paper, we tackle HOI detection with the weakest supervision setting in the literature, using only image-level interaction labels, with the help of a pretrained vision-language model (VLM) and a large language model (LLM).

Human-Object Interaction Detection Language Modelling +2

Contrastive View Design Strategies to Enhance Robustness to Domain Shifts in Downstream Object Detection

no code implementations9 Dec 2022 Kyle Buettner, Adriana Kovashka

To address this gap, we conduct an empirical study of contrastive learning and out-of-domain object detection, studying how contrastive view design affects robustness.

Contrastive Learning Object +2

Symbolic image detection using scene and knowledge graphs

1 code implementation10 Jun 2022 Nasrin Kalanat, Adriana Kovashka

In this paper, we use a scene graph, a graph representation of an image, to capture visual components.

Common Sense Reasoning Knowledge Graphs

Weakly-Supervised Action Detection Guided by Audio Narration

no code implementations12 May 2022 Keren Ye, Adriana Kovashka

We explored how to eliminate the expensive annotations in video detection data which provide refined boundaries.

Action Detection

Learning Better Visual Representations for Weakly-Supervised Object Detection Using Natural Language Supervision

no code implementations29 Sep 2021 Mesut Erhan Unal, Adriana Kovashka

We present a framework to better leverage natural language supervision for a specific downstream task, namely weakly-supervised object detection (WSOD).

object-detection Representation Learning +1

Exploring Corruption Robustness: Inductive Biases in Vision Transformers and MLP-Mixers

1 code implementation24 Jun 2021 Katelyn Morrison, Benjamin Gilby, Colton Lipchak, Adam Mattioli, Adriana Kovashka

We find that vision transformer architectures are inherently more robust to corruptions than the ResNet-50 and MLP-Mixers.

Data Augmentation

Linguistic Structures as Weak Supervision for Visual Scene Graph Generation

1 code implementation CVPR 2021 Keren Ye, Adriana Kovashka

Prior work in scene graph generation requires categorical supervision at the level of triplets - subjects and objects, and predicates that relate them, either with or without bounding box information.

Graph Generation Scene Graph Generation

Domain-robust VQA with diverse datasets and methods but no target labels

no code implementations CVPR 2021 Mingda Zhang, Tristan Maidment, Ahmad Diab, Adriana Kovashka, Rebecca Hwa

The observation that computer vision methods overfit to dataset specifics has inspired diverse attempts to make object recognition models robust to domain shifts.

Object Recognition Question Answering +2

SpotPatch: Parameter-Efficient Transfer Learning for Mobile Object Detection

no code implementations4 Jan 2021 Keren Ye, Adriana Kovashka, Mark Sandler, Menglong Zhu, Andrew Howard, Marco Fornoni

In this paper we address the question: can task-specific detectors be trained and represented as a shared set of weights, plus a very small set of additional weights for each task?

Object object-detection +2

Learning to Transfer Visual Effects from Videos to Images

no code implementations3 Dec 2020 Christopher Thomas, Yale Song, Adriana Kovashka

We study the problem of animating images by transferring spatio-temporal visual effects (such as melting) from a collection of videos.

Optical Flow Estimation

Predicting the Politics of an Image Using Webly Supervised Data

1 code implementation NeurIPS 2019 Christopher Thomas, Adriana Kovashka

We collect a dataset of over one million unique images and associated news articles from left- and right-leaning news sources, and develop a method to predict the image's political leaning.

Cap2Det: Learning to Amplify Weak Caption Supervision for Object Detection

1 code implementation ICCV 2019 Keren Ye, Mingda Zhang, Adriana Kovashka, Wei Li, Danfeng Qin, Jesse Berent

Learning to localize and name object instances is a fundamental problem in vision, but state-of-the-art approaches rely on expensive bounding box supervision.

Object object-detection +1

Measuring Effectiveness of Video Advertisements

no code implementations15 Jan 2019 James Hahn, Adriana Kovashka

They are attractive to companies and nearly unavoidable for consumers.

Artistic Object Recognition by Unsupervised Style Adaptation

no code implementations28 Dec 2018 Christopher Thomas, Adriana Kovashka

To do so, we introduce a complementary training modality constructed to be similar in artistic style to the target domain, and enforce that the network learns features that are invariant between the two training modalities.

Domain Adaptation Object +3

Learning to discover and localize visual objects with open vocabulary

no code implementations25 Nov 2018 Keren Ye, Mingda Zhang, Wei Li, Danfeng Qin, Adriana Kovashka, Jesse Berent

To alleviate the cost of obtaining accurate bounding boxes for training today's state-of-the-art object detection models, recent weakly supervised detection work has proposed techniques to learn from image-level labels.

Object object-detection +1

Story Understanding in Video Advertisements

no code implementations29 Jul 2018 Keren Ye, Kyle Buettner, Adriana Kovashka

We dedicate our study to understand the dynamic structure of video ads automatically.

Persuasive Faces: Generating Faces in Advertisements

no code implementations25 Jul 2018 Christopher Thomas, Adriana Kovashka

We show how our model can be used to produce visually distinct faces which appear to be from a fixed ad topic category.

Face Generation Generative Adversarial Network

Image Retrieval with Mixed Initiative and Multimodal Feedback

no code implementations8 May 2018 Nils Murrugarra-Llerena, Adriana Kovashka

How would you search for a unique, fashionable shoe that a friend wore and you want to buy, but you didn't take a picture?

Attribute Image Retrieval +2

ADVISE: Symbolism and External Knowledge for Decoding Advertisements

no code implementations ECCV 2018 Keren Ye, Adriana Kovashka

In order to convey the most content in their limited space, advertisements embed references to outside knowledge via symbolism.

Clustering Image Captioning +2

Automatic Understanding of Image and Video Advertisements

no code implementations CVPR 2017 Zaeem Hussain, Mingda Zhang, Xiaozhong Zhang, Keren Ye, Christopher Thomas, Zuha Agha, Nathan Ong, Adriana Kovashka

There is more to images than their objective physical content: for example, advertisements are created to persuade a viewer to take a certain action.

Crowdsourcing in Computer Vision

no code implementations7 Nov 2016 Adriana Kovashka, Olga Russakovsky, Li Fei-Fei, Kristen Grauman

Computer vision systems require large amounts of manually annotated data to properly learn challenging visual concepts.

Object Recognition

Seeing Behind the Camera: Identifying the Authorship of a Photograph

no code implementations CVPR 2016 Christopher Thomas, Adriana Kovashka

To explore the feasibility of current computer vision techniques to address this problem, we created a new dataset of over 180, 000 images taken by 41 well-known photographers.

WhittleSearch: Interactive Image Search with Relative Attribute Feedback

no code implementations15 May 2015 Adriana Kovashka, Devi Parikh, Kristen Grauman

We propose a novel mode of feedback for image search, where a user describes which properties of exemplar images should be adjusted in order to more closely match his/her mental model of the image sought.

Attribute Image Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.