Search Results for author: Danna Gurari

Found 29 papers, 10 papers with code

Iterative Feature Transformation for Fast and Versatile Universal Style Transfer

1 code implementation ECCV 2020 Tai-Yin Chiu, Danna Gurari

The general framework for fast universal style transfer consists of an autoencoder and a feature transformation at the bottleneck.

Style Transfer

Interpreting COVID Lateral Flow Tests' Results with Foundation Models

no code implementations21 Apr 2024 Stuti Pandey, Josh Myers-Dean, Jarek Reynolds, Danna Gurari

Accordingly, we explore the abilities of modern foundation vision language models (VLMs) in interpreting such tests.

An Evaluation of GPT-4V and Gemini in Online VQA

no code implementations17 Dec 2023 Mengchen Liu, Chongyan Chen, Danna Gurari

While there is much excitement about the potential of large multimodal models (LMM), a comprehensive evaluation is critical to establish their true capabilities and limitations.

Question Answering Visual Question Answering

Interactive Segmentation for Diverse Gesture Types Without Context

no code implementations20 Jul 2023 Josh Myers-Dean, Yifei Fan, Brian Price, Wilson Chan, Danna Gurari

Interactive segmentation entails a human marking an image to guide how a model either creates or edits a segmentation.

Interactive Segmentation Segmentation

Helping Visually Impaired People Take Better Quality Pictures

1 code implementation14 May 2023 Maniratnam Mandal, Deepti Ghadiyaram, Danna Gurari, Alan C. Bovik

The photographs taken by visually impaired users often suffer from one or both of two kinds of quality issues: technical quality (distortions), and semantic quality, such as framing and aesthetic composition.

Multi-Task Learning

Salient Object Detection for Images Taken by People With Vision Impairments

no code implementations12 Jan 2023 Jarek Reynolds, Chandra Kanth Nagesh, Danna Gurari

Salient object detection is the task of producing a binary mask for an image that deciphers which pixels belong to the foreground object versus background.

Object object-detection +2

Line Search-Based Feature Transformation for Fast, Stable, and Tunable Content-Style Control in Photorealistic Style Transfer

no code implementations12 Oct 2022 Tai-Yin Chiu, Danna Gurari

Photorealistic style transfer is the task of synthesizing a realistic-looking image when adapting the content from one image to appear in the style of another image.

Style Transfer

VizWiz-FewShot: Locating Objects in Images Taken by People With Visual Impairments

no code implementations24 Jul 2022 Yu-Yun Tseng, Alexander Bell, Danna Gurari

Compared to existing few-shot object detection and instance segmentation datasets, our dataset is the first to locate holes in objects (e. g., found in 12. 3\% of our segmentations), it shows objects that occupy a much larger range of sizes relative to the images, and text is over five times more common in our objects (e. g., found in 22. 4\% of our segmentations).

Few-Shot Object Detection Instance Segmentation +1

Generalized Few-Shot Semantic Segmentation: All You Need is Fine-Tuning

no code implementations21 Dec 2021 Josh Myers-Dean, Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari

Generalized few-shot semantic segmentation was introduced to move beyond only evaluating few-shot segmentation models on novel classes to include testing their ability to remember base classes.

Generalized Few-Shot Semantic Segmentation Meta-Learning +2

PhotoWCT$^2$: Compact Autoencoder for Photorealistic Style Transfer Resulting from Blockwise Training and Skip Connections of High-Frequency Residuals

1 code implementation22 Oct 2021 Tai-Yin Chiu, Danna Gurari

First, we introduce blockwise training to perform coarse-to-fine feature transformations that enable state-of-art stylization strength in a single autoencoder in place of the inefficient cascade of four autoencoders used in PhotoWCT.

4k Style Transfer

CrowdMOT: Crowdsourcing Strategies for Tracking Multiple Objects in Videos

no code implementations29 Sep 2020 Samreen Anjum, Chi Lin, Danna Gurari

Crowdsourcing is a valuable approach for tracking objects in videos in a more scalable manner than possible with domain experts.

Objectness-Aware Few-Shot Semantic Segmentation

1 code implementation6 Apr 2020 Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari

We demonstrate how to increase overall model capacity to achieve improved performance, by introducing objectness, which is class-agnostic and so not prone to overfitting, for complementary use with class-specific features.

Few-Shot Semantic Segmentation Segmentation +1

Assessing Image Quality Issues for Real-World Problems

no code implementations CVPR 2020 Tai-Yin Chiu, Yinan Zhao, Danna Gurari

We introduce a new large-scale dataset that links the assessment of image quality issues to two practical vision tasks: image captioning and visual question answering.

Image Captioning Question Answering +1

Captioning Images Taken by People Who Are Blind

no code implementations ECCV 2020 Danna Gurari, Yinan Zhao, Meng Zhang, Nilavra Bhattacharya

While an important problem in the vision community is to design algorithms that can automatically caption images, few publicly-available datasets for algorithm development directly address the interests of real users.

Image Captioning

VizWiz Dataset Browser: A Tool for Visualizing Machine Learning Datasets

no code implementations19 Dec 2019 Nilavra Bhattacharya, Danna Gurari

We present a visualization tool to exhaustively search and browse through a set of large-scale machine learning datasets.

BIG-bench Machine Learning

VizWiz-Priv: A Dataset for Recognizing the Presence and Purpose of Private Visual Information in Images Taken by Blind People

no code implementations CVPR 2019 Danna Gurari, Qing Li, Chi Lin, Yinan Zhao, Anhong Guo, Abigale Stangl, Jeffrey P. Bigham

We introduce the first visual privacy dataset originating from people who are blind in order to better understand their privacy disclosures and to encourage the development of algorithms that can assist in preventing their unintended disclosures.

Predicting How to Distribute Work Between Algorithms and Humans to Segment an Image Batch

no code implementations30 Apr 2019 Danna Gurari, Yinan Zhao, Suyog Dutt Jain, Margrit Betke, Kristen Grauman

We propose a resource allocation framework for predicting how best to allocate a fixed budget of human annotation effort in order to collect higher quality segmentations for a given batch of images and automated methods.

Semantic Segmentation

Guided Image Inpainting: Replacing an Image Region by Pulling Content from Another Image

no code implementations22 Mar 2018 Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari

Deep generative models have shown success in automatically synthesizing missing image regions using surrounding context.

Image Inpainting

VizWiz Grand Challenge: Answering Visual Questions from Blind People

1 code implementation CVPR 2018 Danna Gurari, Qing Li, Abigale J. Stangl, Anhong Guo, Chi Lin, Kristen Grauman, Jiebo Luo, Jeffrey P. Bigham

The study of algorithms to automatically answer visual questions currently is motivated by visual question answering (VQA) datasets constructed in artificial VQA settings.

Question Answering Visual Question Answering

Predicting Foreground Object Ambiguity and Efficiently Crowdsourcing the Segmentation(s)

no code implementations30 Apr 2017 Danna Gurari, Kun He, Bo Xiong, Jianming Zhang, Mehrnoosh Sameki, Suyog Dutt Jain, Stan Sclaroff, Margrit Betke, Kristen Grauman

We propose the ambiguity problem for the foreground object segmentation task and motivate the importance of estimating and accounting for this ambiguity when designing vision systems.

Object Semantic Segmentation +1

Visual Question: Predicting If a Crowd Will Agree on the Answer

no code implementations29 Aug 2016 Danna Gurari, Kristen Grauman

Visual question answering (VQA) systems are emerging from a desire to empower users to ask any natural language question about visual content and receive a valid answer in response.

Question Answering valid +1

Pull the Plug? Predicting If Computers or Humans Should Segment Images

no code implementations CVPR 2016 Danna Gurari, Suyog Jain, Margrit Betke, Kristen Grauman

We propose a resource allocation framework for predicting how best to allocate a fixed budget of human annotation effort in order to collect higher quality segmentations for a given batch of images and automated methods.

Semantic Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.