Search Results for author: Brian Price

Found 58 papers, 23 papers with code

SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis

no code implementations 6 Nov 2023 Hanrong Ye, Jason Kuen, Qing Liu, Zhe Lin, Brian Price, Dan Xu

On the highly competitive ADE20K and COCO benchmarks, our data generation method markedly improves the performance of state-of-the-art segmentation models in semantic segmentation, panoptic segmentation, and instance segmentation.

Image Generation Image Segmentation +3

Putting the Object Back into Video Object Segmentation

1 code implementation 19 Oct 2023 Ho Kei Cheng, Seoung Wug Oh, Brian Price, Joon-Young Lee, Alexander Schwing

We present Cutie, a video object segmentation (VOS) network with object-level memory reading, which puts the object representation from memory back into the video object segmentation result.

Object Segmentation +3

Tracking Anything with Decoupled Video Segmentation

1 code implementation ICCV 2023 Ho Kei Cheng, Seoung Wug Oh, Brian Price, Alexander Schwing, Joon-Young Lee

To 'track anything' without training on video data for every individual task, we develop a decoupled video segmentation approach (DEVA), composed of task-specific image-level segmentation and class/task-agnostic bi-directional temporal propagation.

Ranked #1 on Unsupervised Video Object Segmentation on DAVIS 2016 val (using extra training data)

Open-Vocabulary Video Segmentation Open-World Video Segmentation +7

Interactive Segmentation for Diverse Gesture Types Without Context

no code implementations 20 Jul 2023 Josh Myers-Dean, Yifei Fan, Brian Price, Wilson Chan, Danna Gurari

Interactive segmentation entails a human marking an image to guide how a model either creates or edits a segmentation.

Interactive Segmentation Segmentation

Towards Open-World Segmentation of Parts

1 code implementation CVPR 2023 Tai-Yu Pan, Qing Liu, Wei-Lun Chao, Brian Price

Second, we introduce a novel approach to improve part segmentation on unseen objects, inspired by an interesting finding -- for unseen objects, the pixel-wise features extracted by the model often reveal high-quality part segments.

Contrastive Learning Segmentation

GamutMLP: A Lightweight MLP for Color Loss Recovery

no code implementations CVPR 2023 Hoang M. Le, Brian Price, Scott Cohen, Michael S. Brown

Inspired by neural implicit representations for 2D images, we propose a method that optimizes a lightweight multi-layer-perceptron (MLP) model during the gamut reduction step to predict the clipped values.

Improving Diffusion Models for Scene Text Editing with Dual Encoders

1 code implementation 12 Apr 2023 Jiabao Ji, Guanhua Zhang, Zhaowen Wang, Bairu Hou, Zhifei Zhang, Brian Price, Shiyu Chang

Scene text editing is a challenging task that involves modifying or inserting specified texts in an image while maintaining its natural and realistic appearance.

Scene Text Editing Style Transfer +1

ObjectStitch: Object Compositing With Diffusion Model

no code implementations CVPR 2023 Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, Daniel Aliaga

Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results.

Data Augmentation Object

ObjectStitch: Generative Object Compositing

1 code implementation 2 Dec 2022 Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, Daniel Aliaga

Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results.

Data Augmentation Object

One-Trimap Video Matting

1 code implementation 27 Jul 2022 Hongje Seong, Seoung Wug Oh, Brian Price, Euntai Kim, Joon-Young Lee

A key aspect of OTVM is the joint modeling of trimap propagation and alpha prediction.

Image Matting Video Matting

Generalizing Interactive Backpropagating Refinement for Dense Prediction Networks

no code implementations CVPR 2022 Fanqing Lin, Brian Price, Tony Martinez

Recently, the feature backpropagating refinement scheme (f-BRS) has been proposed for the task of interactive segmentation, which enables efficient optimization of a small set of auxiliary variables inserted into the pretrained network to produce object segmentation that better aligns with user inputs.

Image Matting Interactive Segmentation +3

Generalized Few-Shot Semantic Segmentation: All You Need is Fine-Tuning

no code implementations 21 Dec 2021 Josh Myers-Dean, Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari

Generalized few-shot semantic segmentation was introduced to move beyond only evaluating few-shot segmentation models on novel classes to include testing their ability to remember base classes.

Generalized Few-Shot Semantic Segmentation Meta-Learning +2

Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach

1 code implementation CVPR 2021 Xingqian Xu, Zhifei Zhang, Zhaowen Wang, Brian Price, Zhonghao Wang, Humphrey Shi

We also introduce Text Refinement Network (TexRNet), a novel text segmentation approach that adapts to the unique properties of text, e.g., non-convex boundary, diverse texture, etc., which often impose burdens on traditional segmentation models.

Segmentation Style Transfer +2

Text and Style Conditioned GAN for Generation of Offline Handwriting Lines

1 code implementation 1 Sep 2020 Brian Davis, Chris Tensmeyer, Brian Price, Curtis Wigington, Bryan Morse, Rajiv Jain

This paper presents a GAN for generating images of handwritten lines conditioned on arbitrary text and latent style vectors.

Handwriting generation

Objectness-Aware Few-Shot Semantic Segmentation

1 code implementation 6 Apr 2020 Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari

We demonstrate how to increase overall model capacity to achieve improved performance, by introducing objectness, which is class-agnostic and so not prone to overfitting, for complementary use with class-specific features.

Few-Shot Semantic Segmentation Segmentation +1

DeepStrip: High Resolution Boundary Refinement

no code implementations 25 Mar 2020 Peng Zhou, Brian Price, Scott Cohen, Gregg Wilensky, Larry S. Davis

In this paper, we target refining the boundaries in high resolution images given low resolution masks.

Vocal Bursts Intensity Prediction

Getting to 99% Accuracy in Interactive Segmentation

3 code implementations 17 Mar 2020 Marco Forte, Brian Price, Scott Cohen, Ning Xu, François Pitié

We propose a novel interactive architecture and a novel training scheme that are both tailored to better exploit the user workflow.

Interactive Segmentation

Deep Visual Template-Free Form Parsing

3 code implementations 5 Sep 2019 Brian Davis, Bryan Morse, Scott Cohen, Brian Price, Chris Tensmeyer

Automatic, template-free extraction of information from form images is challenging due to the variety of form layouts.

Answering Questions about Data Visualizations using Efficient Bimodal Fusion

1 code implementation 5 Aug 2019 Kushal Kafle, Robik Shrestha, Brian Price, Scott Cohen, Christopher Kanan

Chart question answering (CQA) is a newly proposed visual question answering (VQA) task where an algorithm must answer questions about data visualizations, e.g., bar charts, pie charts, and line graphs.

Chart Question Answering Optical Character Recognition +3

Image Recoloring Based on Object Color Distributions

1 code implementation Eurographics 2019 - Short Papers 2019 Mahmoud Afifi, Brian Price, Scott Cohen, and Michael S. Brown

We present a method to perform automatic image recoloring based on the distribution of colors associated with objects present in an image.

Object Segmentation +1

Measuring Human Perception to Improve Handwritten Document Transcription

no code implementations 7 Apr 2019 Samuel Grieggs, Bingyu Shen, Greta Rauch, Pei Li, Jiaqi Ma, David Chiang, Brian Price, Walter J. Scheirer

The subtleties of human perception, as measured by vision scientists through the use of psychophysics, are important clues to the internal workings of visual recognition.

YouTube-VOS: Sequence-to-Sequence Video Object Segmentation

4 code implementations ECCV 2018 Ning Xu, Linjie Yang, Yuchen Fan, Jianchao Yang, Dingcheng Yue, Yuchen Liang, Brian Price, Scott Cohen, Thomas Huang

End-to-end sequential learning to explore spatial-temporal features for video segmentation is largely limited by the scale of available video segmentation datasets, i.e., even the largest video segmentation dataset only contains 90 short video clips.

Ranked #12 on Video Object Segmentation on YouTube-VOS 2018 (F-Measure (Unseen) metric)

Image Segmentation Object +7

Compositing-aware Image Search

no code implementations ECCV 2018 Hengshuang Zhao, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Brian Price, Jiaya Jia

We present a new image search technique that, given a background image, returns compatible foreground objects for image compositing tasks.

Image Retrieval Object

Interactive Boundary Prediction for Object Selection

no code implementations ECCV 2018 Hoang Le, Long Mai, Brian Price, Scott Cohen, Hailin Jin, Feng Liu

Instead of relying on pre-defined low-level image features, our method adaptively predicts object boundaries according to image content and user interactions.

Image Segmentation Interactive Segmentation +3

Start, Follow, Read: End-to-End Full-Page Handwriting Recognition

1 code implementation ECCV 2018 Curtis Wigington, Chris Tensmeyer, Brian Davis, William Barrett, Brian Price, Scott Cohen

Despite decades of research, offline handwriting recognition (HWR) of degraded historical documents remains a challenging problem, which if solved could greatly improve the searchability of online cultural heritage archives.

Handwriting Recognition Handwritten Text Recognition +4

Disentangling Structure and Aesthetics for Style-Aware Image Completion

no code implementations CVPR 2018 Andrew Gilbert, John Collomosse, Hailin Jin, Brian Price

Content-aware image completion or in-painting is a fundamental tool for the correction of defects or removal of objects in images.

Guided Image Inpainting: Replacing an Image Region by Pulling Content from Another Image

no code implementations 22 Mar 2018 Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari

Deep generative models have shown success in automatically synthesizing missing image regions using surrounding context.

Image Inpainting

Discriminability objective for training descriptive captions

1 code implementation CVPR 2018 Ruotian Luo, Brian Price, Scott Cohen, Gregory Shakhnarovich

One property that remains lacking in image captions generated by contemporary methods is discriminability: being able to tell two images apart given the caption for one of them.

Caption Generation Descriptive +1

Group-Theme Recoloring for Multi-Image Color Consistency

1 code implementation Pacific Graphics 2017 Rang Nguyen, Brian Price, Scott Cohen, and Michael S. Brown

Methods such as color transfer are effective in making an image share similar colors with a target image; however, color transfer is not suitable for modifying multiple images.

Deep GrabCut for Object Selection

no code implementations 2 Jul 2017 Ning Xu, Brian Price, Scott Cohen, Jimei Yang, Thomas Huang

In this paper, we propose a novel segmentation approach that uses a rectangle as a soft constraint by transforming it into a Euclidean distance map.

Instance Segmentation Interactive Segmentation +3

Depth From Defocus in the Wild

no code implementations CVPR 2017 Huixuan Tang, Scott Cohen, Brian Price, Stephen Schiller, Kiriakos N. Kutulakos

We consider the problem of two-frame depth from defocus in conditions unsuitable for existing methods yet typical of everyday photography: a handheld cellphone camera, a small aperture, a non-stationary scene and sparse surface texture.

Deep Image Matting

8 code implementations CVPR 2017 Ning Xu, Brian Price, Scott Cohen, Thomas Huang

We evaluate our algorithm on the image matting benchmark, our testing set, and a wide variety of real images.

Semantic Image Matting

SURGE: Surface Regularized Geometry Estimation from a Single Image

no code implementations NeurIPS 2016 Peng Wang, Xiaohui Shen, Bryan Russell, Scott Cohen, Brian Price, Alan L. Yuille

This paper introduces an approach to regularize 2.5D surface normal and depth predictions at each pixel given a single input image.

Salient Object Subitizing

no code implementations CVPR 2015 Jianming Zhang, Shugao Ma, Mehrnoosh Sameki, Stan Sclaroff, Margrit Betke, Zhe Lin, Xiaohui Shen, Brian Price, Radomir Mech

We study the problem of Salient Object Subitizing, i.e., predicting the existence and the number of salient objects in an image using holistic cues.

Image Retrieval Object +4

Two Illuminant Estimation and User Correction Preference

no code implementations CVPR 2016 Dongliang Cheng, Abdelrahman Abdelhamed, Brian Price, Scott Cohen, Michael S. Brown

Existing methods attempt to estimate a spatially varying illumination map; however, results are error-prone and the resulting illumination maps are too low-resolution to be used for proper spatially varying white-balance correction.

Vocal Bursts Valence Prediction

Interactive Segmentation on RGBD Images via Cue Selection

no code implementations CVPR 2016 Jie Feng, Brian Price, Scott Cohen, Shih-Fu Chang

While these methods achieve better results than color-based methods, they are still limited in either using depth as an additional color channel or simply combining depth with color in a linear way.

Image Retrieval Image Segmentation +5

Automatic Annotation of Structured Facts in Images

no code implementations WS 2016 Mohamed Elhoseiny, Scott Cohen, Walter Chang, Brian Price, Ahmed Elgammal

Motivated by the application of fact-level image understanding, we present an automatic method for data collection of structured visual facts from images with captions.

Minimum Barrier Salient Object Detection at 80 FPS

no code implementations ICCV 2015 Jianming Zhang, Stan Sclaroff, Zhe Lin, Xiaohui Shen, Brian Price, Radomir Mech

Powered by this fast MBD transform algorithm, the proposed salient object detection method runs at 80 FPS, and significantly outperforms previous methods with similar speed on four large benchmark datasets, and achieves comparable or better performance than state-of-the-art methods.

Ranked #6 on Video Salient Object Detection on VOS-T (using extra training data)

Object object-detection +2

Beyond White: Ground Truth Colors for Color Constancy Correction

no code implementations ICCV 2015 Dongliang Cheng, Brian Price, Scott Cohen, Michael S. Brown

A limitation in color constancy research is the inability to establish ground truth colors for evaluating corrected images.

Color Constancy

Sherlock: Scalable Fact Learning in Images

no code implementations 16 Nov 2015 Mohamed Elhoseiny, Scott Cohen, Walter Chang, Brian Price, Ahmed Elgammal

We show that learning visual facts in a structured way enables not only a uniform but also generalizable visual understanding.

Multiview Learning Retrieval

LCNN: Low-level Feature Embedded CNN for Salient Object Detection

no code implementations 17 Aug 2015 Hongyang Li, Huchuan Lu, Zhe Lin, Xiaohui Shen, Brian Price

In this paper, we propose a novel deep neural network framework embedded with low-level features (LCNN) for salient object detection in complex images.

object-detection RGB Salient Object Detection +1

PatchCut: Data-Driven Object Segmentation via Local Shape Transfer

no code implementations CVPR 2015 Jimei Yang, Brian Price, Scott Cohen, Zhe Lin, Ming-Hsuan Yang

The transferred local shape masks constitute a patch-level segmentation solution space and we thus develop a novel cascade algorithm, PatchCut, for coarse-to-fine object segmentation.

Object Object Discovery +2

Effective Learning-Based Illuminant Estimation Using Simple Features

no code implementations CVPR 2015 Dongliang Cheng, Brian Price, Scott Cohen, Michael S. Brown

More recent state-of-the-art methods employ learning-based techniques that produce better results, but often rely on complex features and have long evaluation and training times.

Color Constancy

Towards Unified Depth and Semantic Prediction From a Single Image

no code implementations CVPR 2015 Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan L. Yuille

By allowing for interactions between the depth and semantic information, the joint network provides more accurate depth prediction than a state-of-the-art CNN trained solely for depth prediction [5].

Depth Estimation Depth Prediction +1

Inner and Inter Label Propagation: Salient Object Detection in the Wild

2 code implementations 27 May 2015 Hongyang Li, Huchuan Lu, Zhe Lin, Xiaohui Shen, Brian Price

For most natural images, some boundary superpixels serve as the background labels and the saliency of other superpixels is determined by ranking their similarities to the boundary labels based on an inner propagation scheme.

Computational Efficiency object-detection +4

Joint Object and Part Segmentation using Deep Learned Potentials

no code implementations ICCV 2015 Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan Yuille

Segmenting semantic objects from images and parsing them into their respective semantic parts are fundamental steps towards detailed object understanding in computer vision.

Object Segmentation +1

Semantic Object Selection

no code implementations CVPR 2014 Ejaz Ahmed, Scott Cohen, Brian Price

With the tag provided by the user we do a text query of an image database to gather exemplars of the object.

Image Retrieval Object +5

Improving Image Matting Using Comprehensive Sampling Sets

no code implementations CVPR 2013 Ehsan Shahrian, Deepu Rajan, Brian Price, Scott Cohen

The first is that the range in which the foreground and background are sampled is often limited to such an extent that the true foreground and background colors are not present.

Image Matting
