Search Results for author: Brian Price

Found 58 papers, 23 papers with code

PhraseClick: Toward Achieving Flexible Interactive Segmentation by Phrase and Click

no code implementations • ECCV 2020 • Henghui Ding, Scott Cohen, Brian Price, Xudong Jiang

We propose to employ phrase expressions as another interaction input to infer the attributes of the target object.

IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation

no code implementations • 15 Mar 2024 • Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, He Zhang, Wei Xiong, Daniel Aliaga

Generative object compositing emerges as a promising new avenue for compositional image editing.

Object

SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis

no code implementations • 6 Nov 2023 • Hanrong Ye, Jason Kuen, Qing Liu, Zhe Lin, Brian Price, Dan Xu

On the highly competitive ADE20K and COCO benchmarks, our data generation method markedly improves the performance of state-of-the-art segmentation models in semantic segmentation, panoptic segmentation, and instance segmentation.

Image Generation Image Segmentation +3

Putting the Object Back into Video Object Segmentation

1 code implementation • 19 Oct 2023 • Ho Kei Cheng, Seoung Wug Oh, Brian Price, Joon-Young Lee, Alexander Schwing

We present Cutie, a video object segmentation (VOS) network with object-level memory reading, which puts the object representation from memory back into the video object segmentation result.

Ranked #1 on Semi-Supervised Video Object Segmentation on MOSE

Object Segmentation +3

448

Tracking Anything with Decoupled Video Segmentation

1 code implementation • ICCV 2023 • Ho Kei Cheng, Seoung Wug Oh, Brian Price, Alexander Schwing, Joon-Young Lee

To 'track anything' without training on video data for every individual task, we develop a decoupled video segmentation approach (DEVA), composed of task-specific image-level segmentation and class/task-agnostic bi-directional temporal propagation.

Ranked #1 on Unsupervised Video Object Segmentation on DAVIS 2016 val (using extra training data)

Open-Vocabulary Video Segmentation Open-World Video Segmentation +7

1,049

Interactive Segmentation for Diverse Gesture Types Without Context

no code implementations • 20 Jul 2023 • Josh Myers-Dean, Yifei Fan, Brian Price, Wilson Chan, Danna Gurari

Interactive segmentation entails a human marking an image to guide how a model either creates or edits a segmentation.

Interactive Segmentation Segmentation

Towards Open-World Segmentation of Parts

1 code implementation • CVPR 2023 • Tai-Yu Pan, Qing Liu, Wei-Lun Chao, Brian Price

Second, we introduce a novel approach to improve part segmentation on unseen objects, inspired by an interesting finding -- for unseen objects, the pixel-wise features extracted by the model often reveal high-quality part segments.

Contrastive Learning Segmentation

GamutMLP: A Lightweight MLP for Color Loss Recovery

no code implementations • CVPR 2023 • Hoang M. Le, Brian Price, Scott Cohen, Michael S. Brown

Inspired by neural implicit representations for 2D images, we propose a method that optimizes a lightweight multi-layer-perceptron (MLP) model during the gamut reduction step to predict the clipped values.

Improving Diffusion Models for Scene Text Editing with Dual Encoders

1 code implementation • 12 Apr 2023 • Jiabao Ji, Guanhua Zhang, Zhaowen Wang, Bairu Hou, Zhifei Zhang, Brian Price, Shiyu Chang

Scene text editing is a challenging task that involves modifying or inserting specified texts in an image while maintaining its natural and realistic appearance.

Scene Text Editing Style Transfer +1

ObjectStitch: Object Compositing With Diffusion Model

no code implementations • CVPR 2023 • Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, Daniel Aliaga

Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results.

Data Augmentation Object

ObjectStitch: Generative Object Compositing

1 code implementation • 2 Dec 2022 • Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, Daniel Aliaga

Data Augmentation Object

One-Trimap Video Matting

1 code implementation • 27 Jul 2022 • Hongje Seong, Seoung Wug Oh, Brian Price, Euntai Kim, Joon-Young Lee

A key aspect of One-Trimap Video Matting (OTVM) is the joint modeling of trimap propagation and alpha prediction.

Image Matting Video Matting

Boosting Robustness of Image Matting with Context Assembling and Strong Data Augmentation

no code implementations • CVPR 2022 • Yutong Dai, Brian Price, He Zhang, Chunhua Shen

Deep image matting methods have achieved increasingly better results on benchmarks (e.g., Composition-1k/alphamatting.com).

Data Augmentation Image Matting

Generalizing Interactive Backpropagating Refinement for Dense Prediction Networks

no code implementations • CVPR 2022 • Fanqing Lin, Brian Price, Tony Martinez

Recently, the feature backpropagating refinement scheme (f-BRS) was proposed for the task of interactive segmentation; it enables efficient optimization of a small set of auxiliary variables inserted into the pretrained network to produce object segmentation that better aligns with user inputs.

Image Matting Interactive Segmentation +3

Generalized Few-Shot Semantic Segmentation: All You Need is Fine-Tuning

no code implementations • 21 Dec 2021 • Josh Myers-Dean, Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari

Generalized few-shot semantic segmentation was introduced to move beyond only evaluating few-shot segmentation models on novel classes to include testing their ability to remember base classes.

Generalized Few-Shot Semantic Segmentation Meta-Learning +2

Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach

1 code implementation • CVPR 2021 • Xingqian Xu, Zhifei Zhang, Zhaowen Wang, Brian Price, Zhonghao Wang, Humphrey Shi

We also introduce Text Refinement Network (TexRNet), a novel text segmentation approach that adapts to the unique properties of text, e.g., non-convex boundary, diverse texture, etc., which often impose burdens on traditional segmentation models.

Segmentation Style Transfer +2

237

Text and Style Conditioned GAN for Generation of Offline Handwriting Lines

1 code implementation • 1 Sep 2020 • Brian Davis, Chris Tensmeyer, Brian Price, Curtis Wigington, Bryan Morse, Rajiv Jain

This paper presents a GAN for generating images of handwritten lines conditioned on arbitrary text and latent style vectors.

Handwriting generation

Objectness-Aware Few-Shot Semantic Segmentation

1 code implementation • 6 Apr 2020 • Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari

We demonstrate how to increase overall model capacity to achieve improved performance, by introducing objectness, which is class-agnostic and so not prone to overfitting, for complementary use with class-specific features.

Few-Shot Semantic Segmentation Segmentation +1

766

DeepStrip: High Resolution Boundary Refinement

no code implementations • 25 Mar 2020 • Peng Zhou, Brian Price, Scott Cohen, Gregg Wilensky, Larry S. Davis

In this paper, we target refining the boundaries in high resolution images given low resolution masks.

Getting to 99% Accuracy in Interactive Segmentation

3 code implementations • 17 Mar 2020 • Marco Forte, Brian Price, Scott Cohen, Ning Xu, François Pitié

We propose a novel interactive architecture and a novel training scheme that are both tailored to better exploit the user workflow.

Interactive Segmentation

452

Deep Visual Template-Free Form Parsing

3 code implementations • 5 Sep 2019 • Brian Davis, Bryan Morse, Scott Cohen, Brian Price, Chris Tensmeyer

Automatic, template-free extraction of information from form images is challenging due to the variety of form layouts.

Unconstrained Foreground Object Search

no code implementations • ICCV 2019 • Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari

Many people search for foreground objects to use when editing images.

Object

Answering Questions about Data Visualizations using Efficient Bimodal Fusion

1 code implementation • 5 Aug 2019 • Kushal Kafle, Robik Shrestha, Brian Price, Scott Cohen, Christopher Kanan

Chart question answering (CQA) is a newly proposed visual question answering (VQA) task where an algorithm must answer questions about data visualizations, e.g., bar charts, pie charts, and line graphs.

Ranked #1 on Visual Question Answering (VQA) on DVQA test-familiar

Chart Question Answering Optical Character Recognition +3

Image Recoloring Based on Object Color Distributions

1 code implementation • Eurographics 2019 - Short Papers • Mahmoud Afifi, Brian Price, Scott Cohen, Michael S. Brown

We present a method to perform automatic image recoloring based on the distribution of colors associated with objects present in an image.

Object Segmentation +1

Measuring Human Perception to Improve Handwritten Document Transcription

no code implementations • 7 Apr 2019 • Samuel Grieggs, Bingyu Shen, Greta Rauch, Pei Li, Jiaqi Ma, David Chiang, Brian Price, Walter J. Scheirer

The subtleties of human perception, as measured by vision scientists through the use of psychophysics, are important clues to the internal workings of visual recognition.

YouTube-VOS: Sequence-to-Sequence Video Object Segmentation

4 code implementations • ECCV 2018 • Ning Xu, Linjie Yang, Yuchen Fan, Jianchao Yang, Dingcheng Yue, Yuchen Liang, Brian Price, Scott Cohen, Thomas Huang

End-to-end sequential learning to explore spatial-temporal features for video segmentation is largely limited by the scale of available video segmentation datasets, i.e., even the largest video segmentation dataset only contains 90 short video clips.

Ranked #12 on Video Object Segmentation on YouTube-VOS 2018 (F-Measure (Unseen) metric)

Image Segmentation Object +7

Compositing-aware Image Search

no code implementations • ECCV 2018 • Hengshuang Zhao, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Brian Price, Jiaya Jia

We present a new image search technique that, given a background image, returns compatible foreground objects for image compositing tasks.

Image Retrieval Object

Interactive Boundary Prediction for Object Selection

no code implementations • ECCV 2018 • Hoang Le, Long Mai, Brian Price, Scott Cohen, Hailin Jin, Feng Liu

Instead of relying on pre-defined low-level image features, our method adaptively predicts object boundaries according to image content and user interactions.

Image Segmentation Interactive Segmentation +3

Start, Follow, Read: End-to-End Full-Page Handwriting Recognition

1 code implementation • ECCV 2018 • Curtis Wigington, Chris Tensmeyer, Brian Davis, William Barrett, Brian Price, Scott Cohen

Despite decades of research, offline handwriting recognition (HWR) of degraded historical documents remains a challenging problem, which if solved could greatly improve the searchability of online cultural heritage archives.

Ranked #12 on Handwritten Text Recognition on IAM

Handwriting Recognition Handwritten Text Recognition +4

Disentangling Structure and Aesthetics for Style-Aware Image Completion

no code implementations • CVPR 2018 • Andrew Gilbert, John Collomosse, Hailin Jin, Brian Price

Content-aware image completion or in-painting is a fundamental tool for the correction of defects or removal of objects in images.

Guided Image Inpainting: Replacing an Image Region by Pulling Content from Another Image

no code implementations • 22 Mar 2018 • Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari

Deep generative models have shown success in automatically synthesizing missing image regions using surrounding context.

Image Inpainting

Discriminability objective for training descriptive captions

1 code implementation • CVPR 2018 • Ruotian Luo, Brian Price, Scott Cohen, Gregory Shakhnarovich

One property that remains lacking in image captions generated by contemporary methods is discriminability: being able to tell two images apart given the caption for one of them.

Caption Generation Descriptive +1

110

DVQA: Understanding Data Visualizations via Question Answering

1 code implementation • CVPR 2018 • Kushal Kafle, Brian Price, Scott Cohen, Christopher Kanan

Bar charts are an effective way to convey numeric information, but today's algorithms cannot parse them.

Chart Question Answering Question Answering +1

Group-Theme Recoloring for Multi-Image Color Consistency

1 code implementation • Pacific Graphics 2017 • Rang Nguyen, Brian Price, Scott Cohen, Michael S. Brown

Methods such as color transfer are effective in making an image share similar colors with a target image; however, color transfer is not suitable for modifying multiple images.

Deep GrabCut for Object Selection

no code implementations • 2 Jul 2017 • Ning Xu, Brian Price, Scott Cohen, Jimei Yang, Thomas Huang

In this paper, we propose a novel segmentation approach that uses a rectangle as a soft constraint by transforming it into a Euclidean distance map.

Instance Segmentation Interactive Segmentation +3

Depth From Defocus in the Wild

no code implementations • CVPR 2017 • Huixuan Tang, Scott Cohen, Brian Price, Stephen Schiller, Kiriakos N. Kutulakos

We consider the problem of two-frame depth from defocus in conditions unsuitable for existing methods yet typical of everyday photography: a handheld cellphone camera, a small aperture, a non-stationary scene and sparse surface texture.

Forecasting Human Dynamics from Static Images

no code implementations • CVPR 2017 • Yu-Wei Chao, Jimei Yang, Brian Price, Scott Cohen, Jia Deng

This paper presents the first study on forecasting human dynamics from static images.

Human Dynamics Pose Estimation

Deep Image Matting

8 code implementations • CVPR 2017 • Ning Xu, Brian Price, Scott Cohen, Thomas Huang

We evaluate our algorithm on the image matting benchmark, our testing set, and a wide variety of real images.

Ranked #3 on Semantic Image Matting on Semantic Image Matting Dataset

Semantic Image Matting

8,226

SURGE: Surface Regularized Geometry Estimation from a Single Image

no code implementations • NeurIPS 2016 • Peng Wang, Xiaohui Shen, Bryan Russell, Scott Cohen, Brian Price, Alan L. Yuille

This paper introduces an approach to regularize 2.5D surface normal and depth predictions at each pixel given a single input image.

Salient Object Subitizing

no code implementations • CVPR 2015 • Jianming Zhang, Shugao Ma, Mehrnoosh Sameki, Stan Sclaroff, Margrit Betke, Zhe Lin, Xiaohui Shen, Brian Price, Radomir Mech

We study the problem of Salient Object Subitizing, i.e., predicting the existence and the number of salient objects in an image using holistic cues.

Image Retrieval Object +4

Two Illuminant Estimation and User Correction Preference

no code implementations • CVPR 2016 • Dongliang Cheng, Abdelrahman Abdelhamed, Brian Price, Scott Cohen, Michael S. Brown

Existing methods attempt to estimate a spatially varying illumination map; however, the results are error-prone and the resulting illumination maps are too low-resolution to be used for proper spatially varying white-balance correction.

Interactive Segmentation on RGBD Images via Cue Selection

no code implementations • CVPR 2016 • Jie Feng, Brian Price, Scott Cohen, Shih-Fu Chang

While these methods achieve better results than color-based methods, they are still limited in either using depth as an additional color channel or simply combining depth with color in a linear way.

Image Retrieval Image Segmentation +5

Unconstrained Salient Object Detection via Proposal Subset Optimization

1 code implementation • CVPR 2016 • Jianming Zhang, Stan Sclaroff, Zhe Lin, Xiaohui Shen, Brian Price, Radomir Mech

Our system leverages a Convolutional-Neural-Network model to generate location proposals of salient objects.

Object object-detection +2

Automatic Annotation of Structured Facts in Images

no code implementations • WS 2016 • Mohamed Elhoseiny, Scott Cohen, Walter Chang, Brian Price, Ahmed Elgammal

Motivated by the application of fact-level image understanding, we present an automatic method for data collection of structured visual facts from images with captions.

Object Contour Detection with a Fully Convolutional Encoder-Decoder Network

3 code implementations • CVPR 2016 • Jimei Yang, Brian Price, Scott Cohen, Honglak Lee, Ming-Hsuan Yang

We develop a deep learning algorithm for contour detection with a fully convolutional encoder-decoder network.

Contour Detection Edge Detection +1

121

Deep Interactive Object Selection

3 code implementations • CVPR 2016 • Ning Xu, Brian Price, Scott Cohen, Jimei Yang, Thomas Huang

Interactive object selection is a very important research problem and has many applications.

Ranked #11 on Interactive Segmentation on SBD

Interactive Segmentation Object

Minimum Barrier Salient Object Detection at 80 FPS

no code implementations • ICCV 2015 • Jianming Zhang, Stan Sclaroff, Zhe Lin, Xiaohui Shen, Brian Price, Radomir Mech

Powered by this fast MBD transform algorithm, the proposed salient object detection method runs at 80 FPS, and significantly outperforms previous methods with similar speed on four large benchmark datasets, and achieves comparable or better performance than state-of-the-art methods.

Ranked #6 on Video Salient Object Detection on VOS-T (using extra training data)

Object object-detection +2

Beyond White: Ground Truth Colors for Color Constancy Correction

no code implementations • ICCV 2015 • Dongliang Cheng, Brian Price, Scott Cohen, Michael S. Brown

A limitation in color constancy research is the inability to establish ground truth colors for evaluating corrected images.

Color Constancy

Sherlock: Scalable Fact Learning in Images

no code implementations • 16 Nov 2015 • Mohamed Elhoseiny, Scott Cohen, Walter Chang, Brian Price, Ahmed Elgammal

We show that learning visual facts in a structured way enables not only a uniform but also generalizable visual understanding.

Multiview Learning Retrieval

LCNN: Low-level Feature Embedded CNN for Salient Object Detection

no code implementations • 17 Aug 2015 • Hongyang Li, Huchuan Lu, Zhe Lin, Xiaohui Shen, Brian Price

In this paper, we propose a novel deep neural network framework embedded with low-level features (LCNN) for salient object detection in complex images.

object-detection RGB Salient Object Detection +1

PatchCut: Data-Driven Object Segmentation via Local Shape Transfer

no code implementations • CVPR 2015 • Jimei Yang, Brian Price, Scott Cohen, Zhe Lin, Ming-Hsuan Yang

The transferred local shape masks constitute a patch-level segmentation solution space and we thus develop a novel cascade algorithm, PatchCut, for coarse-to-fine object segmentation.

Object Object Discovery +2

Effective Learning-Based Illuminant Estimation Using Simple Features

no code implementations • CVPR 2015 • Dongliang Cheng, Brian Price, Scott Cohen, Michael S. Brown

More recent state-of-the-art methods employ learning-based techniques that produce better results, but often rely on complex features and have long evaluation and training times.

Color Constancy

Towards Unified Depth and Semantic Prediction From a Single Image

no code implementations • CVPR 2015 • Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan L. Yuille

By allowing for interactions between the depth and semantic information, the joint network provides more accurate depth prediction than a state-of-the-art CNN trained solely for depth prediction [5].

Depth Estimation Depth Prediction +1

Inner and Inter Label Propagation: Salient Object Detection in the Wild

2 code implementations • 27 May 2015 • Hongyang Li, Huchuan Lu, Zhe Lin, Xiaohui Shen, Brian Price

For most natural images, some boundary superpixels serve as the background labels and the saliency of other superpixels is determined by ranking their similarities to the boundary labels based on an inner propagation scheme.

Computational Efficiency object-detection +4

Joint Object and Part Segmentation using Deep Learned Potentials

no code implementations • ICCV 2015 • Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan Yuille

Segmenting semantic objects from images and parsing them into their respective semantic parts are fundamental steps towards detailed object understanding in computer vision.

Object Segmentation +1

Context Driven Scene Parsing with Attention to Rare Classes

no code implementations • CVPR 2014 • Jimei Yang, Brian Price, Scott Cohen, Ming-Hsuan Yang

This paper presents a scalable scene parsing algorithm based on image retrieval and superpixel matching.

General Classification Image Retrieval +2

Semantic Object Selection

no code implementations • CVPR 2014 • Ejaz Ahmed, Scott Cohen, Brian Price

With the tag provided by the user we do a text query of an image database to gather exemplars of the object.

Image Retrieval Object +5

Improving Image Matting Using Comprehensive Sampling Sets

no code implementations • CVPR 2013 • Ehsan Shahrian, Deepu Rajan, Brian Price, Scott Cohen

The first is that the range in which the foreground and background are sampled is often limited to such an extent that the true foreground and background colors are not present.

Image Matting
