no code implementations • 14 Aug 2024 • Seung Hyun Lee, Junjie Ke, Yinxiao Li, Junfeng He, Steven Hickson, Katie Datsenko, Sangpil Kim, Ming-Hsuan Yang, Irfan Essa, Feng Yang
The goal of image cropping is to identify visually appealing crops within an image.
1 code implementation • 16 Jun 2019 • Steven Hickson, Karthik Raveendran, Alireza Fathi, Kevin Murphy, Irfan Essa
We propose 4 insights that help to significantly improve the performance of deep learning models that predict surface normals and semantic labels from a single RGB image.
Ranked #1 on
Semantic Segmentation
on ScanNetV2
(Pixel Accuracy metric)
1 code implementation • CVPR 2014 • Steven Hickson, Stan Birchfield, Irfan Essa, Henrik Christensen
We present an efficient and scalable algorithm for segmenting 3D RGBD point clouds by combining depth, color, and temporal information using a multistage, hierarchical graph-based approach.
1 code implementation • 26 Jan 2018 • Steven Hickson, Anelia Angelova, Irfan Essa, Rahul Sukthankar
We consider the problem of retrieving objects from image data and learning to classify them into meaningful semantic categories with minimal supervision.
1 code implementation • 23 Jan 2018 • Daniel Castro, Steven Hickson, Patsorn Sangkloy, Bhavishya Mittal, Sean Dai, James Hays, Irfan Essa
We present a comparison of numerous state-of-the-art techniques on our dataset using three different representations (video, optical flow and multi-person pose data) in order to analyze these approaches.
1 code implementation • 2 Aug 2017 • Steven Hickson, Irfan Essa, Henrik Christensen
Most of the approaches for indoor RGBD semantic la- beling focus on using pixels or superpixels to train a classi- fier.
no code implementations • 2 Aug 2017 • Bryan Willimon, Steven Hickson, Ian Walker, Stan Birchfield
In particular, we show that our method is able to estimate the configuration of a textureless nonrigid object with no correspondences available.
no code implementations • 22 Jul 2017 • Steven Hickson, Nick Dufour, Avneesh Sud, Vivek Kwatra, Irfan Essa
One of the main challenges of social interaction in virtual reality settings is that head-mounted displays occlude a large portion of the face, blocking facial expressions and thereby restricting social engagement cues among users.
no code implementations • 6 Oct 2015 • Daniel Castro, Steven Hickson, Vinay Bettadapura, Edison Thomaz, Gregory Abowd, Henrik Christensen, Irfan Essa
We collected a dataset of 40, 103 egocentric images over a 6 month period with 19 activity classes and demonstrate the benefit of state-of-the-art deep learning techniques for learning and predicting daily activities.