no code implementations • CVPR 2024 • Xiaojuan Wang, Janne Kontkanen, Brian Curless, Steve Seitz, Ira Kemelmacher, Ben Mildenhall, Pratul Srinivasan, Dor Verbin, Aleksander Holynski
We present a method that uses a text-to-image model to generate consistent content across multiple image scales, enabling extreme semantic zooms into a scene, e. g., ranging from a wide-angle landscape view of a forest to a macro shot of an insect sitting on one of the tree branches.
no code implementations • 12 Oct 2023 • Mengyi Shan, Brian Curless, Ira Kemelmacher-Shlizerman, Steve Seitz
We present a system that automatically brings street view imagery to life by populating it with naturally behaving, animated pedestrians and vehicles.
no code implementations • ICCV 2021 • Soumyadip Sengupta, Brian Curless, Ira Kemelmacher-Shlizerman, Steve Seitz
Whereas existing light stages require expensive, room-scale spherical capture gantries and exist in only a few labs in the world, we demonstrate how to acquire useful data from a normal TV or desktop monitor.
2 code implementations • CVPR 2021 • Shanchuan Lin, Andrey Ryabtsev, Soumyadip Sengupta, Brian Curless, Steve Seitz, Ira Kemelmacher-Shlizerman
We introduce a real-time, high-resolution background replacement technique which operates at 30fps in 4K resolution, and 60fps for HD on a modern GPU.
1 code implementation • NeurIPS 2020 • Teerapat Jenrungrot, Vivek Jayaram, Steve Seitz, Ira Kemelmacher-Shlizerman
Given a multi-microphone recording of an unknown number of speakers talking concurrently, we simultaneously localize the sources and separate the individual speakers.
2 code implementations • ECCV 2020 • Luyang Zhu, Konstantinos Rematas, Brian Curless, Steve Seitz, Ira Kemelmacher-Shlizerman
Based on these models, we introduce a new method that takes as input a single photo of a clothed player in any basketball pose and outputs a high resolution mesh and 3D pose for that player.
no code implementations • ECCV 2020 • Yifan Wang, Brian Curless, Steve Seitz
By analyzing the motion of people and other objects in a scene, we demonstrate how to infer depth, occlusion, lighting, and shadow information from video taken from a single camera viewpoint.
1 code implementation • CVPR 2020 • Soumyadip Sengupta, Vivek Jayaram, Brian Curless, Steve Seitz, Ira Kemelmacher-Shlizerman
To bridge the domain gap to real imagery with no labeling, we train another matting network guided by the first network and by a discriminator that judges the quality of composites.
Ranked #1 on Image Matting on Adobe Matting
no code implementations • CVPR 2020 • Jeong Joon Park, Aleksander Holynski, Steve Seitz
We address the dual problems of novel view synthesis and environment reconstruction from hand-held RGBD sensors.
no code implementations • 21 Aug 2019 • Xuan Luo, Yanmeng Kong, Jason Lawrence, Ricardo Martin-Brualla, Steve Seitz
This paper introduces the largest and most diverse collection of rectified stereo image pairs to the research community, KeystoneDepth, consisting of tens of thousands of stereographs of historical people, events, objects, and scenes between 1860 and 1963.
no code implementations • 12 Nov 2018 • Ricardo Martin-Brualla, Rohit Pandey, Shuoran Yang, Pavel Pidlypenskyi, Jonathan Taylor, Julien Valentin, Sameh Khamis, Philip Davidson, Anastasia Tkach, Peter Lincoln, Adarsh Kowdle, Christoph Rhemann, Dan B. Goldman, Cem Keskin, Steve Seitz, Shahram Izadi, Sean Fanello
We take the novel approach to augment such real-time performance capture systems with a deep architecture that takes a rendering from an arbitrary viewpoint, and jointly performs completion, super resolution, and denoising of the imagery in real-time.
no code implementations • 6 Sep 2018 • Jeong Joon Park, Richard Newcombe, Steve Seitz
We present an approach for interactively scanning highly reflective objects with a commodity RGBD sensor.
no code implementations • CVPR 2018 • Konstantinos Rematas, Ira Kemelmacher-Shlizerman, Brian Curless, Steve Seitz
We present a system that transforms a monocular video of a soccer game into a moving 3D reconstruction, in which the players and field can be rendered interactively with a 3D viewer or through an Augmented Reality device.
no code implementations • CVPR 2016 • Ira Kemelmacher-Shlizerman, Steve Seitz, Daniel Miller, Evan Brossard
Our key observations are that testing at the million scale reveals big performance differences (of algorithms that perform similarly well on smaller scale) and that age invariant recognition as well as pose are still challenging for most.
no code implementations • 2 Jun 2015 • Supasorn Suwajanakorn, Ira Kemelmacher-Shlizerman, Steve Seitz
We reconstruct a controllable model of a person from a large photo collection that captures his or her {\em persona}, i. e., physical appearance and behavior.