1 code implementation • ICCV 2023 • Dongxu Zhao, Daniel Lichy, Pierre-Nicolas Perrin, Jan-Michael Frahm, Soumyadip Sengupta
We propose a fast and generalizable solution to Multi-view Photometric Stereo (MVPS), called MVPSNet.
no code implementations • 18 Mar 2023 • Thanh Vu, Baochen Sun, Bodi Yuan, Alex Ngai, Yueqi Li, Jan-Michael Frahm
The success of data mixing augmentations in image classification tasks has been well-received.
no code implementations • 13 Mar 2023 • Shuxian Wang, Yubo Zhang, Sarah K. McGill, Julian G. Rosenman, Jan-Michael Frahm, Soumyadip Sengupta, Stephen M. Pizer
Reconstructing a 3D surface from colonoscopy video is challenging due to illumination and reflectivity variation in the video frame that can cause defective shape predictions.
no code implementations • CVPR 2023 • Jialiang Wang, Daniel Scharstein, Akash Bapat, Kevin Blackburn-Matzen, Matthew Yu, Jonathan Lehman, Suhib Alsisan, Yanghan Wang, Sam Tsai, Jan-Michael Frahm, Zijian He, Peter Vajda, Michael F. Cohen, Matt Uyttendaele
We present the design of a productionized end-to-end stereo depth sensing system that does pre-processing, online stereo rectification, and stereo depth estimation with a fallback to monocular depth estimation when rectification is unreliable.
no code implementations • 4 Oct 2022 • Thanh Vu, Yanqi Zhou, Chunfeng Wen, Yueqi Li, Jan-Michael Frahm
Our first key insight is that MultiTask Learning (MTL) and hardware-aware Neural Architecture Search (NAS) can work in synergy to greatly benefit on-device Dense Predictions (DP).
Ranked #111 on Semantic Segmentation on NYU Depth v2
Hardware Aware Neural Architecture Search Neural Architecture Search +2
1 code implementation • 5 Apr 2022 • John Lim, Jan-Michael Frahm, Fabian Monrose
We evaluate our method on real-life data using a variety of metrics to quantify the amount of information an attacker is able to recover.
no code implementations • 14 Mar 2022 • Jisan Mahmud, Jan-Michael Frahm
VPFusion attains high-quality reconstruction using both - 3D feature volume to capture 3D-structure-aware context, and pixel-aligned image features to capture fine local detail.
no code implementations • 19 Nov 2021 • Yubo Zhang, Jan-Michael Frahm, Samuel Ehrenstein, Sarah K. McGill, Julian G. Rosenman, Shuxian Wang, Stephen M. Pizer
Aiming to fundamentally improve the depth estimation quality for colonoscopy 3D reconstruction, in this work we have designed a set of training losses to deal with the special challenges of colonoscopy data.
no code implementations • 1 Jan 2021 • John Lim, Fabian Monrose, Jan-Michael Frahm
We evaluate our method on real-life data using a variety of metrics to quantify the amount of information an attacker is able to recover.
1 code implementation • 6 Dec 2020 • Thanh Vu, Marc Eder, True Price, Jan-Michael Frahm
To address these constraints, we propose the Any-Width Network (AWN), an adjustable-width CNN architecture and associated training routine that allow for fine-grained control over speed and accuracy during inference.
1 code implementation • 12 Sep 2020 • John Lim, True Price, Fabian Monrose, Jan-Michael Frahm
This indicates that these models are able to learn rich, meaningful representations from our synthetic data and that training on the synthetic data can help overcome the issue of having small, real-life datasets for vision-based key stroke inference attacks.
1 code implementation • 27 Aug 2020 • Johannes Kopf, Kevin Matzen, Suhib Alsisan, Ocean Quigley, Francis Ge, Yangming Chong, Josh Patterson, Jan-Michael Frahm, Shu Wu, Matthew Yu, Peizhao Zhang, Zijian He, Peter Vajda, Ayush Saraf, Michael Cohen
3D photos are static in time, like traditional photos, but are displayed with interactive parallax on mobile or desktop screens, as well as on Virtual Reality devices, where viewing it also includes stereo.
no code implementations • 27 Aug 2020 • Aleksander Holynski, David Geraghty, Jan-Michael Frahm, Chris Sweeney, Richard Szeliski
Low-frequency long-range errors (drift) are an endemic problem in 3D structure from motion, and can often hamper reasonable reconstructions of the scene.
1 code implementation • CVPR 2020 • Marc Eder, Mykhailo Shvets, John Lim, Jan-Michael Frahm
In this work, we propose "tangent images," a spherical image representation that facilitates transferable and scalable $360^\circ$ computer vision.
1 code implementation • 22 Nov 2019 • Jisan Mahmud, Rajat Vikram Singh, Peri Akiva, Spondon Kundu, Kuan-Chuan Peng, Jan-Michael Frahm
By learning view synthesis, we explicitly encourage the feature extractor to encode information about not only the visible, but also the occluded parts of the scene.
1 code implementation • 26 Jun 2019 • Marc Eder, True Price, Thanh Vu, Akash Bapat, Jan-Michael Frahm
We present a versatile formulation of the convolution operation that we term a "mapped convolution."
no code implementations • 21 May 2019 • Marc Eder, Jan-Michael Frahm
Applying convolutional neural networks to spherical images requires particular considerations.
1 code implementation • 15 Apr 2019 • Rui Wang, Stephen M. Pizer, Jan-Michael Frahm
Deep learning-based, single-view depth estimation methods have recently shown highly promising results.
1 code implementation • SIGGRAPH Asia 2018 2018 • Peter Hedman, Julien Philip, True Price, Jan-Michael Frahm, George Drettakis, Gabriel Brostow
We present a new deep learning approach to blending for IBR, in which we use held-out real image data to learn blending weights to combine input photo contributions.
no code implementations • CVPR 2018 • True Price, Johannes L. Schönberger, Zhen Wei, Marc Pollefeys, Jan-Michael Frahm
Image-based 3D reconstruction for Internet photo collections has become a robust technology to produce impressive virtual representations of real-world scenes.
no code implementations • CVPR 2018 • Akash Bapat, True Price, Jan-Michael Frahm
In this paper, we introduce a novel multi-camera tracking approach that for the first time jointly leverages the information introduced by rolling shutter and radial distortion as a feature to achieve superior performance with respect to high-frequency camera pose estimation.
no code implementations • 17 May 2018 • Rui Wang, Jan-Michael Frahm, Stephen M. Pizer
Our method produces superior results to the state-of-the-art learning-based, single- or two-view depth estimation methods on both indoor and outdoor benchmark datasets.
no code implementations • CVPR 2019 • Akash Bapat, Jan-Michael Frahm
We present a framework for edge-aware optimization that is an order of magnitude faster than the state of the art while having comparable performance.
no code implementations • CVPR 2017 • Hyo Jin Kim, Enrique Dunn, Jan-Michael Frahm
We address the problem of large scale image geo-localization where the location of an image is estimated by identifying geo-tagged reference images depicting the same place.
no code implementations • CVPR 2017 • Tobias Palmer, Kalle Astrom, Jan-Michael Frahm
There is a significant interest in scene reconstruction from underwater images given its utility for oceanic research and for recreational image manipulation.
1 code implementation • CVPR 2016 • Johannes L. Schonberger, Jan-Michael Frahm
Incremental Structure-from-Motion is a prevalent strategy for 3D reconstruction from unordered image collections.
Ranked #13 on Point Clouds on Tanks and Temples
no code implementations • CVPR 2016 • Filip Radenovic, Johannes L. Schonberger, Dinghuang Ji, Jan-Michael Frahm, Ondrej Chum, Jiri Matas
We present an algorithm that leverages the appearance variety to obtain more complete and accurate scene geometry along with consistent multi-illumination appearance information.
no code implementations • 22 May 2016 • Enliang Zheng, Dinghuang Ji, Enrique Dunn, Jan-Michael Frahm
Given the smooth motion of dynamic objects, we observe any element in the dictionary can be well approximated by a sparse linear combination of other elements in the same dictionary (i. e. self-expression).
no code implementations • ICCV 2015 • Enliang Zheng, Ke Wang, Enrique Dunn, Jan-Michael Frahm
We propose two novel minimal solvers which advance the state of the art in satellite imagery processing.
no code implementations • ICCV 2015 • Hyo Jin Kim, Enrique Dunn, Jan-Michael Frahm
We address the problem of recognizing a place depicted in a query image by using a large database of geo-tagged images at a city-scale.
no code implementations • ICCV 2015 • Dinghuang Ji, Enrique Dunn, Jan-Michael Frahm
We propose a framework for the automatic creation of time-lapse mosaics of a given scene.
no code implementations • ICCV 2015 • Enliang Zheng, Dinghuang Ji, Enrique Dunn, Jan-Michael Frahm
We target the sparse 3D reconstruction of dynamic objects observed by multiple unsynchronized video cameras with unknown temporal overlap.
no code implementations • CVPR 2015 • Johannes L. Schonberger, Filip Radenovic, Ondrej Chum, Jan-Michael Frahm
Structure-from-Motion for unordered image collections has significantly advanced in scale over the last decade.
no code implementations • CVPR 2015 • Jared Heinly, Johannes L. Schonberger, Enrique Dunn, Jan-Michael Frahm
We propose a novel, large-scale, structure-from-motion framework that advances the state of the art in data scalability from city-scale modeling (millions of images) to world-scale modeling (several tens of millions of images) using just a single computer.
no code implementations • CVPR 2015 • David Perra, Rohit Kumar Gupta, Jan-Michael Frahm
Our calibration scheme allows a head-worn device to calculate a locally optimal eye-device transformation on demand by computing an optimal model from a local window of previous frames.
no code implementations • CVPR 2015 • Johannes L. Schonberger, Alexander C. Berg, Jan-Michael Frahm
Based on the insights of this evaluation, we propose a learning-based approach, the PAirwise Image Geometry Encoding (PAIGE), to efficiently identify image pairs with scene overlap without the need to perform exhaustive putative matching and geometric verification.
no code implementations • CVPR 2014 • Enliang Zheng, Enrique Dunn, Vladimir Jojic, Jan-Michael Frahm
We propose a multi-view depthmap estimation approach aimed at adaptively ascertaining the pixel level data associations between a reference image and all the elements of a source image set.
no code implementations • CVPR 2014 • Yilin Wang, Ke Wang, Enrique Dunn, Jan-Michael Frahm
We develop a sequential optimal sampling framework for stereo disparity estimation by adapting the Sequential Probability Ratio Test (SPRT) model.