no code implementations • 30 Jan 2025 • Vitor Guizilini, Muhammad Zubair Irshad, Dian Chen, Greg Shakhnarovich, Rares Ambrus
Our method uses raymap conditioning to both augment visual features with spatial information from different viewpoints, as well as to guide the generation of images and depth maps from novel views.
no code implementations • 25 Nov 2024 • Shester Gueuwou, Xiaodan Du, Greg Shakhnarovich, Karen Livescu, Alexander H. Liu
Sign language processing has traditionally relied on task-specific models, limiting the potential for transfer learning across tasks.
no code implementations • 11 Jun 2024 • Shester Gueuwou, Xiaodan Du, Greg Shakhnarovich, Karen Livescu
A persistent challenge in sign language video processing, including the task of sign language to written language translation, is how we learn representations of sign language in an effective and efficient way that can preserve the important attributes of these languages, while remaining invariant to irrelevant visual differences.
no code implementations • CVPR 2024 • Joshua Ahn, Haochen Wang, Raymond A. Yeh, Greg Shakhnarovich
Scale-ambiguity in 3D scene dimensions leads to magnitude-ambiguity of volumetric densities in neural radiance fields, i. e., the densities double when scene size is halved, and vice versa.
no code implementations • 28 Nov 2023 • Xiaodan Du, Nicholas Kolkin, Greg Shakhnarovich, Anand Bhattad
Generative models excel at mimicking real scenes, suggesting they might inherently encode important intrinsic scene properties.
1 code implementation • 10 Nov 2023 • Jiahao Li, Hao Tan, Kai Zhang, Zexiang Xu, Fujun Luan, Yinghao Xu, Yicong Hong, Kalyan Sunkavalli, Greg Shakhnarovich, Sai Bi
Text-to-3D with diffusion models has achieved remarkable progress in recent years.
no code implementations • 26 Oct 2023 • Sudarshan Babu, Richard Liu, Avery Zhou, Michael Maire, Greg Shakhnarovich, Rana Hanocka
We introduce HyperFields, a method for generating text-conditioned Neural Radiance Fields (NeRFs) with a single forward pass and (optionally) some fine-tuning.
no code implementations • 9 Dec 2022 • Nam Anh Dinh, Haochen Wang, Greg Shakhnarovich, Rana Hanocka
There is no settled universal 3D representation for geometry with many alternatives such as point clouds, meshes, implicit functions, and voxels to name a few.
1 code implementation • CVPR 2023 • Haochen Wang, Xiaodan Du, Jiahao Li, Raymond A. Yeh, Greg Shakhnarovich
We propose to apply chain rule on the learned gradients, and back-propagate the score of a diffusion model through the Jacobian of a differentiable renderer, which we instantiate to be a voxel radiance field.
Ranked #6 on
Text to 3D
on T$^3$Bench
1 code implementation • 8 Sep 2022 • Xiaodan Du, Raymond A. Yeh, Nicholas Kolkin, Eli Shechtman, Greg Shakhnarovich
We propose Fast text2StyleGAN, a natural language interface that adapts pre-trained GANs for text-guided human face synthesis.
1 code implementation • 28 Jul 2022 • Vitor Guizilini, Igor Vasiljevic, Jiading Fang, Rares Ambrus, Greg Shakhnarovich, Matthew Walter, Adrien Gaidon
Modern 3D computer vision leverages learning to boost geometric reasoning, mapping image data to classical structures such as cost volumes or epipolar constraints to improve matching.
1 code implementation • 25 May 2022 • Bowen Shi, Diane Brentari, Greg Shakhnarovich, Karen Livescu
Existing work on sign language translation - that is, translation from sign language videos into sentences in a written language - has focused mainly on (1) data collected in a controlled environment or (2) data in a specific domain, which limits the applicability to real-world settings.
Ranked #2 on
Gloss-free Sign Language Translation
on OpenASL
Gloss-free Sign Language Translation
Sign Language Translation
+1
1 code implementation • 7 Apr 2022 • Jiahao Li, Greg Shakhnarovich, Raymond A. Yeh
Our method for phrase localization requires no human annotations or additional training.
1 code implementation • 24 Mar 2022 • Nicholas Kolkin, Michal Kucera, Sylvain Paris, Daniel Sykora, Eli Shechtman, Greg Shakhnarovich
We propose Neural Neighbor Style Transfer (NNST), a pipeline that offers state-of-the-art quality, generalization, and competitive efficiency for artistic style transfer.
no code implementations • ACL 2022 • Bowen Shi, Diane Brentari, Greg Shakhnarovich, Karen Livescu
This is an important task since significant content in sign language is often conveyed via fingerspelling, and to our knowledge the task has not been studied before.
no code implementations • 11 Feb 2022 • Avrim Blum, Omar Montasser, Greg Shakhnarovich, Hongyang Zhang
We present an oracle-efficient algorithm for boosting the adversarial robustness of barely robust learners.
no code implementations • 6 Dec 2021 • Jiading Fang, Igor Vasiljevic, Vitor Guizilini, Rares Ambrus, Greg Shakhnarovich, Adrien Gaidon, Matthew R. Walter
Camera calibration is integral to robotics and computer vision algorithms that seek to infer geometric properties of the scene from visual input streams.
1 code implementation • CVPR 2021 • Bowen Shi, Diane Brentari, Greg Shakhnarovich, Karen Livescu
We propose a benchmark and a suite of evaluation metrics, some of which reflect the effect of detection on the downstream fingerspelling recognition task.
no code implementations • 31 Mar 2021 • Vitor Guizilini, Igor Vasiljevic, Rares Ambrus, Greg Shakhnarovich, Adrien Gaidon
In this work, we extend monocular self-supervised depth and ego-motion estimation to large-baseline multi-camera rigs.
1 code implementation • CVPR 2021 • Pedro Savarese, Sunnie S. Y. Kim, Michael Maire, Greg Shakhnarovich, David Mcallester
We study image segmentation from an information-theoretic perspective, proposing a novel adversarial method that performs unsupervised segmentation by partitioning images into maximally independent sets.
Ranked #1 on
Unsupervised Image Segmentation
on Flowers
1 code implementation • 15 Aug 2020 • Igor Vasiljevic, Vitor Guizilini, Rares Ambrus, Sudeep Pillai, Wolfram Burgard, Greg Shakhnarovich, Adrien Gaidon
Self-supervised learning has emerged as a powerful tool for depth and ego-motion estimation, leading to state-of-the-art results on benchmark datasets.
1 code implementation • 29 May 2020 • Ruotian Luo, Greg Shakhnarovich
We develop and evaluate captioning models that allow control of caption length.
no code implementations • CVPR 2020 • Haochen Wang, Ruotian Luo, Michael Maire, Greg Shakhnarovich
The core of our approach, Pixel Consensus Voting, is a framework for instance segmentation based on the Generalized Hough transform.
Ranked #36 on
Panoptic Segmentation
on COCO test-dev
1 code implementation • CVPR 2020 • Muhammad Haris, Greg Shakhnarovich, Norimichi Ukita
We consider the problem of space-time super-resolution (ST-SR): increasing spatial resolution of video frames and simultaneously interpolating frames to increase the frame rate.
no code implementations • 27 Mar 2020 • Davis Gilton, Ruotian Luo, Rebecca Willett, Greg Shakhnarovich
This paper presents a framework for the analysis of changes in visual streams: ordered sequences of images, possibly separated by significant time gaps.
2 code implementations • ICCV 2019 • Bowen Shi, Aurora Martinez Del Rio, Jonathan Keane, Diane Brentari, Greg Shakhnarovich, Karen Livescu
In this paper we focus on recognition of fingerspelling sequences in American Sign Language (ASL) videos collected in the wild, mainly from YouTube and Deaf social media.
6 code implementations • CVPR 2019 • Nicholas Kolkin, Jason Salavon, Greg Shakhnarovich
Style transfer algorithms strive to render the content of one image using the style of another.
7 code implementations • 4 Apr 2019 • Muhammad Haris, Greg Shakhnarovich, Norimichi Ukita
Previous feed-forward architectures of recently proposed deep super-resolution networks learn the features of low-resolution inputs and the non-linear mapping from those to a high-resolution output.
Ranked #1 on
Image Super-Resolution
on BSDS100 - 8x upscaling
7 code implementations • CVPR 2019 • Muhammad Haris, Greg Shakhnarovich, Norimichi Ukita
We proposed a novel architecture for the problem of video super-resolution.
no code implementations • 26 Oct 2018 • Bowen Shi, Aurora Martinez Del Rio, Jonathan Keane, Jonathan Michaux, Diane Brentari, Greg Shakhnarovich, Karen Livescu
As the first attempt at fingerspelling recognition in the wild, this work is intended to serve as a baseline for future work on sign language recognition in realistic conditions.
no code implementations • 30 Mar 2018 • Muhammad Haris, Greg Shakhnarovich, Norimichi Ukita
We consider how image super resolution (SR) can contribute to an object detection task in low-resolution images.
16 code implementations • CVPR 2018 • Muhammad Haris, Greg Shakhnarovich, Norimichi Ukita
The feed-forward architectures of recently proposed deep super-resolution networks learn representations of low-resolution inputs, and the non-linear mapping from those to high-resolution output.
Ranked #21 on
Video Super-Resolution
on Vid4 - 4x upscaling
no code implementations • ECCV 2018 • Huaizu Jiang, Erik Learned-Miller, Gustav Larsson, Michael Maire, Greg Shakhnarovich
As an agent moves through the world, the apparent motion of scene elements is (usually) inversely proportional to their depth.
no code implementations • NeurIPS 2014 • Shubhendu Trivedi, David Mcallester, Greg Shakhnarovich
We formulate the problem of metric learning for k nearest neighbor classification as a large margin structured prediction problem, with a latent variable representing the choice of neighbors and the task loss directly corresponding to classification error.