no code implementations • 12 Dec 2024 • Carlos Esteves, Mohammed Suhail, Ameesh Makadia
Image tokenizers map images to sequences of discrete tokens, and are a crucial component of autoregressive transformer-based image generation.
no code implementations • 5 Dec 2024 • Mohammed Suhail, Carlos Esteves, Leonid Sigal, Ameesh Makadia
Latent variable generative models have emerged as powerful tools for generative tasks including image and video synthesis.
no code implementations • CVPR 2024 • Thomas W. Mitchel, Carlos Esteves, Ameesh Makadia
We introduce a framework for intrinsic latent diffusion models operating directly on the surfaces of 3D shapes, with the goal of synthesizing high-quality textures.
1 code implementation • ICCV 2023 • Utkarsh Singhal, Carlos Esteves, Ameesh Makadia, Stella X. Yu
However, too much or too little invariance can hurt, and the correct amount is unknown a priori and dependent on the instance.
1 code implementation • 8 Jun 2023 • Carlos Esteves, Jean-Jacques Slotine, Ameesh Makadia
Spherical CNNs generalize CNNs to functions on the sphere, by using spherical convolutions as the main linear operation.
no code implementations • ICCV 2023 • Zezhou Cheng, Carlos Esteves, Varun Jampani, Abhishek Kar, Subhransu Maji, Ameesh Makadia
Consequently, there is growing interest in extending NeRF models to jointly optimize camera poses and scene representation, which offers an alternative to off-the-shelf SfM pipelines which have well-understood failure modes.
no code implementations • ICCV 2023 • Kamal Gupta, Varun Jampani, Carlos Esteves, Abhinav Shrivastava, Ameesh Makadia, Noah Snavely, Abhishek Kar
We present a self-supervised technique that directly optimizes on a sparse collection of images of a particular object/object category to obtain consistent dense correspondences across the collection.
no code implementations • 18 Aug 2022 • Richard Li, Carlos Esteves, Ameesh Makadia, Pulkit Agrawal
We present a system for accurately predicting stable orientations for diverse rigid objects.
no code implementations • 21 Jul 2022 • Mohammed Suhail, Carlos Esteves, Leonid Sigal, Ameesh Makadia
Neural rendering has received tremendous attention since the advent of Neural Radiance Fields (NeRF), and has pushed the state-of-the-art on novel-view synthesis considerably.
1 code implementation • CVPR 2022 • Mohammed Suhail, Carlos Esteves, Leonid Sigal, Ameesh Makadia
Classical light field rendering for novel view synthesis can accurately reproduce view-dependent effects such as reflection, refraction, and translucency, but requires a dense view sampling of the scene.
no code implementations • 29 Sep 2021 • Carlos Esteves, Tianjian Lu, Mohammed Suhail, Yi-fan Chen, Ameesh Makadia
In this work, we generalize positional encoding with Fourier features to non-Euclidean manifolds.
no code implementations • 29 Sep 2021 • Diego Patino, Carlos Esteves, Kostas Daniilidis
In this paper we propose a deep learning method for unsupervised 3D implicit shape reconstruction from point clouds.
2 code implementations • 10 Jun 2021 • Kieran Murphy, Carlos Esteves, Varun Jampani, Srikumar Ramalingam, Ameesh Makadia
Single image pose estimation is a fundamental problem in many vision and robotics tasks, and existing deep learning approaches suffer by not completely modeling and handling: i) uncertainty about the predictions, and ii) symmetric objects with multiple (sometimes infinite) correct poses.
2 code implementations • 4 Dec 2020 • Carlos Esteves
In this thesis, we extend equivariance to other kinds of transformations, such as rotation and scaling.
2 code implementations • NeurIPS 2020 • Jake Levinson, Carlos Esteves, Kefan Chen, Noah Snavely, Angjoo Kanazawa, Afshin Rostamizadeh, Ameesh Makadia
Symmetric orthogonalization via SVD, and closely related procedures, are well-known techniques for projecting matrices onto $O(n)$ or $SO(n)$.
2 code implementations • NeurIPS 2020 • Carlos Esteves, Ameesh Makadia, Kostas Daniilidis
In this paper, we present a new type of spherical CNN that allows anisotropic filters in an efficient way, without ever leaving the spherical domain.
Ranked #20 on Semantic Segmentation on Stanford2D3D Panoramic
no code implementations • 10 Apr 2020 • Carlos Esteves
The second, by Cohen et al. (NeurIPS'19), generalizes the first to a larger class of networks, with feature maps as fields on homogeneous spaces.
1 code implementation • ICCV 2019 • Carlos Esteves, Yinshuang Xu, Christine Allen-Blanchette, Kostas Daniilidis
Several popular approaches to 3D vision tasks process multiple views of the input independently with deep neural networks pre-trained on natural images, achieving view permutation invariance through a single round of pooling over all views.
1 code implementation • 6 Dec 2018 • Carlos Esteves, Avneesh Sud, Zhengyi Luo, Kostas Daniilidis, Ameesh Makadia
This embedding encodes images with 3D shape properties and is equivariant to 3D rotations of the observed object.
no code implementations • 6 Sep 2018 • Carlos Esteves, Kostas Daniilidis, Ameesh Makadia
With the recent proliferation of consumer-grade 360{\deg} cameras, it is worth revisiting visual perception challenges with spherical cameras given the potential benefit of their global field of view.
3 code implementations • ECCV 2018 • Carlos Esteves, Christine Allen-Blanchette, Ameesh Makadia, Kostas Daniilidis
We address the problem of 3D rotation equivariance in convolutional neural networks.
no code implementations • ICCV 2017 • Roberto Tron, Xiaowei Zhou, Carlos Esteves, Kostas Daniilidis
We consider the problem of finding consistent matches across multiple images.
1 code implementation • ICLR 2018 • Carlos Esteves, Christine Allen-Blanchette, Xiaowei Zhou, Kostas Daniilidis
The result is a network invariant to translation and equivariant to both rotation and scale.