no code implementations • 17 Apr 2025 • Hongyu Li, James Akl, Srinath Sridhar, Tye Brady, Taskin Padir
Object 6D pose estimation is a critical challenge in robotics, particularly for manipulation tasks.
no code implementations • 14 Apr 2025 • Xiaoyan Cong, Jiayi Shen, Zekun Li, Rao Fu, Tao Lu, Srinath Sridhar
Large-scale pre-trained image-to-3D generative models have exhibited remarkable capabilities in diverse shape generations.
no code implementations • 10 Apr 2025 • Kefan Chen, Sergiu Oprea, Justin Theiss, Sreyas Mohan, Srinath Sridhar, Aayush Prakash
With the rising interest from the community in digital avatars coupled with the importance of expressions and gestures in communication, modeling natural avatar behavior remains an important challenge across many industries such as teleconferencing, gaming, and AR/VR.
no code implementations • 24 Feb 2025 • Hongyu Li, Mingxi Jia, Tuluhan Akbulut, Yu Xiang, George Konidaris, Srinath Sridhar
Building on this representation, we introduce a new visuo-haptic transformer-based object pose tracker that seamlessly integrates visual and haptic input.
no code implementations • 3 Feb 2025 • Aashish Rai, Dilin Wang, Mihir Jain, Nikolaos Sarafianos, Arthur Chen, Srinath Sridhar, Aayush Prakash
The compressed UVGS can be treated as typical RGB images.
no code implementations • 18 Dec 2024 • Tao Lu, Ankit Dhiman, R Srinath, Emre Arslan, Angela Xing, Yuanbo Xiangli, R Venkatesh Babu, Srinath Sridhar
In contrast, our goal is to reduce the optimization time by training for fewer steps while maintaining high rendering quality.
no code implementations • 5 Dec 2024 • Rao Fu, Dingxi Zhang, Alex Jiang, Wanjia Fu, Austin Funk, Daniel Ritchie, Srinath Sridhar
Understanding bimanual human hand activities is a critical problem in AI and robotics.
no code implementations • 3 Dec 2024 • Kefan Chen, Chaerin Min, Linguang Zhang, Shreyas Hampali, Cem Keskin, Srinath Sridhar
We present FoundHand, a large-scale domain-specific diffusion model for synthesizing single and dual hand images.
no code implementations • 5 Nov 2024 • Arnab Dey, Cheng-You Lu, Andrew I. Comport, Srinath Sridhar, Chin-Teng Lin, Jean Martinet
Recent advancements in radiance field rendering show promising results in 3D scene representation, where Gaussian splatting-based techniques emerge as state-of-the-art due to their quality and efficiency.
no code implementations • 19 Oct 2024 • Xianghao Xu, Srinath Sridhar, Daniel Ritchie
We propose a zero-shot text-driven 3D shape deformation system that deforms an input 3D mesh of a manufactured object to fit an input text description.
no code implementations • 30 Jul 2024 • Aashish Rai, Srinath Sridhar
We introduce EgoSonics, a method to generate semantically meaningful and synchronized audio tracks conditioned on silent egocentric videos.
no code implementations • 7 Jun 2024 • Chaerin Min, Srinath Sridhar
We propose GenHeld to address the inverse problem of synthesizing held objects conditioned on 3D hand model or 2D image.
no code implementations • 22 Apr 2024 • Rahul Sajnani, Jeroen Vanbaar, Jie Min, Kapil Katyal, Srinath Sridhar
We present GeoDiffuser, a zero-shot optimization-based method that unifies common 2D and 3D image-based object editing capabilities into a single method.
no code implementations • 9 Apr 2024 • Arnab Dey, Di Yang, Rohith Agaram, Antitza Dantcheva, Andrew I. Comport, Srinath Sridhar, Jean Martinet
In this paper, we introduce a novel approach, termed GHNeRF, designed to address these limitations by learning 2D/3D joint locations of human subjects with NeRF representation.
no code implementations • 6 Apr 2024 • Gaurav Singh, Sanket Kalwar, Md Faizal Karim, Bipasha Sen, Nagamanikandan Govindan, Srinath Sridhar, K Madhava Krishna
Efficiently generating grasp poses tailored to specific regions of an object is vital for various robotic manipulation tasks, especially in a dual-arm setup.
no code implementations • 11 Dec 2023 • Rao Fu, Zehao Wen, Zichen Liu, Srinath Sridhar
Inspired by cognitive theories, we introduce AnyHome, a framework that translates any text into well-structured and textured indoor scenes at a house-scale.
no code implementations • CVPR 2024 • Chandradeep Pokhariya, Ishaan N Shah, Angela Xing, Zekun Li, Kefan Chen, Avinash Sharma, Srinath Sridhar
Since our representation uses Gaussian primitives, it enables us to efficiently and accurately estimate contacts between the hand and the object.
no code implementations • ICCV 2023 • Ankit Dhiman, Srinath R, Harsh Rangwani, Rishubh Parihar, Lokesh R Boregowda, Srinath Sridhar, R Venkatesh Babu
We propose Strata-NeRF, a single neural radiance field that implicitly captures a scene with multiple levels.
no code implementations • CVPR 2024 • Cheng-You Lu, Peisen Zhou, Angela Xing, Chandradeep Pokhariya, Arnab Dey, Ishaan Shah, Rugved Mavidipalli, Dylan Hu, Andrew Comport, Kefan Chen, Srinath Sridhar
Advances in neural fields are enabling high-fidelity capture of the shape and appearance of dynamic 3D scenes.
no code implementations • NeurIPS 2023 • Bipasha Sen, Gaurav Singh, Aditya Agarwal, Rohith Agaram, K Madhava Krishna, Srinath Sridhar
Neural Radiance Fields (NeRF) have become an increasingly popular representation to capture high-quality appearance and shape of scenes and objects.
no code implementations • ICCV 2023 • Yiqing Liang, Eliot Laidlaw, Alexander Meyerowitz, Srinath Sridhar, James Tompkin
From video, we reconstruct a neural volume that captures time-varying color, density, scene flow, semantics, and attention information.
no code implementations • CVPR 2023 • Qiuhong Anna Wei, Sijie Ding, Jeong Joon Park, Rahul Sajnani, Adrien Poulenard, Srinath Sridhar, Leonidas Guibas
Humans universally dislike the task of cleaning up a messy room.
no code implementations • 17 Jan 2023 • Bipasha Sen, Aditya Agarwal, Gaurav Singh, Brojeshwar B., Srinath Sridhar, Madhava Krishna
Unlike existing methods that depend on an external canonicalization, SCARP performs canonicalization, pose estimation, and shape completion in a single network, improving the performance by 45% over the existing baselines.
1 code implementation • CVPR 2023 • Rohith Agaram, Shaurya Dewan, Rahul Sajnani, Adrien Poulenard, Madhava Krishna, Srinath Sridhar
We present Canonical Field Network (CaFi-Net), a self-supervised method to canonicalize the 3D pose of instances from an object category represented as neural fields, specifically neural radiance fields (NeRFs).
no code implementations • CVPR 2023 • Aditya Sanghi, Rao Fu, Vivian Liu, Karl Willis, Hooman Shayani, Amir Hosein Khasahmadi, Srinath Sridhar, Daniel Ritchie
Recent works have demonstrated that natural language can be used to generate and edit 3D shapes.
1 code implementation • 19 Jul 2022 • Rao Fu, Xiao Zhan, YiWen Chen, Daniel Ritchie, Srinath Sridhar
Results show that our method can generate shapes consistent with text descriptions, and shapes evolve gradually as more phrases are added.
1 code implementation • 17 Jun 2022 • Xianghao Xu, Yifan Ruan, Srinath Sridhar, Daniel Ritchie
We operationalize this concept with an algorithm that optimizes a shape's part motion parameters such that it can transform into other shapes in the collection.
no code implementations • 12 Jun 2022 • Trevor Houchens, Cheng-You Lu, Shivam Duggal, Rao Fu, Srinath Sridhar
We propose Omnidirectional Distance Fields (ODFs), a new 3D shape representation that encodes geometry by storing the depth to the object's surface from any 3D position in any viewing direction.
1 code implementation • CVPR 2022 • Rahul Sajnani, Adrien Poulenard, Jivitesh Jain, Radhika Dua, Leonidas J. Guibas, Srinath Sridhar
ConDor is a self-supervised method that learns to Canonicalize the 3D orientation and position for full and partial 3D point clouds.
no code implementations • 13 Dec 2021 • Bryce Blinn, Alexander Ding, R. Kenny Jones, Manolis Savva, Srinath Sridhar, Daniel Ritchie
The body-shape-conditioned models produce chairs which will be comfortable for a person with the given body shape; the pose-conditioned models produce chairs which accommodate the given sitting pose.
1 code implementation • 22 Nov 2021 • Yiheng Xie, Towaki Takikawa, Shunsuke Saito, Or Litany, Shiqin Yan, Numair Khan, Federico Tombari, James Tompkin, Vincent Sitzmann, Srinath Sridhar
Recent advances in machine learning have created increasing interest in solving visual computing problems using a class of coordinate-based neural networks that parametrize physical properties of scenes or objects across space and time.
no code implementations • 17 May 2021 • Ge Zhang, Or Litany, Srinath Sridhar, Leonidas Guibas
We present StrobeNet, a method for category-level 3D reconstruction of articulating objects from one or more unposed RGB images.
1 code implementation • ICCV 2021 • Davis Rempe, Tolga Birdal, Aaron Hertzmann, Jimei Yang, Srinath Sridhar, Leonidas J. Guibas
We introduce HuMoR: a 3D Human Motion Model for Robust Estimation of temporal pose and shape.
1 code implementation • 25 Nov 2020 • Rahul Sajnani, AadilMehdi Sanchawala, Krishna Murthy Jatavallabhula, Srinath Sridhar, K. Madhava Krishna
We present DRACO, a method for Dense Reconstruction And Canonicalization of Object shape from one or more RGB images.
1 code implementation • ECCV 2020 • Jiahui Lei, Srinath Sridhar, Paul Guerrero, Minhyuk Sung, Niloy Mitra, Leonidas J. Guibas
We investigate the problem of learning to generate 3D parametric surface representations for novel object instances, as seen from one or more views.
1 code implementation • NeurIPS 2020 • Davis Rempe, Tolga Birdal, Yongheng Zhao, Zan Gojcic, Srinath Sridhar, Leonidas J. Guibas
We propose CaSPR, a method to learn object-centric Canonical Spatiotemporal Point Cloud Representations of dynamically moving or evolving objects.
no code implementations • 26 Feb 2020 • Or Litany, Ari Morcos, Srinath Sridhar, Leonidas Guibas, Judy Hoffman
We seek to learn a representation on a large annotated data source that generalizes to a target domain using limited new supervision.
no code implementations • 6 Feb 2020 • Zhangsihao Yang, Or Litany, Tolga Birdal, Srinath Sridhar, Leonidas Guibas
In this work, we wish to challenge this practice and use a neural network to learn descriptors directly from the raw mesh.
1 code implementation • 16 Jan 2020 • Davis Rempe, Srinath Sridhar, He Wang, Leonidas J. Guibas
Experiments show that we can accurately predict the changes in state for unseen object geometries and initial conditions.
1 code implementation • NeurIPS 2019 • Srinath Sridhar, Davis Rempe, Julien Valentin, Sofien Bouaziz, Leonidas J. Guibas
We investigate the problem of learning category-specific 3D shape reconstruction from a variable number of RGB views of previously unobserved object instances.
9 code implementations • CVPR 2019 • He Wang, Srinath Sridhar, Jingwei Huang, Julien Valentin, Shuran Song, Leonidas J. Guibas
The goal of this paper is to estimate the 6D pose and dimensions of unseen object instances in an RGB-D image.
Ranked #2 on
6D Pose Estimation using RGBD
on CAMERA25
no code implementations • 2 Jan 2019 • Davis Rempe, Srinath Sridhar, He Wang, Leonidas J. Guibas
In this work, we focus on predicting the dynamics of 3D rigid objects, in particular an object's final resting position and total rotation when subjected to an impulsive force.
5 code implementations • 9 Dec 2017 • Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Srinath Sridhar, Gerard Pons-Moll, Christian Theobalt
Our approach uses novel occlusion-robust pose-maps (ORPM) which enable full body pose inference even under strong partial occlusions by other people and objects in the scene.
Ranked #3 on
3D Multi-Person Pose Estimation (root-relative)
on MuPoTS-3D
(MPJPE metric)
3D Human Pose Estimation
3D Multi-Person Pose Estimation (absolute)
+2
no code implementations • CVPR 2018 • Franziska Mueller, Florian Bernard, Oleksandr Sotnychenko, Dushyant Mehta, Srinath Sridhar, Dan Casas, Christian Theobalt
We address the highly challenging problem of real-time 3D hand tracking based on a monocular RGB-only sequence.
1 code implementation • 3 May 2017 • Dushyant Mehta, Srinath Sridhar, Oleksandr Sotnychenko, Helge Rhodin, Mohammad Shafiei, Hans-Peter Seidel, Weipeng Xu, Dan Casas, Christian Theobalt
A real-time kinematic skeleton fitting method uses the CNN output to yield temporally stable 3D global pose reconstructions on the basis of a coherent kinematic skeleton.
Ranked #16 on
Pose Estimation
on Leeds Sports Poses
no code implementations • ICCV 2017 • Franziska Mueller, Dushyant Mehta, Oleksandr Sotnychenko, Srinath Sridhar, Dan Casas, Christian Theobalt
We present an approach for real-time, robust and accurate hand pose estimation from moving egocentric RGB-D cameras in cluttered real environments.
no code implementations • 16 Oct 2016 • Srinath Sridhar, Franziska Mueller, Michael Zollhöfer, Dan Casas, Antti Oulasvirta, Christian Theobalt
However, due to difficult occlusions, fast motions, and uniform hand appearance, jointly tracking hand and object pose is more challenging than tracking either of the two separately.
no code implementations • CVPR 2015 • Srinath Sridhar, Franziska Mueller, Antti Oulasvirta, Christian Theobalt
In the optimization step, a novel objective function combines the detected part labels and a Gaussian mixture representation of the depth to estimate a pose that best fits the depth.
no code implementations • 11 Feb 2016 • Srinath Sridhar, Helge Rhodin, Hans-Peter Seidel, Antti Oulasvirta, Christian Theobalt
In this paper, we propose a new approach that tracks the full skeleton motion of the hand from multiple RGB cameras in real-time.