Search Results for author: Rawal Khirodkar

Found 13 papers, 4 papers with code

Generalizable Neural Human Renderer

no code implementations22 Apr 2024 Mana Masuda, Jinhyung Park, Shun Iwase, Rawal Khirodkar, Kris Kitani

While recent advancements in animatable human rendering have achieved remarkable results, they require test-time optimization for each subject which can be a significant limitation for real-world applications.

Real-Time Simulated Avatar from Head-Mounted Sensors

no code implementations11 Mar 2024 Zhengyi Luo, Jinkun Cao, Rawal Khirodkar, Alexander Winkler, Kris Kitani, Weipeng Xu

We present SimXR, a method for controlling a simulated avatar from information (headset pose and cameras) obtained from AR / VR headsets.

Egocentric Pose Estimation Humanoid Control +1

Multi-Person 3D Pose Estimation from Multi-View Uncalibrated Depth Cameras

no code implementations28 Jan 2024 Yu-Jhe Li, Yan Xu, Rawal Khirodkar, Jinhyung Park, Kris Kitani

In order to evaluate our proposed pipeline, we collect three video sets of RGBD videos recorded from multiple sparse-view depth cameras and ground truth 3D poses are manually annotated.

3D Human Pose Estimation 3D Pose Estimation +2

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

no code implementations30 Nov 2023 Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain, Rawal Khirodkar, Devansh Kukreja, Kevin J Liang, Jia-Wei Liu, Sagnik Majumder, Yongsen Mao, Miguel Martin, Effrosyni Mavroudi, Tushar Nagarajan, Francesco Ragusa, Santhosh Kumar Ramakrishnan, Luigi Seminara, Arjun Somayazulu, Yale Song, Shan Su, Zihui Xue, Edward Zhang, Jinxu Zhang, Angela Castillo, Changan Chen, Xinzhu Fu, Ryosuke Furuta, Cristina Gonzalez, Prince Gupta, Jiabo Hu, Yifei HUANG, Yiming Huang, Weslie Khoo, Anush Kumar, Robert Kuo, Sach Lakhavani, Miao Liu, Mi Luo, Zhengyi Luo, Brighid Meredith, Austin Miller, Oluwatumininu Oguntola, Xiaqing Pan, Penny Peng, Shraman Pramanick, Merey Ramazanova, Fiona Ryan, Wei Shan, Kiran Somasundaram, Chenan Song, Audrey Southerland, Masatoshi Tateno, Huiyu Wang, Yuchen Wang, Takuma Yagi, Mingfei Yan, Xitong Yang, Zecheng Yu, Shengxin Cindy Zha, Chen Zhao, Ziwei Zhao, Zhifan Zhu, Jeff Zhuo, Pablo Arbelaez, Gedas Bertasius, David Crandall, Dima Damen, Jakob Engel, Giovanni Maria Farinella, Antonino Furnari, Bernard Ghanem, Judy Hoffman, C. V. Jawahar, Richard Newcombe, Hyun Soo Park, James M. Rehg, Yoichi Sato, Manolis Savva, Jianbo Shi, Mike Zheng Shou, Michael Wray

We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge.

Video Understanding

EgoHumans: An Egocentric 3D Multi-Human Benchmark

no code implementations25 May 2023 Rawal Khirodkar, Aayush Bansal, Lingni Ma, Richard Newcombe, Minh Vo, Kris Kitani

We present EgoHumans, a new multi-view multi-human video benchmark to advance the state-of-the-art of egocentric human 3D pose estimation and tracking.

3D Pose Estimation Human Detection

Ego-Humans: An Ego-Centric 3D Multi-Human Benchmark

no code implementations ICCV 2023 Rawal Khirodkar, Aayush Bansal, Lingni Ma, Richard Newcombe, Minh Vo, Kris Kitani

We present EgoHumans, a new multi-view multi-human video benchmark to advance the state-of-the-art of egocentric human 3D pose estimation and tracking.

3D Pose Estimation Human Detection

Sequential Ensembling for Semantic Segmentation

no code implementations8 Oct 2022 Rawal Khirodkar, Brandon Smith, Siddhartha Chandra, Amit Agrawal, Antonio Criminisi

Ensemble approaches for deep-learning-based semantic segmentation remain insufficiently explored despite the proliferation of competitive benchmarks and downstream applications.

Segmentation Semantic Segmentation

Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking

7 code implementations CVPR 2023 Jinkun Cao, Jiangmiao Pang, Xinshuo Weng, Rawal Khirodkar, Kris Kitani

Instead of relying only on the linear state estimate (i. e., estimation-centric approach), we use object observations (i. e., the measurements by object detector) to compute a virtual trajectory over the occlusion period to fix the error accumulation of filter parameters during the occlusion period.

Multi-Object Tracking Multiple Object Tracking +3

Occluded Human Mesh Recovery

no code implementations CVPR 2022 Rawal Khirodkar, Shashank Tripathi, Kris Kitani

Along with the input image, we condition the top-down model on spatial context from the image in the form of body-center heatmaps.

Ranked #63 on 3D Human Pose Estimation on 3DPW (using extra training data)

3D Human Pose Estimation Human Mesh Recovery

RePOSE: Fast 6D Object Pose Refinement via Deep Texture Rendering

1 code implementation ICCV 2021 Shun Iwase, Xingyu Liu, Rawal Khirodkar, Rio Yokota, Kris M. Kitani

Furthermore, we utilize differentiable Levenberg-Marquardt (LM) optimization to refine a pose fast and accurately by minimizing the feature-metric error between the input and rendered image representations without the need of zooming in.

6D Pose Estimation 6D Pose Estimation using RGB +1

Multi-Instance Pose Networks: Rethinking Top-Down Pose Estimation

1 code implementation ICCV 2021 Rawal Khirodkar, Visesh Chari, Amit Agrawal, Ambrish Tyagi

Specifically, we achieve 70. 0 AP on CrowdPose and 42. 5 AP on OCHuman test sets, a significant improvement of 2. 4 AP and 6. 5 AP over the prior art, respectively.

2D Human Pose Estimation Keypoint Detection +1

Adversarial Domain Randomization

no code implementations3 Dec 2018 Rawal Khirodkar, Kris M. Kitani

Domain Randomization (DR) is known to require a significant amount of training data for good performance.

Domain Adaptation Image Classification +2

Domain Randomization for Scene-Specific Car Detection and Pose Estimation

1 code implementation14 Nov 2018 Rawal Khirodkar, Donghyun Yoo, Kris M. Kitani

We address the issue of domain gap when making use of synthetic data to train a scene-specific object detector and pose estimator.

3D Pose Estimation Instance Segmentation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.