no code implementations • 29 Aug 2024 • Emilia Szymanska, Mihai Dusmanu, Jan-Willem Buurlage, Mahdi Rad, Marc Pollefeys
Answering questions about the spatial properties of the environment poses challenges for existing language and vision foundation models due to a lack of understanding of the 3D world notably in terms of relationships between objects.
no code implementations • 16 May 2024 • Dan Bohus, Sean Andrist, Nick Saw, Ann Paradiso, Ishani Chakraborty, Mahdi Rad
We introduce an open-source system called SIGMA (short for "Situated Interactive Guidance, Monitoring, and Assistance") as a platform for conducting research on task-assistive agents in mixed-reality scenarios.
no code implementations • ICCV 2023 • Xin Wang, Taein Kwon, Mahdi Rad, Bowen Pan, Ishani Chakraborty, Sean Andrist, Dan Bohus, Ashley Feniello, Bugra Tekin, Felipe Vieira Frujeri, Neel Joshi, Marc Pollefeys
Building an interactive AI assistant that can perceive, reason, and collaborate with humans in the real world has been a long-standing pursuit in the AI community.
no code implementations • 17 Sep 2023 • Junan Lin, Zhichao Sun, Enjie Cao, Taein Kwon, Mahdi Rad, Marc Pollefeys
Skeletal Action recognition from an egocentric view is important for applications such as interfaces in AR/VR glasses and human-robot interaction, where the device has limited resources.
3 code implementations • 7 Jul 2022 • Sinisa Stekovic, Mahdi Rad, Alireza Moradi, Friedrich Fraundorfer, Vincent Lepetit
We also introduce a novel differentiable method for rendering the polygonal shapes of these proposals.
1 code implementation • CVPR 2022 • Shreyas Hampali, Sayan Deb Sarkar, Mahdi Rad, Vincent Lepetit
We propose a robust and accurate method for estimating the 3D poses of two hands in close interaction from a single color image.
Ranked #4 on hand-object pose on HO-3D v2
2 code implementations • ICCV 2021 • Sinisa Stekovic, Mahdi Rad, Friedrich Fraundorfer, Vincent Lepetit
For this step, we propose a novel differentiable method for rendering the polygonal shapes of these proposals.
no code implementations • 15 Apr 2020 • Mahdi Rad, Peter M. Roth, Vincent Lepetit
We show that our method significantly outperforms standard normalization methods and would also be appear to be universal since it does not have to be re-trained for each new application.
no code implementations • ECCV 2020 • Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, Mingxiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou, Sijia Mei, Yun-hui Liu, Adrian Spurr, Umar Iqbal, Pavlo Molchanov, Philippe Weinzaepfel, Romain Brégier, Grégory Rogez, Vincent Lepetit, Tae-Kyun Kim
To address these issues, we designed a public challenge (HANDS'19) to evaluate the abilities of current 3D hand pose estimators (HPEs) to interpolate and extrapolate the poses of a training set.
1 code implementation • ECCV 2020 • Sinisa Stekovic, Shreyas Hampali, Mahdi Rad, Sayan Deb Sarkar, Friedrich Fraundorfer, Vincent Lepetit
In order to deal with occlusions between components of the layout, which is a problem ignored by previous works, we introduce an analysis-by-synthesis method to iteratively refine the 3D layout estimate.
4 code implementations • CVPR 2020 • Shreyas Hampali, Mahdi Rad, Markus Oberweger, Vincent Lepetit
This dataset is currently made of 77, 558 frames, 68 sequences, 10 persons, and 10 objects.
Ranked #16 on 3D Hand Pose Estimation on HO-3D v2
no code implementations • 8 Oct 2018 • Mahdi Rad, Markus Oberweger, Vincent Lepetit
We introduce a novel learning method for 3D pose estimation from color images.
no code implementations • ECCV 2018 • Markus Oberweger, Mahdi Rad, Vincent Lepetit
We introduce a novel method for robust and accurate 3D object pose estimation from a single color image under large occlusions.
no code implementations • CVPR 2018 • Mahdi Rad, Markus Oberweger, Vincent Lepetit
The ability of using synthetic images for training a Deep Network is extremely valuable as it is easy to create a virtually infinite training set made of such images, while capturing and annotating real images can be very cumbersome.
no code implementations • 31 Aug 2017 • Mahdi Rad, Peter M. Roth, Vincent Lepetit
We therefore propose a novel illumination normalization method that lets us learn to detect objects and estimate their 3D pose under challenging illumination conditions from very few training samples.
2 code implementations • ICCV 2017 • Mahdi Rad, Vincent Lepetit
We introduce a novel method for 3D object detection and pose estimation from color images only.
Ranked #19 on 6D Pose Estimation using RGB on LineMOD
no code implementations • ICCV 2015 • Alberto Crivellaro, Mahdi Rad, Yannick Verdie, Kwang Moo Yi, Pascal Fua, Vincent Lepetit
We present a method that estimates in real-time and under challenging conditions the 3D pose of a known object.