no code implementations • 9 Sep 2023 • Daniel Ajisafe, James Tang, Shih-Yang Su, Bastian Wandt, Helge Rhodin
Human motion capture either requires multi-camera systems or is unreliable using single-view input due to depth ambiguities.
no code implementations • 23 Aug 2023 • Chunjin Song, Bastian Wandt, Helge Rhodin
It is now possible to reconstruct dynamic human motion and shape from a sparse set of cameras using Neural Radiance Fields (NeRF) driven by an underlying skeleton.
no code implementations • 4 Apr 2023 • Shih-Yang Su, Timur Bagautdinov, Helge Rhodin
Previous methods avoid using a template but rely on a costly or ill-posed mapping from observation to canonical space.
no code implementations • CVPR 2023 • Xingzhe He, Gaurav Bharaj, David Ferman, Helge Rhodin, Pablo Garrido
Supervised keypoint localization methods rely on large manually labeled image datasets, where objects can deform, articulate, or occlude.
no code implementations • 10 Nov 2022 • Frank Yu, Sid Fels, Helge Rhodin
The warping with a shallow network reduces latency and the caching operations can further be parallelized to improve the frame rate.
no code implementations • 23 Jun 2022 • Abiramy Kuganesan, Shih-Yang Su, James J. Little, Helge Rhodin
Neural Radiance Fields (NeRFs) increase reconstruction detail for novel view synthesis and scene reconstruction, with applications ranging from large static scenes to dynamic human motion.
1 code implementation • 21 May 2022 • Xingzhe He, Bastian Wandt, Helge Rhodin
Our key ingredients are i) an encoder that predicts keypoint locations in an input image, ii) a shared graph as a latent variable that links the same pairs of keypoints in every image, iii) an intermediate edge map that combines the latent graph edge weights and keypoint locations in a soft, differentiable manner, and iv) an inpainting objective on randomly masked images.
Ranked #1 on
Unsupervised Landmark Detection
on MAFL Unaligned
no code implementations • 6 May 2022 • Xingzhe He, Bastian Wandt, Helge Rhodin
Generative adversarial networks (GANs) can now generate photo-realistic images.
no code implementations • 3 May 2022 • Shih-Yang Su, Timur Bagautdinov, Helge Rhodin
While a few such approaches exist, those have limited generalization capabilities and are prone to learning spurious (chance) correlations between irrelevant body parts, resulting in implausible deformations and missing body parts on unseen poses.
2 code implementations • 29 Apr 2022 • Eric Hedlin, Helge Rhodin, Kwang Moo Yi
While the quality of this pseudo-ground-truth is challenging to assess due to the lack of actual ground-truth SMPL, with the Human 3. 6m dataset, we qualitatively show that our joint locations are more accurate and that our regressor leads to improved pose estimations results on the test set without any need for retraining.
1 code implementation • 28 Feb 2022 • Paritosh Parmar, Amol Gharat, Helge Rhodin
To that end, we propose to learn exercise-oriented image and video representations from unlabeled samples such that a small dataset annotated by experts suffices for supervised error detection.
no code implementations • 22 Dec 2021 • Michael Zwölfer, Dieter Heinrich, Kurt Schindelwig, Bastian Wandt, Helge Rhodin, Joerg Spoerri, Werner Nachbauer
Injury analysis may be one of the most beneficial applications of deep learning based human pose estimation.
1 code implementation • CVPR 2022 • Mohsen Gholami, Bastian Wandt, Helge Rhodin, Rabab Ward, Z. Jane Wang
To this end, we propose AdaptPose, an end-to-end framework that generates synthetic 3D human motions from a source dataset and uses them to fine-tune a 3D pose estimator.
no code implementations • CVPR 2022 • Bastian Wandt, James J. Little, Helge Rhodin
Human pose estimation from single images is a challenging problem that is typically solved by supervised learning.
Ranked #1 on
Unsupervised 3D Human Pose Estimation
on Human3.6M
3D Human Pose Estimation
Unsupervised 3D Human Pose Estimation
+1
1 code implementation • CVPR 2022 • Xingzhe He, Bastian Wandt, Helge Rhodin
Segmenting an image into its parts is a frequent preprocess for high-level vision tasks such as image editing.
Ranked #2 on
Unsupervised Human Pose Estimation
on DeepFashion
no code implementations • 6 Jul 2021 • Marc Habermann, Weipeng Xu, Helge Rhodin, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt
Our texture term exploits the orientation information in the micro-structures of the objects, e. g., the yarn patterns of fabrics.
no code implementations • 14 May 2021 • Mohsen Gholami, Ahmad Rezaei, Helge Rhodin, Rabab Ward, Z. Jane Wang
Estimating 3D human poses from video is a challenging problem.
Ranked #11 on
Weakly-supervised 3D Human Pose Estimation
on Human3.6M
1 code implementation • 29 Mar 2021 • Xingzhe He, Bastian Wandt, Helge Rhodin
Generative adversarial networks (GANs) have attained photo-realistic quality in image generation.
Ranked #4 on
Unsupervised Human Pose Estimation
on DeepFashion
no code implementations • NeurIPS 2021 • Shih-Yang Su, Frank Yu, Michael Zollhoefer, Helge Rhodin
We propose a method to learn a generative neural body model from unlabelled monocular videos by extending Neural Radiance Fields (NeRFs).
1 code implementation • 22 Dec 2020 • Chunjin Song, Yuchi Zhang, Willis Peng, Parmis Mohaghegh, Bastian Wandt, Helge Rhodin
Different from existing models that translate to hand sign language, between speech and text, or text and images, we target immediate and low-level audio to video translation that applies to generic environment sounds as well as human speech.
1 code implementation • ICCV 2021 • Isinsu Katircioglu, Helge Rhodin, Jörg Spörri, Mathieu Salzmann, Pascal Fua
Self-supervised detection and segmentation of foreground objects aims for accuracy without annotated training data.
no code implementations • 2 Dec 2020 • Sina Honari, Victor Constantin, Helge Rhodin, Mathieu Salzmann, Pascal Fua
In this paper we propose an unsupervised feature extraction method to capture temporal information on monocular videos, where we detect and encode subject of interest in each frame and leverage contrastive self-supervised (CSS) learning to extract rich latent vectors.
1 code implementation • CVPR 2021 • Bastian Wandt, Marco Rudolph, Petrissa Zell, Helge Rhodin, Bodo Rosenhahn
Human pose estimation from single images is a challenging problem in computer vision that requires large amounts of labeled training data to be solved accurately.
Ranked #3 on
3D Human Pose Estimation
on SkiPose
Monocular 3D Human Pose Estimation
Weakly-supervised 3D Human Pose Estimation
1 code implementation • CVPR 2021 • Frank Yu, Mathieu Salzmann, Pascal Fua, Helge Rhodin
Our conclusion is that it is important to utilize camera calibration information when available, for classical and deep-learning-based computer vision alike.
no code implementations • 11 Nov 2020 • Isinsu Katircioglu, Helge Rhodin, Victor Constantin, Jörg Spörri, Mathieu Salzmann, Pascal Fua
While supervised object detection and segmentation methods achieve impressive accuracy, they generalize poorly to images whose appearance significantly differs from the data they have been trained on.
no code implementations • 10 Nov 2020 • Shenyi Pan, Shuxian Fan, Samuel W. K. Wong, James V. Zidek, Helge Rhodin
Specific to the lumber application, we also propose an algorithm to correct any misalignment in the raw timber images during scanning, and contribute the first open-source lumber knot dataset by labeling the elliptical knots in the preprocessed images.
no code implementations • CVPR 2020 • Siyuan Li, Semih Günel, Mirela Ostrek, Pavan Ramdya, Pascal Fua, Helge Rhodin
We compare our approach with existing domain transfer methods and demonstrate improved pose estimation accuracy on Drosophila melanogaster (fruit fly), Caenorhabditis elegans (worm) and Danio rerio (zebrafish), without requiring any manual annotation on the target domain and despite using simplistic off-the-shelf animal characters for simulation, or simple geometric shapes as models.
1 code implementation • CVPR 2020 • Yuan Yao, Nico Schertler, Enrique Rosales, Helge Rhodin, Leonid Sigal, Alla Sheffer
Reconstruction of a 3D shape from a single 2D image is a classical computer vision problem, whose difficulty stems from the inherent ambiguity of recovering occluded or only partially observed surfaces.
1 code implementation • CVPR 2020 • Sena Kiciroglu, Helge Rhodin, Sudipta N. Sinha, Mathieu Salzmann, Pascal Fua
The accuracy of monocular 3D human pose estimation depends on the viewpoint from which the image is captured.
no code implementations • ICCV 2019 • Didier Bieler, Semih Günel, Pascal Fua, Helge Rhodin
We show theoretically and empirically that a simple motion trajectory analysis suffices to translate from pixel measurements to the person's metric height, reaching a MAE of up to 3. 9 cm on jumping motions, and that this works without camera and ground plane calibration.
no code implementations • 30 Aug 2019 • Roman Bachmann, Jörg Spörri, Pascal Fua, Helge Rhodin
We propose a method for estimating an athlete's global 3D position and articulated pose using multiple cameras.
no code implementations • 18 Jul 2019 • Isinsu Katircioglu, Helge Rhodin, Victor Constantin, Jörg Spörri, Mathieu Salzmann, Pascal Fua
While supervised object detection methods achieve impressive accuracy, they generalize poorly to images whose appearance significantly differs from the data they have been trained on.
4 code implementations • 1 Jul 2019 • Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Mohamed Elgharib, Pascal Fua, Hans-Peter Seidel, Helge Rhodin, Gerard Pons-Moll, Christian Theobalt
The first stage is a convolutional neural network (CNN) that estimates 2D and 3D pose features along with identity assignments for all visible joints of all individuals. We contribute a new architecture for this CNN, called SelecSLS Net, that uses novel selective long and short range skip connections to improve the information flow allowing for a drastically faster network without compromising accuracy.
Ranked #6 on
3D Multi-Person Pose Estimation
on MuPoTS-3D
3D Multi-Person Human Pose Estimation
3D Multi-Person Pose Estimation
+1
1 code implementation • CVPR 2019 • Helge Rhodin, Victor Constantin, Isinsu Katircioglu, Mathieu Salzmann, Pascal Fua
To this end, we introduce a self-supervised approach to learning what we call a neural scene decomposition (NSD) that can be exploited for 3D pose estimation.
no code implementations • 25 May 2018 • Semih Günel, Helge Rhodin, Pascal Fua
Recovering a person's height from a single image is important for virtual garment fitting, autonomous driving and surveillance, however, it is also very challenging due to the absence of absolute scale information.
2 code implementations • ECCV 2018 • Helge Rhodin, Mathieu Salzmann, Pascal Fua
In this paper, we propose to overcome this problem by learning a geometry-aware body representation from multi-view images without annotations.
Ranked #27 on
Weakly-supervised 3D Human Pose Estimation
on Human3.6M
no code implementations • 15 Mar 2018 • Weipeng Xu, Avishek Chatterjee, Michael Zollhoefer, Helge Rhodin, Pascal Fua, Hans-Peter Seidel, Christian Theobalt
We tackle these challenges based on a novel lightweight setup that converts a standard baseball cap to a device for high-quality pose estimation based on a single cap-mounted fisheye camera.
no code implementations • CVPR 2018 • Helge Rhodin, Jörg Spörri, Isinsu Katircioglu, Victor Constantin, Frédéric Meyer, Erich Müller, Mathieu Salzmann, Pascal Fua
Accurate 3D human pose estimation from single images is possible with sophisticated deep-net architectures that have been trained on very large datasets.
no code implementations • 7 Aug 2017 • Weipeng Xu, Avishek Chatterjee, Michael Zollhöfer, Helge Rhodin, Dushyant Mehta, Hans-Peter Seidel, Christian Theobalt
Reconstruction from monocular video alone is drastically more challenging, since strong occlusions and the inherent depth ambiguity lead to a highly ill-posed reconstruction problem.
1 code implementation • 3 May 2017 • Dushyant Mehta, Srinath Sridhar, Oleksandr Sotnychenko, Helge Rhodin, Mohammad Shafiei, Hans-Peter Seidel, Weipeng Xu, Dan Casas, Christian Theobalt
A real-time kinematic skeleton fitting method uses the CNN output to yield temporally stable 3D global pose reconstructions on the basis of a coherent kinematic skeleton.
Ranked #16 on
Pose Estimation
on Leeds Sports Poses
no code implementations • 31 Dec 2016 • Helge Rhodin, Christian Richardt, Dan Casas, Eldar Insafutdinov, Mohammad Shafiei, Hans-Peter Seidel, Bernt Schiele, Christian Theobalt
Marker-based and marker-less optical skeletal motion-capture methods use an outside-in arrangement of cameras placed around a scene, with viewpoints converging on the center.
no code implementations • 29 Nov 2016 • Dushyant Mehta, Helge Rhodin, Dan Casas, Pascal Fua, Oleksandr Sotnychenko, Weipeng Xu, Christian Theobalt
We propose a CNN-based approach for 3D human body pose estimation from single RGB images that addresses the issue of limited generalizability of models trained solely on the starkly limited publicly available 3D pose data.
Ranked #17 on
Pose Estimation
on Leeds Sports Poses
no code implementations • 21 Oct 2016 • Nadia Robertini, Dan Casas, Helge Rhodin, Hans-Peter Seidel, Christian Theobalt
We propose a new model-based method to accurately reconstruct human performances captured outdoors in a multi-camera setup.
no code implementations • 23 Sep 2016 • Helge Rhodin, Christian Richardt, Dan Casas, Eldar Insafutdinov, Mohammad Shafiei, Hans-Peter Seidel, Bernt Schiele, Christian Theobalt
We therefore propose a new method for real-time, marker-less and egocentric motion capture which estimates the full-body skeleton pose from a lightweight stereo pair of fisheye cameras that are attached to a helmet or virtual reality headset.
no code implementations • 28 Jul 2016 • Helge Rhodin, Nadia Robertini, Dan Casas, Christian Richardt, Hans-Peter Seidel, Christian Theobalt
Our method uses a new image formation model with analytic visibility and analytically differentiable alignment energy.
no code implementations • 11 Feb 2016 • Srinath Sridhar, Helge Rhodin, Hans-Peter Seidel, Antti Oulasvirta, Christian Theobalt
In this paper, we propose a new approach that tracks the full skeleton motion of the hand from multiple RGB cameras in real-time.
no code implementations • ICCV 2015 • Helge Rhodin, Nadia Robertini, Christian Richardt, Hans-Peter Seidel, Christian Theobalt
Generative reconstruction methods compute the 3D configuration (such as pose and/or geometry) of a shape by optimizing the overlap of the projected 3D shape model with images.