Search Results for author: Gerard Pons-Moll

Found 74 papers, 34 papers with code

NASA Neural Articulated Shape Approximation

no code implementations ECCV 2020 Boyang Deng, JP Lewis, Timothy Jeruzalski, Gerard Pons-Moll, Geoffrey Hinton, Mohammad Norouzi, Andrea Tagliasacchi

Efficient representation of articulated objects such as human bodies is an important problem in computer vision and graphics.

GEARS: Local Geometry-aware Hand-object Interaction Synthesis

no code implementations2 Apr 2024 Keyang Zhou, Bharat Lal Bhatnagar, Jan Eric Lenssen, Gerard Pons-Moll

Generating realistic hand motion sequences in interaction with objects has gained increasing attention with the growing interest in digital humans.

Object

Recent Trends in 3D Reconstruction of General Non-Rigid Scenes

no code implementations22 Mar 2024 Raza Yunus, Jan Eric Lenssen, Michael Niemeyer, Yiyi Liao, Christian Rupprecht, Christian Theobalt, Gerard Pons-Moll, Jia-Bin Huang, Vladislav Golyanik, Eddy Ilg

Reconstructing models of the real world, including 3D geometry, appearance, and motion of real scenes, is essential for computer graphics and computer vision.

3D Reconstruction Navigate

NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors

no code implementations5 Mar 2024 Yannan He, Garvita Tiwari, Tolga Birdal, Jan Eric Lenssen, Gerard Pons-Moll

Faithfully modeling the space of articulations is a crucial task that allows recovery and generation of realistic poses, and remains a notorious challenge.

Pose Estimation

CloSe: A 3D Clothing Segmentation Dataset and Model

no code implementations22 Jan 2024 Dimitrije Antić, Garvita Tiwari, Batuhan Ozcomlekci, Riccardo Marin, Gerard Pons-Moll

Additionally, we propose CloSe-Net, the first learning-based 3D clothing segmentation model for fine-grained segmentation from colored point clouds.

Continual Learning Segmentation

NICP: Neural ICP for 3D Human Registration at Scale

1 code implementation21 Dec 2023 Riccardo Marin, Enric Corona, Gerard Pons-Moll

NSR achieves the state of the art over public benchmarks, and the release of its code and checkpoints will provide the community with a powerful tool useful for many downstream tasks like dataset alignments, cleaning, or asset animation.

Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering

no code implementations18 Dec 2023 Kim Youwang, Tae-Hyun Oh, Gerard Pons-Moll

We present Paint-it, a text-driven high-fidelity texture map synthesis method for 3D meshes via neural re-parameterized texture optimization.

Texture Synthesis

Template Free Reconstruction of Human-object Interaction with Procedural Interaction Generation

no code implementations12 Dec 2023 Xianghui Xie, Bharat Lal Bhatnagar, Jan Eric Lenssen, Gerard Pons-Moll

We generate 1M+ human-object interaction pairs in 3D and leverage this large-scale data to train our HDM (Hierarchical Diffusion Model), a novel method to reconstruct interacting human and unseen objects, without any templates.

Human-Object Interaction Detection Object

GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar

no code implementations22 Nov 2023 Berna Kabadayi, Wojciech Zielonka, Bharat Lal Bhatnagar, Gerard Pons-Moll, Justus Thies

For controlling the model, we learn a mapping from 3DMM facial expression parameters to the latent space of the generative model.

Image Generation

NSF: Neural Surface Fields for Human Modeling from Monocular Depth

no code implementations ICCV 2023 Yuxuan Xue, Bharat Lal Bhatnagar, Riccardo Marin, Nikolaos Sarafianos, Yuanlu Xu, Gerard Pons-Moll, Tony Tung

Compared to existing approaches, our method eliminates the expensive per-frame surface extraction while maintaining mesh coherency, and is capable of reconstructing meshes with arbitrary resolution without retraining.

Computational Efficiency Virtual Try-on

Object pop-up: Can we infer 3D objects and their poses from human interactions alone?

1 code implementation CVPR 2023 Ilya A. Petrov, Riccardo Marin, Julian Chibane, Gerard Pons-Moll

The intimate entanglement between objects affordances and human poses is of large interest, among others, for behavioural sciences, cognitive psychology, and Computer Vision communities.

Object

Generating Continual Human Motion in Diverse 3D Scenes

no code implementations4 Apr 2023 Aymen Mir, Xavier Puig, Angjoo Kanazawa, Gerard Pons-Moll

We decompose the continual motion synthesis problem into walking along paths and transitioning in and out of the actions specified by the keypoints, which enables long generation of motions that satisfy scene constraints without explicitly incorporating scene information.

Motion Synthesis Navigate

Visibility Aware Human-Object Interaction Tracking from Single RGB Camera

no code implementations CVPR 2023 Xianghui Xie, Bharat Lal Bhatnagar, Gerard Pons-Moll

In this work, we propose a novel method to track the 3D human, object, contacts between them, and their relative translation across frames from a single RGB camera, while being robust to heavy occlusions.

3D Human Reconstruction 3D Object Reconstruction +3

HDHumans: A Hybrid Approach for High-fidelity Digital Humans

no code implementations21 Oct 2022 Marc Habermann, Lingjie Liu, Weipeng Xu, Gerard Pons-Moll, Michael Zollhoefer, Christian Theobalt

Photo-real digital human avatars are of enormous importance in graphics, as they enable immersive communication over the globe, improve gaming and entertainment experiences, and can be particularly beneficial for AR and VR settings.

Novel View Synthesis Surface Reconstruction +1

Skeleton-free Pose Transfer for Stylized 3D Characters

1 code implementation28 Jul 2022 Zhouyingcheng Liao, Jimei Yang, Jun Saito, Gerard Pons-Moll, Yang Zhou

We present the first method that automatically transfers poses between stylized 3D characters without skeletal rigging.

Pose Transfer

Pose-NDF: Modeling Human Pose Manifolds with Neural Distance Fields

1 code implementation27 Jul 2022 Garvita Tiwari, Dimitrije Antic, Jan Eric Lenssen, Nikolaos Sarafianos, Tony Tung, Gerard Pons-Moll

The resulting high-dimensional implicit function can be differentiated with respect to the input poses and thus can be used to project arbitrary poses onto the manifold by using gradient descent on the set of 3-dimensional hyperspheres.

Denoising

TOCH: Spatio-Temporal Object-to-Hand Correspondence for Motion Refinement

no code implementations16 May 2022 Keyang Zhou, Bharat Lal Bhatnagar, Jan Eric Lenssen, Gerard Pons-Moll

The core of our method are TOCH fields, a novel spatio-temporal representation for modeling correspondences between hands and objects during interaction.

Denoising Object +1

Learned Vertex Descent: A New Direction for 3D Human Model Fitting

no code implementations12 May 2022 Enric Corona, Gerard Pons-Moll, Guillem Alenyà, Francesc Moreno-Noguer

An exhaustive evaluation demonstrates that our approach is able to capture the underlying body of clothed people with very different body shapes, achieving a significant improvement compared to state-of-the-art.

Interaction Replica: Tracking Human-Object Interaction and Scene Changes From Human Motion

no code implementations5 May 2022 Vladimir Guzov, Julian Chibane, Riccardo Marin, Yannan He, Yunus Saracoglu, Torsten Sattler, Gerard Pons-Moll

In order for widespread adoption of such emerging applications, the sensor setup used to capture the interactions needs to be inexpensive and easy-to-use for non-expert users.

Human-Object Interaction Detection Object +2

COUCH: Towards Controllable Human-Chair Interactions

no code implementations1 May 2022 Xiaohan Zhang, Bharat Lal Bhatnagar, Vladimir Guzov, Sebastian Starke, Gerard Pons-Moll

In this work, we study the problem of synthesizing scene interactions conditioned on different contact positions on the object.

Human-Object Interaction Detection Object

Control-NeRF: Editable Feature Volumes for Scene Rendering and Manipulation

no code implementations22 Apr 2022 Verica Lazova, Vladimir Guzov, Kyle Olszewski, Sergey Tulyakov, Gerard Pons-Moll

With the aim of obtaining interpretable and controllable scene representations, our model couples learnt scene-specific feature volumes with a scene agnostic neural rendering network.

Neural Rendering Novel View Synthesis

BEHAVE: Dataset and Method for Tracking Human Object Interactions

1 code implementation CVPR 2022 Bharat Lal Bhatnagar, Xianghui Xie, Ilya A. Petrov, Cristian Sminchisescu, Christian Theobalt, Gerard Pons-Moll

We present BEHAVE dataset, the first full body human- object interaction dataset with multi-view RGBD frames and corresponding 3D SMPL and object fits along with the annotated contacts between them.

Human-Object Interaction Detection Mixed Reality +1

CHORE: Contact, Human and Object REconstruction from a single RGB image

1 code implementation5 Apr 2022 Xianghui Xie, Bharat Lal Bhatnagar, Gerard Pons-Moll

However, humans are constantly interacting with the surrounding objects, thus calling for models that can reason about not only the human but also the object and their interaction.

3D Human Reconstruction 3D Object Reconstruction +1

A Deeper Look into DeepCap

no code implementations20 Nov 2021 Marc Habermann, Weipeng Xu, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt

Human performance capture is a highly important computer vision problem with many applications in movie production and virtual/augmented reality.

Pose Estimation

Neural-GIF: Neural Generalized Implicit Functions for Animating People in Clothing

1 code implementation ICCV 2021 Garvita Tiwari, Nikolaos Sarafianos, Tony Tung, Gerard Pons-Moll

Neural-GIF can be trained on raw 3D scans and reconstructs detailed complex surface geometry and deformations.

NRST: Non-rigid Surface Tracking from Monocular Video

no code implementations6 Jul 2021 Marc Habermann, Weipeng Xu, Helge Rhodin, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt

Our texture term exploits the orientation information in the micro-structures of the objects, e. g., the yarn patterns of fabrics.

Real-time Deep Dynamic Characters

no code implementations4 May 2021 Marc Habermann, Lingjie Liu, Weipeng Xu, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt

We propose a deep videorealistic 3D human character model displaying highly realistic shape, motion, and dynamic appearance learned in a new weakly supervised way from multi-view imagery.

Stereo Radiance Fields (SRF): Learning View Synthesis for Sparse Views of Novel Scenes

1 code implementation CVPR 2021 Julian Chibane, Aayush Bansal, Verica Lazova, Gerard Pons-Moll

Recent neural view synthesis methods have achieved impressive quality and realism, surpassing classical pipelines which rely on multi-view reconstruction.

Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted Sensors

1 code implementation CVPR 2021 Vladimir Guzov, Aymen Mir, Torsten Sattler, Gerard Pons-Moll

We introduce (HPS) Human POSEitioning System, a method to recover the full 3D pose of a human registered with a 3D scan of the surrounding environment using wearable sensors.

3D Human Pose Estimation 3D Pose Estimation +1

SMPLicit: Topology-aware Generative Model for Clothed People

1 code implementation CVPR 2021 Enric Corona, Albert Pumarola, Guillem Alenyà, Gerard Pons-Moll, Francesc Moreno-Noguer

In this paper we introduce SMPLicit, a novel generative model to jointly represent body pose, shape and clothing geometry.

3D Reconstruction

Learning Speech-driven 3D Conversational Gestures from Video

no code implementations13 Feb 2021 Ikhsanul Habibie, Weipeng Xu, Dushyant Mehta, Lingjie Liu, Hans-Peter Seidel, Gerard Pons-Moll, Mohamed Elgharib, Christian Theobalt

We propose the first approach to automatically and jointly synthesize both the synchronous 3D conversational body and hand gestures, as well as 3D face and head animations, of a virtual character from speech input.

3D Face Animation Generative Adversarial Network +2

Adjoint Rigid Transform Network: Task-conditioned Alignment of 3D Shapes

no code implementations1 Feb 2021 Keyang Zhou, Bharat Lal Bhatnagar, Bernt Schiele, Gerard Pons-Moll

The remarkable result is that with only self-supervision, ART facilitates learning a unique canonical orientation for both rigid and nonrigid shapes, which leads to a notable boost in performance of aforementioned tasks.

Disentanglement

D-NeRF: Neural Radiance Fields for Dynamic Scenes

1 code implementation CVPR 2021 Albert Pumarola, Enric Corona, Gerard Pons-Moll, Francesc Moreno-Noguer

In this paper we introduce D-NeRF, a method that extends neural radiance fields to a dynamic domain, allowing to reconstruct and render novel images of objects under rigid and non-rigid motions from a \emph{single} camera moving around the scene.

Neural Rendering

SelfPose: 3D Egocentric Pose Estimation from a Headset Mounted Camera

1 code implementation2 Nov 2020 Denis Tome, Thiemo Alldieck, Patrick Peluse, Gerard Pons-Moll, Lourdes Agapito, Hernan Badino, Fernando de la Torre

The quantitative evaluation, on synthetic and real-world datasets, shows that our strategy leads to substantial improvements in accuracy over state of the art egocentric approaches.

Egocentric Pose Estimation Pose Estimation

Neural Unsigned Distance Fields for Implicit Function Learning

1 code implementation NeurIPS 2020 Julian Chibane, Aymen Mir, Gerard Pons-Moll

NDF represent surfaces at high resolutions as prior implicit models, but do not require closed surface data, and significantly broaden the class of representable shapes in the output.

Multi-target regression

LoopReg: Self-supervised Learning of Implicit Surface Correspondences, Pose and Shape for 3D Human Mesh Registration

no code implementations NeurIPS 2020 Bharat Lal Bhatnagar, Cristian Sminchisescu, Christian Theobalt, Gerard Pons-Moll

Formulating this closed loop is not straightforward because it is not trivial to force the output of the NN to be on the surface of the human model - outside this surface the human model is not even defined.

Self-Supervised Learning

Implicit Feature Networks for Texture Completion from Partial 3D Data

1 code implementation20 Sep 2020 Julian Chibane, Gerard Pons-Moll

Instead, we focus on 3D texture and geometry completion from partial and incomplete 3D scans.

Combining Implicit Function Learning and Parametric Models for 3D Human Reconstruction

1 code implementation ECCV 2020 Bharat Lal Bhatnagar, Cristian Sminchisescu, Christian Theobalt, Gerard Pons-Moll

In this work, we present methodology that combines detail-rich implicit functions and parametric representations in order to reconstruct 3D models of people that remain controllable and accurate even in the presence of clothing.

3D Human Pose Estimation 3D Human Reconstruction

SIZER: A Dataset and Model for Parsing 3D Clothing and Learning Size Sensitive 3D Clothing

1 code implementation ECCV 2020 Garvita Tiwari, Bharat Lal Bhatnagar, Tony Tung, Gerard Pons-Moll

SizerNet allows to estimate and visualize the dressing effect of a garment in various sizes, and ParserNet allows to edit clothing of an input mesh directly, removing the need for scan segmentation, which is a challenging problem in itself.

3D Human Pose Estimation

Kinematic 3D Object Detection in Monocular Video

2 code implementations ECCV 2020 Garrick Brazil, Gerard Pons-Moll, Xiaoming Liu, Bernt Schiele

In this work, we propose a novel method for monocular video-based 3D object detection which carefully leverages kinematic motion to improve precision of 3D localization.

Monocular 3D Object Detection Object +2

DeepCap: Monocular Human Performance Capture Using Weak Supervision

no code implementations CVPR 2020 Marc Habermann, Weipeng Xu, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt

Human performance capture is a highly important computer vision problem with many applications in movie production and virtual/augmented reality.

Pose Estimation

TailorNet: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style

2 code implementations CVPR 2020 Chaitanya Patel, Zhouyingcheng Liao, Gerard Pons-Moll

While the low-frequency component is predicted from pose, shape and style parameters with an MLP, the high-frequency component is predicted with a mixture of shape-style specific pose models.

3D Human Pose Estimation 3D Shape Reconstruction

Learning to Transfer Texture from Clothing Images to 3D Humans

1 code implementation CVPR 2020 Aymen Mir, Thiemo Alldieck, Gerard Pons-Moll

In this paper, we present a simple yet effective method to automatically transfer textures of clothing images (front and back) to 3D garments worn on top SMPL, in real time.

Image-to-Image Translation Translation +1

Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion

1 code implementation CVPR 2020 Julian Chibane, Thiemo Alldieck, Gerard Pons-Moll

To solve this, we propose Implicit Feature Networks (IF-Nets), which deliver continuous outputs, can handle multiple topologies, and complete shapes for missing or sparse input data retaining the nice properties of recent learned implicit functions, but critically they can also retain detail when it is present in the input data, and can reconstruct articulated humans.

3D Object Reconstruction 3D Reconstruction +1

NASA: Neural Articulated Shape Approximation

no code implementations6 Dec 2019 Boyang Deng, JP Lewis, Timothy Jeruzalski, Gerard Pons-Moll, Geoffrey Hinton, Mohammad Norouzi, Andrea Tagliasacchi

Efficient representation of articulated objects such as human bodies is an important problem in computer vision and graphics.

360-Degree Textures of People in Clothing from a Single Image

no code implementations20 Aug 2019 Verica Lazova, Eldar Insafutdinov, Gerard Pons-Moll

In order to learn our model in a common UV-space, we non-rigidly register the SMPL model to thousands of 3D scans, effectively encoding textures and geometries as images in correspondence.

Image-to-Image Translation Translation

Multi-Garment Net: Learning to Dress 3D People from Images

6 code implementations ICCV 2019 Bharat Lal Bhatnagar, Garvita Tiwari, Christian Theobalt, Gerard Pons-Moll

We present Multi-Garment Network (MGN), a method to predict body shape and clothing, layered on top of the SMPL model from a few frames (1-8) of a video.

3D Human Pose Estimation 3D Shape Reconstruction From A Single 2D Image

XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera

4 code implementations1 Jul 2019 Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Mohamed Elgharib, Pascal Fua, Hans-Peter Seidel, Helge Rhodin, Gerard Pons-Moll, Christian Theobalt

The first stage is a convolutional neural network (CNN) that estimates 2D and 3D pose features along with identity assignments for all visible joints of all individuals. We contribute a new architecture for this CNN, called SelecSLS Net, that uses novel selective long and short range skip connections to improve the information flow allowing for a drastically faster network without compromising accuracy.

3D Multi-Person Human Pose Estimation 3D Multi-Person Pose Estimation +1

Body Shape Privacy in Images: Understanding Privacy and Preventing Automatic Shape Extraction

no code implementations27 May 2019 Hosnieh Sattar, Katharina Krombholz, Gerard Pons-Moll, Mario Fritz

Modern approaches to pose and body shape estimation have recently achieved strong performance even under challenging real-world conditions.

Recommendation Systems Virtual Try-on

Tex2Shape: Detailed Full Human Body Geometry From a Single Image

1 code implementation ICCV 2019 Thiemo Alldieck, Gerard Pons-Moll, Christian Theobalt, Marcus Magnor

From a partial texture, we estimate detailed normal and vector displacement maps, which can be applied to a low-resolution smooth body model to add detail and clothing.

Image-to-Image Translation Translation

AMASS: Archive of Motion Capture as Surface Shapes

4 code implementations ICCV 2019 Naureen Mahmood, Nima Ghorbani, Nikolaus F. Troje, Gerard Pons-Moll, Michael J. Black

We achieve this using a new method, MoSh++, that converts mocap data into realistic 3D human meshes represented by a rigged body model; here we use SMPL [doi:10. 1145/2816795. 2818013], which is widely used and provides a standard skeletal representation as well as a fully rigged surface mesh.

SimulCap : Single-View Human Performance Capture with Cloth Simulation

no code implementations CVPR 2019 Tao Yu, Zerong Zheng, Yuan Zhong, Jianhui Zhao, Qionghai Dai, Gerard Pons-Moll, Yebin Liu

This paper proposes a new method for live free-viewpoint human performance capture with dynamic details (e. g., cloth wrinkles) using a single RGBD camera.

Learning to Reconstruct People in Clothing from a Single RGB Camera

1 code implementation CVPR 2019 Thiemo Alldieck, Marcus Magnor, Bharat Lal Bhatnagar, Christian Theobalt, Gerard Pons-Moll

We present a learning-based model to infer the personalized 3D shape of people from a few frames (1-8) of a monocular video in which the person is moving, in less than 10 seconds with a reconstruction accuracy of 5mm.

Recovering Accurate 3D Human Pose in The Wild Using IMUs and a Moving Camera

no code implementations ECCV 2018 Timo von Marcard, Roberto Henschel, Michael J. Black, Bodo Rosenhahn, Gerard Pons-Moll

In this work, we propose a method that combines a single hand-held camera and a set of Inertial Measurement Units (IMUs) attached at the body limbs to estimate accurate 3D poses in the wild.

3D Pose Estimation

Detailed Human Avatars from Monocular Video

1 code implementation3 Aug 2018 Thiemo Alldieck, Marcus Magnor, Weipeng Xu, Christian Theobalt, Gerard Pons-Moll

We present a novel method for high detail-preserving human avatar creation from monocular video.

Fashion is Taking Shape: Understanding Clothing Preference Based on Body Shape From Online Sources

no code implementations9 Jul 2018 Hosnieh Sattar, Gerard Pons-Moll, Mario Fritz

To study the correlation between clothing garments and body shape, we collected a new dataset (Fashion Takes Shape), which includes images of users with clothing category annotations.

Video Based Reconstruction of 3D People Models

1 code implementation CVPR 2018 Thiemo Alldieck, Marcus Magnor, Weipeng Xu, Christian Theobalt, Gerard Pons-Moll

This paper describes how to obtain accurate 3D body models and texture of arbitrary people from a single, monocular video in which a person is moving.

3D Reconstruction Surface Reconstruction +1

Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB

6 code implementations9 Dec 2017 Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Srinath Sridhar, Gerard Pons-Moll, Christian Theobalt

Our approach uses novel occlusion-robust pose-maps (ORPM) which enable full body pose inference even under strong partial occlusions by other people and objects in the scene.

3D Human Pose Estimation 3D Multi-Person Pose Estimation (absolute) +2

Dynamic FAUST: Registering Human Bodies in Motion

no code implementations CVPR 2017 Federica Bogo, Javier Romero, Gerard Pons-Moll, Michael J. Black

We propose a new mesh registration method that uses both 3D geometry and texture information to register all scans in a sequence to a common reference topology.

A Generative Model of People in Clothing

1 code implementation ICCV 2017 Christoph Lassner, Gerard Pons-Moll, Peter V. Gehler

We present the first image-based generative model of people in clothing for the full body.

Semantic Segmentation

Sparse Inertial Poser: Automatic 3D Human Pose Estimation from Sparse IMUs

no code implementations23 Mar 2017 Timo von Marcard, Bodo Rosenhahn, Michael J. Black, Gerard Pons-Moll

We address the problem of making human motion capture in the wild more practical by using a small set of inertial sensors attached to the body.

3D Human Pose Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.