Search Results for author: Christian Theobalt

Found 215 papers, 51 papers with code

Building Statistical Shape Spaces for 3D Human Modeling

no code implementations • 19 Mar 2015 • Leonid Pishchulin, Stefanie Wuhrer, Thomas Helten, Christian Theobalt, Bernt Schiele

Statistical models of 3D human shape and pose learned from scan databases have developed into valuable tools to solve a variety of vision and graphics problems.

Paper
Add Code

Efficient ConvNet-Based Marker-Less Motion Capture in General Scenes With a Low Number of Cameras

no code implementations • CVPR 2015 • Ahmed Elhayek, Edilson de Aguiar, Arjun Jain, Jonathan Tompson, Leonid Pishchulin, Micha Andriluka, Chris Bregler, Bernt Schiele, Christian Theobalt

Our approach unites a discriminative image-based joint detection method with a model-based generative motion tracking algorithm through a combined pose optimization energy.

Pose Estimation

Paper
Add Code

Efficient Multi-view Performance Capture of Fine-Scale Surface Detail

no code implementations • 5 Feb 2016 • Nadia Robertini, Edilson de Aguiar, Thomas Helten, Christian Theobalt

We present a new effective way for performance capture of deforming meshes with fine-scale time-varying surface detail from multi-view video.

Occlusion Handling

Paper
Add Code

Automatic Face Reenactment

no code implementations • CVPR 2014 • Pablo Garrido, Levi Valgaerts, Ole Rehmsen, Thorsten Thormaehlen, Patrick Perez, Christian Theobalt

We propose an image-based, facial reenactment system that replaces the face of an actor in an existing target video with the face of a user from a source video, while preserving the original target performance.

Clustering Face Model +4

Paper
Add Code

Real-Time Hand Tracking Using a Sum of Anisotropic Gaussians Model

no code implementations • 11 Feb 2016 • Srinath Sridhar, Helge Rhodin, Hans-Peter Seidel, Antti Oulasvirta, Christian Theobalt

In this paper, we propose a new approach that tracks the full skeleton motion of the hand from multiple RGB cameras in real-time.

Paper
Add Code

Semi-supervised Learning with Explicit Relationship Regularization

no code implementations • CVPR 2015 • Kwang In Kim, James Tompkin, Hanspeter Pfister, Christian Theobalt

In many learning tasks, the structure of the target space of a function holds rich information about the relationships between evaluations of functions on different data points.

Constrained Clustering Dimensionality Reduction +1

Paper
Add Code

Local High-order Regularization on Data Manifolds

no code implementations • CVPR 2015 • Kwang In Kim, James Tompkin, Hanspeter Pfister, Christian Theobalt

The iterated graph Laplacian enables high-order regularization, but it has a high computational complexity and so cannot be applied to large problems.

Dimensionality Reduction Vocal Bursts Intensity Prediction

Paper
Add Code

A Versatile Scene Model with Differentiable Visibility Applied to Generative Pose Estimation

no code implementations • ICCV 2015 • Helge Rhodin, Nadia Robertini, Christian Richardt, Hans-Peter Seidel, Christian Theobalt

Generative reconstruction methods compute the 3D configuration (such as pose and/or geometry) of a shape by optimizing the overlap of the projected 3D shape model with images.

Occlusion Handling Pose Estimation

Paper
Add Code

Fast and Robust Hand Tracking Using Detection-Guided Optimization

no code implementations • CVPR 2015 • Srinath Sridhar, Franziska Mueller, Antti Oulasvirta, Christian Theobalt

In the optimization step, a novel objective function combines the detected part labels and a Gaussian mixture representation of the depth to estimate a pose that best fits the depth.

Pose Estimation

Paper
Add Code

Context-guided diffusion for label propagation on graphs

no code implementations • ICCV 2015 • Kwang In Kim, James Tompkin, Hanspeter Pfister, Christian Theobalt

Existing approaches for diffusion on graphs, e. g., for label propagation, are mainly focused on isotropic diffusion, which is induced by the commonly-used graph Laplacian regularizer.

Paper
Add Code

VolumeDeform: Real-time Volumetric Non-rigid Reconstruction

no code implementations • 27 Mar 2016 • Matthias Innmann, Michael Zollhöfer, Matthias Nießner, Christian Theobalt, Marc Stamminger

We cast finding the optimal deformation of space as a non-linear regularized variational optimization problem by enforcing local smoothness and proximity to the input constraints.

Paper
Add Code

BundleFusion: Real-time Globally Consistent 3D Reconstruction using On-the-fly Surface Re-integration

1 code implementation • 5 Apr 2016 • Angela Dai, Matthias Nießner, Michael Zollhöfer, Shahram Izadi, Christian Theobalt

Our approach estimates globally optimized (i. e., bundle adjusted) poses in real-time, supports robust tracking with recovery from gross tracking failures (i. e., relocalization), and re-estimates the 3D model in real-time to ensure global consistency; all within a single framework.

3D Reconstruction Mixed Reality +1

Paper
Code

Opt: A Domain Specific Language for Non-linear Least Squares Optimization in Graphics and Imaging

no code implementations • 22 Apr 2016 • Zachary DeVito, Michael Mara, Michael Zollhöfer, Gilbert Bernstein, Jonathan Ragan-Kelley, Christian Theobalt, Pat Hanrahan, Matthew Fisher, Matthias Nießner

Many graphics and vision problems can be expressed as non-linear least squares optimizations of objective functions over visual data, such as images and meshes.

Paper
Add Code

General Automatic Human Shape and Motion Capture Using Volumetric Contour Cues

no code implementations • 28 Jul 2016 • Helge Rhodin, Nadia Robertini, Dan Casas, Christian Richardt, Hans-Peter Seidel, Christian Theobalt

Our method uses a new image formation model with analytic visibility and analytically differentiable alignment energy.

Markerless Motion Capture

Paper
Add Code

Dense Wide-Baseline Scene Flow From Two Handheld Video Cameras

no code implementations • 16 Sep 2016 • Christian Richardt, Hyeongwoo Kim, Levi Valgaerts, Christian Theobalt

We finally refine the computed correspondence fields in a variational scene flow formulation.

Vocal Bursts Valence Prediction

Paper
Add Code

EgoCap: Egocentric Marker-less Motion Capture with Two Fisheye Cameras

no code implementations • 23 Sep 2016 • Helge Rhodin, Christian Richardt, Dan Casas, Eldar Insafutdinov, Mohammad Shafiei, Hans-Peter Seidel, Bernt Schiele, Christian Theobalt

We therefore propose a new method for real-time, marker-less and egocentric motion capture which estimates the full-body skeleton pose from a lightweight stereo pair of fisheye cameras that are attached to a helmet or virtual reality headset.

Pose Estimation Vocal Bursts Valence Prediction

Paper
Add Code

FaceVR: Real-Time Facial Reenactment and Eye Gaze Control in Virtual Reality

no code implementations • 11 Oct 2016 • Justus Thies, Michael Zollhöfer, Marc Stamminger, Christian Theobalt, Matthias Nießner

Based on reenactment of a prerecorded stereo video of the person without the HMD, FaceVR incorporates photo-realistic re-rendering in real time, thus allowing artificial modifications of face and eye appearances.

Paper
Add Code

Video Depth-From-Defocus

no code implementations • 12 Oct 2016 • Hyeongwoo Kim, Christian Richardt, Christian Theobalt

Many compelling video post-processing effects, in particular aesthetic focus editing and refocusing effects, are feasible if per-frame depth information is available.

Paper
Add Code

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input

no code implementations • 16 Oct 2016 • Srinath Sridhar, Franziska Mueller, Michael Zollhöfer, Dan Casas, Antti Oulasvirta, Christian Theobalt

However, due to difficult occlusions, fast motions, and uniform hand appearance, jointly tracking hand and object pose is more challenging than tracking either of the two separately.

Object Object Tracking

Paper
Add Code

Model-based Outdoor Performance Capture

no code implementations • 21 Oct 2016 • Nadia Robertini, Dan Casas, Helge Rhodin, Hans-Peter Seidel, Christian Theobalt

We propose a new model-based method to accurately reconstruct human performances captured outdoors in a multi-camera setup.

Edge Detection

Paper
Add Code

Real-time Halfway Domain Reconstruction of Motion and Geometry

no code implementations • 23 Oct 2016 • Lucas Thies, Michael Zollhöfer, Christian Richardt, Christian Theobalt, Günther Greiner

Our extensive experiments and evaluations show that our approach produces high-quality dense reconstructions of 3D geometry and scene flow at real-time frame rates, and compares favorably to the state of the art.

Paper
Add Code

Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision

no code implementations • 29 Nov 2016 • Dushyant Mehta, Helge Rhodin, Dan Casas, Pascal Fua, Oleksandr Sotnychenko, Weipeng Xu, Christian Theobalt

We propose a CNN-based approach for 3D human body pose estimation from single RGB images that addresses the issue of limited generalizability of models trained solely on the starkly limited publicly available 3D pose data.

Ranked #17 on Pose Estimation on Leeds Sports Poses

Monocular 3D Human Pose Estimation Transfer Learning

Paper
Add Code

EgoCap: Egocentric Marker-less Motion Capture with Two Fisheye Cameras (Extended Abstract)

no code implementations • 31 Dec 2016 • Helge Rhodin, Christian Richardt, Dan Casas, Eldar Insafutdinov, Mohammad Shafiei, Hans-Peter Seidel, Bernt Schiele, Christian Theobalt

Marker-based and marker-less optical skeletal motion-capture methods use an outside-in arrangement of cameras placed around a scene, with viewpoints converging on the center.

Pose Estimation

Paper
Add Code

MoFA: Model-based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction

no code implementations • ICCV 2017 • Ayush Tewari, Michael Zollhöfer, Hyeongwoo Kim, Pablo Garrido, Florian Bernard, Patrick Pérez, Christian Theobalt

In this work we propose a novel model-based deep convolutional autoencoder that addresses the highly challenging problem of reconstructing a 3D human face from a single in-the-wild color image.

Face Reconstruction Monocular Reconstruction

Paper
Add Code

InverseFaceNet: Deep Monocular Inverse Face Rendering

no code implementations • CVPR 2018 • Hyeongwoo Kim, Michael Zollhöfer, Ayush Tewari, Justus Thies, Christian Richardt, Christian Theobalt

In contrast, we propose to recover high-quality facial pose, shape, expression, reflectance and illumination using a deep neural network that is trained using a large, synthetically created training corpus.

Face Reconstruction Inverse Rendering

Paper
Add Code

Real-time Hand Tracking under Occlusion from an Egocentric RGB-D Sensor

no code implementations • ICCV 2017 • Franziska Mueller, Dushyant Mehta, Oleksandr Sotnychenko, Srinath Sridhar, Dan Casas, Christian Theobalt

We present an approach for real-time, robust and accurate hand pose estimation from moving egocentric RGB-D cameras in cluttered real environments.

Hand Pose Estimation Pose Tracking +1

Paper
Add Code

VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera

1 code implementation • 3 May 2017 • Dushyant Mehta, Srinath Sridhar, Oleksandr Sotnychenko, Helge Rhodin, Mohammad Shafiei, Hans-Peter Seidel, Weipeng Xu, Dan Casas, Christian Theobalt

A real-time kinematic skeleton fitting method uses the CNN output to yield temporally stable 3D global pose reconstructions on the basis of a coherent kinematic skeleton.

Ranked #16 on Pose Estimation on Leeds Sports Poses

3D Human Pose Estimation

241

Paper
Code

Criteria Sliders: Learning Continuous Database Criteria via Interactive Ranking

no code implementations • 12 Jun 2017 • James Tompkin, Kwang In Kim, Hanspeter Pfister, Christian Theobalt

Large databases are often organized by hand-labeled metadata, or criteria, which are expensive to collect.

Paper
Add Code

MonoPerfCap: Human Performance Capture from Monocular Video

no code implementations • 7 Aug 2017 • Weipeng Xu, Avishek Chatterjee, Michael Zollhöfer, Helge Rhodin, Dushyant Mehta, Hans-Peter Seidel, Christian Theobalt

Reconstruction from monocular video alone is drastically more challenging, since strong occlusions and the inherent depth ambiguity lead to a highly ill-posed reconstruction problem.

Monocular Reconstruction Pose Estimation +1

Paper
Add Code

HandSeg: An Automatically Labeled Dataset for Hand Segmentation from Depth Images

no code implementations • 16 Nov 2017 • Abhishake Kumar Bojja, Franziska Mueller, Sri Raghu Malireddi, Markus Oberweger, Vincent Lepetit, Christian Theobalt, Kwang Moo Yi, Andrea Tagliasacchi

We propose an automatic method for generating high-quality annotations for depth-based hand segmentation, and introduce a large-scale hand segmentation dataset.

Data Augmentation Hand Segmentation +1

Paper
Add Code

DS*: Tighter Lifting-Free Convex Relaxations for Quadratic Matching Problems

no code implementations • CVPR 2018 • Florian Bernard, Christian Theobalt, Michael Moeller

In this work we study convex relaxations of quadratic optimisation problems over permutation matrices.

Graph Matching

Paper
Add Code

GANerated Hands for Real-time 3D Hand Tracking from Monocular RGB

no code implementations • CVPR 2018 • Franziska Mueller, Florian Bernard, Oleksandr Sotnychenko, Dushyant Mehta, Srinath Sridhar, Dan Casas, Christian Theobalt

We address the highly challenging problem of real-time 3D hand tracking based on a monocular RGB-only sequence.

Image-to-Image Translation Translation

Paper
Add Code

Self-supervised Multi-level Face Model Learning for Monocular Reconstruction at over 250 Hz

no code implementations • CVPR 2018 • Ayush Tewari, Michael Zollhöfer, Pablo Garrido, Florian Bernard, Hyeongwoo Kim, Patrick Pérez, Christian Theobalt

To alleviate this problem, we present the first approach that jointly learns 1) a regressor for face shape, expression, reflectance and illumination on the basis of 2) a concurrently learned parametric face model.

Face Model Monocular Reconstruction

Paper
Add Code

Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB

6 code implementations • 9 Dec 2017 • Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Srinath Sridhar, Gerard Pons-Moll, Christian Theobalt

Our approach uses novel occlusion-robust pose-maps (ORPM) which enable full body pose inference even under strong partial occlusions by other people and objects in the scene.

Ranked #3 on 3D Multi-Person Pose Estimation (root-relative) on MuPoTS-3D (MPJPE metric)

3D Human Pose Estimation 3D Multi-Person Pose Estimation (absolute) +2

634

Paper
Code

LIME: Live Intrinsic Material Estimation

no code implementations • CVPR 2018 • Abhimitra Meka, Maxim Maximov, Michael Zollhoefer, Avishek Chatterjee, Hans-Peter Seidel, Christian Richardt, Christian Theobalt

We present the first end to end approach for real time material estimation for general object shapes with uniform material that only requires a single color image as input.

Foreground Segmentation Image-to-Image Translation +3

Paper
Add Code

Video Based Reconstruction of 3D People Models

1 code implementation • CVPR 2018 • Thiemo Alldieck, Marcus Magnor, Weipeng Xu, Christian Theobalt, Gerard Pons-Moll

This paper describes how to obtain accurate 3D body models and texture of arbitrary people from a single, monocular video in which a person is moving.

3D Reconstruction Surface Reconstruction +1

643

Paper
Code

Mo2Cap2: Real-time Mobile 3D Motion Capture with a Cap-mounted Fisheye Camera

no code implementations • 15 Mar 2018 • Weipeng Xu, Avishek Chatterjee, Michael Zollhoefer, Helge Rhodin, Pascal Fua, Hans-Peter Seidel, Christian Theobalt

We tackle these challenges based on a novel lightweight setup that converts a standard baseball cap to a device for high-quality pose estimation based on a single cap-mounted fisheye camera.

Ranked #6 on Egocentric Pose Estimation on GlobalEgoMocap Test Dataset (using extra training data)

3D Pose Estimation Egocentric Pose Estimation

Paper
Add Code

Synchronisation of Partial Multi-Matchings via Non-negative Factorisations

no code implementations • 16 Mar 2018 • Florian Bernard, Johan Thunberg, Jorge Goncalves, Christian Theobalt

In order to deal with the inherent non-convexity of the permutation synchronisation problem, we use an initialisation procedure based on a novel rotation scheme applied to the solution of the spectral relaxation.

Clustering

Paper
Add Code

A Hybrid Model for Identity Obfuscation by Face Replacement

no code implementations • ECCV 2018 • Qianru Sun, Ayush Tewari, Weipeng Xu, Mario Fritz, Christian Theobalt, Bernt Schiele

As more and more personal photos are shared and tagged in social media, avoiding privacy risks such as unintended recognition becomes increasingly challenging.

Face Generation

Paper
Add Code

HeadOn: Real-time Reenactment of Human Portrait Videos

no code implementations • 29 May 2018 • Justus Thies, Michael Zollhöfer, Christian Theobalt, Marc Stamminger, Matthias Nießner

We propose HeadOn, the first real-time source-to-target reenactment approach for complete human portrait videos that enables transfer of torso and head motion, face expression, and eye gaze.

Paper
Add Code

Deep Video Portraits

no code implementations • 29 May 2018 • Hyeongwoo Kim, Pablo Garrido, Ayush Tewari, Weipeng Xu, Justus Thies, Matthias Nießner, Patrick Pérez, Christian Richardt, Michael Zollhöfer, Christian Theobalt

In order to enable source-to-target video re-animation, we render a synthetic target video with the reconstructed head animation parameters from a source video, and feed it into the trained network -- thus taking full control of the target.

Face Model

Paper
Add Code

Detailed Human Avatars from Monocular Video

1 code implementation • 3 Aug 2018 • Thiemo Alldieck, Marcus Magnor, Weipeng Xu, Christian Theobalt, Gerard Pons-Moll

We present a novel method for high detail-preserving human avatar creation from monocular video.

154

Paper
Code

Neural Rendering and Reenactment of Human Actor Videos

no code implementations • 11 Sep 2018 • Lingjie Liu, Weipeng Xu, Michael Zollhoefer, Hyeongwoo Kim, Florian Bernard, Marc Habermann, Wenping Wang, Christian Theobalt

In contrast to conventional human character rendering, we do not require the availability of a production-quality photo-realistic 3D model of the human, but instead rely on a video sequence in conjunction with a (medium-quality) controllable 3D template model of the person.

Generative Adversarial Network Image Generation +1

Paper
Add Code

LiveCap: Real-time Human Performance Capture from Monocular Video

no code implementations • 5 Oct 2018 • Marc Habermann, Weipeng Xu, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt

Our method is the first real-time monocular approach for full-body performance capture.

Paper
Add Code

Higher-order Projected Power Iterations for Scalable Multi-Matching

no code implementations • 26 Nov 2018 • Florian Bernard, Johan Thunberg, Paul Swoboda, Christian Theobalt

The matching of multiple objects (e. g. shapes or images) is a fundamental problem in vision and graphics.

Paper
Add Code

IGNOR: Image-guided Neural Object Rendering

no code implementations • 26 Nov 2018 • Justus Thies, Michael Zollhöfer, Christian Theobalt, Marc Stamminger, Matthias Nießner

Based on this 3D proxy, the appearance of a captured view can be warped into a new target view as in classical image-based rendering.

Image Generation Novel View Synthesis +1

Paper
Add Code

On Implicit Filter Level Sparsity in Convolutional Neural Networks

no code implementations • CVPR 2019 • Dushyant Mehta, Kwang In Kim, Christian Theobalt

We investigate filter level sparsity that emerges in convolutional neural networks (CNNs) which employ Batch Normalization and ReLU activation, and are trained with adaptive gradient descent techniques and L2 regularization or weight decay.

L2 Regularization

Paper
Add Code

FML: Face Model Learning from Videos

no code implementations • CVPR 2019 • Ayush Tewari, Florian Bernard, Pablo Garrido, Gaurav Bharaj, Mohamed Elgharib, Hans-Peter Seidel, Patrick Pérez, Michael Zollhöfer, Christian Theobalt

In contrast, we propose multi-frame video-based self-supervised training of a deep network that (i) learns a face identity model both in shape and appearance while (ii) jointly learning to reconstruct 3D faces.

3D Reconstruction Face Model

Paper
Add Code

Learning a Disentangled Embedding for Monocular 3D Shape Retrieval and Pose Estimation

no code implementations • 24 Dec 2018 • Kyaw Zaw Lin, Weipeng Xu, Qianru Sun, Christian Theobalt, Tat-Seng Chua

We propose a novel approach to jointly perform 3D shape retrieval and pose estimation from monocular images. In order to make the method robust to real-world image variations, e. g. complex textures and backgrounds, we learn an embedding space from 3D data that only includes the relevant information, namely the shape and pose.

3D Object Retrieval 3D Shape Classification +3

Paper
Add Code

Learning to Reconstruct People in Clothing from a Single RGB Camera

1 code implementation • CVPR 2019 • Thiemo Alldieck, Marcus Magnor, Bharat Lal Bhatnagar, Christian Theobalt, Gerard Pons-Moll

We present a learning-based model to infer the personalized 3D shape of people from a few frames (1-8) of a monocular video in which the person is moving, in less than 10 seconds with a reconstruction accuracy of 5mm.

244

Paper
Code

In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations

no code implementations • CVPR 2019 • Ikhsanul Habibie, Weipeng Xu, Dushyant Mehta, Gerard Pons-Moll, Christian Theobalt

Convolutional Neural Network based approaches for monocular 3D human pose estimation usually require a large amount of training images with 3D pose annotations.

3D Pose Estimation Monocular 3D Human Pose Estimation

Paper
Add Code

Tex2Shape: Detailed Full Human Body Geometry From a Single Image

1 code implementation • ICCV 2019 • Thiemo Alldieck, Gerard Pons-Moll, Christian Theobalt, Marcus Magnor

From a partial texture, we estimate detailed normal and vector displacement maps, which can be applied to a low-resolution smooth body model to add detail and clothing.

Image-to-Image Translation Translation

269

Paper
Code

IsMo-GAN: Adversarial Learning for Monocular Non-Rigid 3D Reconstruction

no code implementations • 27 Apr 2019 • Soshi Shimada, Vladislav Golyanik, Christian Theobalt, Didier Stricker

The majority of the existing methods for non-rigid 3D surface regression from monocular 2D images require an object template or point tracks over multiple frames as an input, and are still far from real-time processing rates.

3D Reconstruction Generative Adversarial Network

Paper
Add Code

Implicit Filter Sparsification In Convolutional Neural Networks

no code implementations • 13 May 2019 • Dushyant Mehta, Kwang In Kim, Christian Theobalt

We show implicit filter level sparsity manifests in convolutional neural networks (CNNs) which employ Batch Normalization and ReLU activation, and are trained with adaptive gradient descent techniques and L2 regularization or weight decay.

L2 Regularization

Paper
Add Code

Emergence of Implicit Filter Sparsity in Convolutional Neural Networks

no code implementations • ICML Workshop Deep_Phenomen 2019 • Dushyant Mehta, Kwang In Kim, Christian Theobalt

We show implicit filter level sparsity manifests in convolutional neural networks (CNNs) which employ Batch Normalization and ReLU activation, and are trained using adaptive gradient descent techniques with L2 regularization or weight decay.

L2 Regularization

Paper
Add Code

DEMEA: Deep Mesh Autoencoders for Non-Rigidly Deforming Objects

no code implementations • ECCV 2020 • Edgar Tretschk, Ayush Tewari, Michael Zollhöfer, Vladislav Golyanik, Christian Theobalt

Mesh autoencoders are commonly used for dimensionality reduction, sampling and mesh modeling.

3D Reconstruction Dimensionality Reduction

Paper
Add Code

EgoFace: Egocentric Face Performance Capture and Videorealistic Reenactment

no code implementations • 26 May 2019 • Mohamed Elgharib, Mallikarjun BR, Ayush Tewari, Hyeongwoo Kim, Wentao Liu, Hans-Peter Seidel, Christian Theobalt

Our lightweight setup allows operations in uncontrolled environments, and lends itself to telepresence applications such as video-conferencing from dynamic environments.

Paper
Add Code

Text-based Editing of Talking-head Video

1 code implementation • 4 Jun 2019 • Ohad Fried, Ayush Tewari, Michael Zollhöfer, Adam Finkelstein, Eli Shechtman, Dan B. Goldman, Kyle Genova, Zeyu Jin, Christian Theobalt, Maneesh Agrawala

To edit a video, the user has to only edit the transcript, and an optimization strategy then chooses segments of the input corpus as base material.

Face Model Sentence +3

408

Paper
Code

XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera

4 code implementations • 1 Jul 2019 • Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Mohamed Elgharib, Pascal Fua, Hans-Peter Seidel, Helge Rhodin, Gerard Pons-Moll, Christian Theobalt

The first stage is a convolutional neural network (CNN) that estimates 2D and 3D pose features along with identity assignments for all visible joints of all individuals. We contribute a new architecture for this CNN, called SelecSLS Net, that uses novel selective long and short range skip connections to improve the information flow allowing for a drastically faster network without compromising accuracy.

Ranked #7 on 3D Multi-Person Pose Estimation on MuPoTS-3D

3D Multi-Person Human Pose Estimation 3D Multi-Person Pose Estimation +1

29,624

Paper
Code

DispVoxNets: Non-Rigid Point Set Alignment with Supervised Learning Proxies

no code implementations • 24 Jul 2019 • Soshi Shimada, Vladislav Golyanik, Edgar Tretschk, Didier Stricker, Christian Theobalt

We introduce a supervised-learning framework for non-rigid point set alignment of a new kind - Displacements on Voxels Networks (DispVoxNets) - which abstracts away from the point set representation and regresses 3D displacement fields on regularly sampled proxy 3D voxel grids.

Paper
Add Code

Real-Time Global Illumination Decomposition of Videos

no code implementations • 6 Aug 2019 • Abhimitra Meka, Mohammad Shafiei, Michael Zollhoefer, Christian Richardt, Christian Theobalt

We propose the first approach for the decomposition of a monocular color video into direct and indirect illumination components in real time.

Paper
Add Code

Multi-Garment Net: Learning to Dress 3D People from Images

6 code implementations • ICCV 2019 • Bharat Lal Bhatnagar, Garvita Tiwari, Christian Theobalt, Gerard Pons-Moll

We present Multi-Garment Network (MGN), a method to predict body shape and clothing, layered on top of the SMPL model from a few frames (1-8) of a video.

3D Human Pose Estimation 3D Shape Reconstruction From A Single 2D Image

277

Paper
Code

EventCap: Monocular 3D Capture of High-Speed Human Motions using an Event Camera

no code implementations • CVPR 2020 • Lan Xu, Weipeng Xu, Vladislav Golyanik, Marc Habermann, Lu Fang, Christian Theobalt

The high frame rate is a critical requirement for capturing fast human motions.

Paper
Add Code

3D Morphable Face Models -- Past, Present and Future

1 code implementation • 3 Sep 2019 • Bernhard Egger, William A. P. Smith, Ayush Tewari, Stefanie Wuhrer, Michael Zollhoefer, Thabo Beeler, Florian Bernard, Timo Bolkart, Adam Kortylewski, Sami Romdhani, Christian Theobalt, Volker Blanz, Thomas Vetter

In this paper, we provide a detailed survey of 3D Morphable Face Models over the 20 years since they were first proposed.

862

Paper
Code

Intrinsic Dynamic Shape Prior for Fast, Sequential and Dense Non-Rigid Structure from Motion with Detection of Temporally-Disjoint Rigidity

no code implementations • 5 Sep 2019 • Vladislav Golyanik, André Jonas, Didier Stricker, Christian Theobalt

The reasons for the slow dissemination are the severe ill-posedness, high sensitivity to motion and deformation cues and the difficulty to obtain reliable point tracks in the vast majority of practical scenarios.

Paper
Add Code

Neural Style-Preserving Visual Dubbing

no code implementations • 5 Sep 2019 • Hyeongwoo Kim, Mohamed Elgharib, Michael Zollhöfer, Hans-Peter Seidel, Thabo Beeler, Christian Richardt, Christian Theobalt

We present a style-preserving visual dubbing approach from single video inputs, which maintains the signature style of target actors when modifying facial expressions, including mouth motions, to match foreign languages.

Generative Adversarial Network

Paper
Add Code

Convex Optimisation for Inverse Kinematics

no code implementations • 24 Oct 2019 • Tarun Yenamandra, Florian Bernard, Jiayi Wang, Franziska Mueller, Christian Theobalt

We consider the problem of inverse kinematics (IK), where one wants to find the parameters of a given kinematic skeleton that best explain a set of observed 3D joint locations.

Paper
Add Code

DeepDeform: Learning Non-rigid RGB-D Reconstruction with Semi-supervised Data

1 code implementation • 9 Dec 2019 • Aljaž Božič, Michael Zollhöfer, Christian Theobalt, Matthias Nießner

Applying data-driven approaches to non-rigid 3D reconstruction has been difficult, which we believe can be attributed to the lack of a large-scale training corpus.

3D Reconstruction RGB-D Reconstruction

181

Paper
Code

Neural Voice Puppetry: Audio-driven Facial Reenactment

1 code implementation • ECCV 2020 • Justus Thies, Mohamed Elgharib, Ayush Tewari, Christian Theobalt, Matthias Nießner

Neural Voice Puppetry has a variety of use-cases, including audio-driven video avatars, video dubbing, and text-driven video synthesis of a talking head.

Face Model Neural Rendering +2

Paper
Code

A Quantum Computational Approach to Correspondence Problems on Point Sets

no code implementations • CVPR 2020 • Vladislav Golyanik, Christian Theobalt

Modern adiabatic quantum computers (AQC) are already used to solve difficult combinatorial optimisation problems in various domains of science.

Paper
Add Code

Image-guided Neural Object Rendering

no code implementations • ICLR 2020 • Justus Thies, Michael Zollhöfer, Christian Theobalt, Marc Stamminger, Matthias Nießner

Based on this 3D proxy, the appearance of a captured view can be warped into a new target view as in classical image-based rendering.

Image Generation Object

Paper
Add Code

Neural Human Video Rendering by Learning Dynamic Textures and Rendering-to-Video Translation

no code implementations • 14 Jan 2020 • Lingjie Liu, Weipeng Xu, Marc Habermann, Michael Zollhoefer, Florian Bernard, Hyeongwoo Kim, Wenping Wang, Christian Theobalt

In this paper, we propose a novel human video synthesis method that approaches these limiting factors by explicitly disentangling the learning of time-coherent fine-scale details from the embedding of the human in 2D screen space.

Image-to-Image Translation Novel View Synthesis +1

Paper
Add Code

MINA: Convex Mixed-Integer Programming for Non-Rigid Shape Alignment

no code implementations • CVPR 2020 • Florian Bernard, Zeeshan Khan Suri, Christian Theobalt

We present a convex mixed-integer programming formulation for non-rigid shape matching.

Paper
Add Code

DeepCap: Monocular Human Performance Capture Using Weak Supervision

no code implementations • CVPR 2020 • Marc Habermann, Weipeng Xu, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt

Human performance capture is a highly important computer vision problem with many applications in movie production and virtual/augmented reality.

Pose Estimation

Paper
Add Code

Monocular Real-time Hand Shape and Motion Capture using Multi-modal Data

2 code implementations • CVPR 2020 • Yuxiao Zhou, Marc Habermann, Weipeng Xu, Ikhsanul Habibie, Christian Theobalt, Feng Xu

We present a novel method for monocular hand shape and pose estimation at unprecedented runtime performance of 100fps and at state-of-the-art accuracy.

Pose Estimation

937

Paper
Code

StyleRig: Rigging StyleGAN for 3D Control over Portrait Images

no code implementations • CVPR 2020 • Ayush Tewari, Mohamed Elgharib, Gaurav Bharaj, Florian Bernard, Hans-Peter Seidel, Patrick Pérez, Michael Zollhöfer, Christian Theobalt

StyleGAN generates photorealistic portrait images of faces with eyes, teeth, hair and context (neck, shoulders, background), but lacks a rig-like control over semantic face parameters that are interpretable in 3D, such as face pose, expressions, and scene illumination.

Paper
Add Code

Occlusion-Aware Depth Estimation with Adaptive Normal Constraints

1 code implementation • ECCV 2020 • Xiaoxiao Long, Lingjie Liu, Christian Theobalt, Wenping Wang

We present a new learning-based method for multi-frame depth estimation from a color video, which is a fundamental problem in scene understanding, robot navigation or handheld 3D reconstruction.

3D Reconstruction Depth Estimation +2

Paper
Code

HandVoxNet: Deep Voxel-Based Network for 3D Hand Shape and Pose Estimation from a Single Depth Map

no code implementations • CVPR 2020 • Jameel Malik, Ibrahim Abdelaziz, Ahmed Elhayek, Soshi Shimada, Sk Aziz Ali, Vladislav Golyanik, Christian Theobalt, Didier Stricker

The input to our method is a 3D voxelized depth map, and we rely on two hand shape representations.

3D Hand Pose Estimation

Paper
Add Code

State of the Art on Neural Rendering

no code implementations • 8 Apr 2020 • Ayush Tewari, Ohad Fried, Justus Thies, Vincent Sitzmann, Stephen Lombardi, Kalyan Sunkavalli, Ricardo Martin-Brualla, Tomas Simon, Jason Saragih, Matthias Nießner, Rohit Pandey, Sean Fanello, Gordon Wetzstein, Jun-Yan Zhu, Christian Theobalt, Maneesh Agrawala, Eli Shechtman, Dan B. Goldman, Michael Zollhöfer

Neural rendering is a new and rapidly emerging field that combines generative machine learning techniques with physical knowledge from computer graphics, e. g., by the integration of differentiable rendering into network training.

BIG-bench Machine Learning Image Generation +2

Paper
Add Code

Vid2Curve: Simultaneous Camera Motion Estimation and Thin Structure Reconstruction from an RGB Video

1 code implementation • 7 May 2020 • Peng Wang, Lingjie Liu, Nenglun Chen, Hung-Kuo Chu, Christian Theobalt, Wenping Wang

We propose the first approach that simultaneously estimates camera motion and reconstructs the geometry of complex 3D thin structures in high quality from a color video captured by a handheld camera.

Motion Estimation Occlusion Handling +1

193

Paper
Code

VideoForensicsHQ: Detecting High-quality Manipulated Face Videos

no code implementations • 20 May 2020 • Gereon Fox, Wentao Liu, Hyeongwoo Kim, Hans-Peter Seidel, Mohamed Elgharib, Christian Theobalt

We introduce a new benchmark dataset for face video forgery detection, of unprecedented quality.

Vocal Bursts Intensity Prediction

Paper
Add Code

Generative Model-Based Loss to the Rescue: A Method to Overcome Annotation Errors for Depth-Based Hand Pose Estimation

no code implementations • 6 Jul 2020 • Jiayi Wang, Franziska Mueller, Florian Bernard, Christian Theobalt

We propose to use a model-based generative loss for training hand pose estimators on depth images based on a volumetric hand model.

Hand Pose Estimation

Paper
Add Code

Combining Implicit Function Learning and Parametric Models for 3D Human Reconstruction

1 code implementation • ECCV 2020 • Bharat Lal Bhatnagar, Cristian Sminchisescu, Christian Theobalt, Gerard Pons-Moll

In this work, we present methodology that combines detail-rich implicit functions and parametric representations in order to reconstruct 3D models of people that remain controllable and accurate even in the presence of clothing.

3D Human Pose Estimation 3D Human Reconstruction

219

Paper
Code

Neural Sparse Voxel Fields

1 code implementation • NeurIPS 2020 • Lingjie Liu, Jiatao Gu, Kyaw Zaw Lin, Tat-Seng Chua, Christian Theobalt

We also demonstrate several challenging tasks, including multi-scene learning, free-viewpoint rendering of a moving human, and large-scale scene rendering.

783

Paper
Code

Face2Face: Real-time Face Capture and Reenactment of RGB Videos

2 code implementations • CVPR 2016 • Justus Thies, Michael Zollhöfer, Marc Stamminger, Christian Theobalt, Matthias Nießner

Our goal is to animate the facial expressions of the target video by a source actor and re-render the manipulated output video in a photo-realistic fashion.

Paper
Code

PatchNets: Patch-Based Generalizable Deep Implicit 3D Shape Representations

no code implementations • ECCV 2020 • Edgar Tretschk, Ayush Tewari, Vladislav Golyanik, Michael Zollhöfer, Carsten Stoll, Christian Theobalt

At the level of patches, objects across different categories share similarities, which leads to more generalizable models.

Point Cloud Completion

Paper
Add Code

PhysCap: Physically Plausible Monocular 3D Motion Capture in Real Time

no code implementations • 20 Aug 2020 • Soshi Shimada, Vladislav Golyanik, Weipeng Xu, Christian Theobalt

We, therefore, present PhysCap, the first algorithm for physically plausible, real-time and marker-less human 3D motion capture with a single colour camera at 25 fps.

Paper
Add Code

Monocular Reconstruction of Neural Face Reflectance Fields

no code implementations • CVPR 2021 • Mallikarjun B R., Ayush Tewari, Tae-Hyun Oh, Tim Weyrich, Bernd Bickel, Hans-Peter Seidel, Hanspeter Pfister, Wojciech Matusik, Mohamed Elgharib, Christian Theobalt

The reflectance field of a face describes the reflectance properties responsible for complex lighting effects including diffuse, specular, inter-reflection and self shadowing.

Monocular Reconstruction

Paper
Add Code

PIE: Portrait Image Embedding for Semantic Control

no code implementations • 20 Sep 2020 • Ayush Tewari, Mohamed Elgharib, Mallikarjun B R., Florian Bernard, Hans-Peter Seidel, Patrick Pérez, Michael Zollhöfer, Christian Theobalt

We present the first approach for embedding real portrait images in the latent space of StyleGAN, which allows for intuitive editing of the head pose, facial expression, and scene illumination in the image.

Face Model

Paper
Add Code

Fast Gravitational Approach for Rigid Point Set Registration with Ordinary Differential Equations

no code implementations • 28 Sep 2020 • Sk Aziz Ali, Kerem Kahraman, Christian Theobalt, Didier Stricker, Vladislav Golyanik

This article introduces a new physics-based method for rigid point set alignment called Fast Gravitational Approach (FGA).

Paper
Add Code

Learning Complete 3D Morphable Face Models from Images and Videos

no code implementations • CVPR 2021 • Mallikarjun B R, Ayush Tewari, Hans-Peter Seidel, Mohamed Elgharib, Christian Theobalt

Our network design and loss functions ensure a disentangled parameterization of not only identity and albedo, but also, for the first time, an expression basis.

3D Face Reconstruction Monocular Reconstruction +1

Paper
Add Code

LoopReg: Self-supervised Learning of Implicit Surface Correspondences, Pose and Shape for 3D Human Mesh Registration

no code implementations • NeurIPS 2020 • Bharat Lal Bhatnagar, Cristian Sminchisescu, Christian Theobalt, Gerard Pons-Moll

Formulating this closed loop is not straightforward because it is not trivial to force the output of the NN to be on the surface of the human model - outside this surface the human model is not even defined.

Self-Supervised Learning

Paper
Add Code

Deep Physics-aware Inference of Cloth Deformation for Monocular Human Performance Capture

no code implementations • 25 Nov 2020 • Yue Li, Marc Habermann, Bernhard Thomaszewski, Stelian Coros, Thabo Beeler, Christian Theobalt

Recent monocular human performance capture approaches have shown compelling dense tracking results of the full body from a single RGB camera.

Paper
Add Code

Multi-view Depth Estimation using Epipolar Spatio-Temporal Networks

1 code implementation • CVPR 2021 • Xiaoxiao Long, Lingjie Liu, Wei Li, Christian Theobalt, Wenping Wang

We present a novel method for multi-view depth estimation from a single video, which is a critical task in various applications, such as perception, reconstruction and robot navigation.

Depth Estimation Robot Navigation

Paper
Code

i3DMM: Deep Implicit 3D Morphable Model of Human Heads

1 code implementation • CVPR 2021 • Tarun Yenamandra, Ayush Tewari, Florian Bernard, Hans-Peter Seidel, Mohamed Elgharib, Daniel Cremers, Christian Theobalt

Our approach has the following favorable properties: (i) It is the first full head morphable model that includes hair.

Paper
Code

Pose-Guided Human Animation from a Single Image in the Wild

no code implementations • CVPR 2021 • Jae Shin Yoon, Lingjie Liu, Vladislav Golyanik, Kripasindhu Sarkar, Hyun Soo Park, Christian Theobalt

We present a new pose transfer method for synthesizing a human animation from a single image of a person controlled by a sequence of body poses.

Pose Transfer

Paper
Add Code

Monocular Real-time Full Body Capture with Inter-part Correlations

no code implementations • CVPR 2021 • Yuxiao Zhou, Marc Habermann, Ikhsanul Habibie, Ayush Tewari, Christian Theobalt, Feng Xu

We present the first method for real-time full body capture that estimates shape and motion of body and hands together with a dynamic 3D face model from a single color image.

Ranked #11 on 3D Hand Pose Estimation on FreiHAND

3D Hand Pose Estimation Computational Efficiency +1

Paper
Add Code

EventHands: Real-Time Neural 3D Hand Pose Estimation from an Event Stream

1 code implementation • ICCV 2021 • Viktor Rudnev, Vladislav Golyanik, Jiayi Wang, Hans-Peter Seidel, Franziska Mueller, Mohamed Elgharib, Christian Theobalt

Due to the different data modality of event cameras compared to classical cameras, existing methods cannot be directly applied to and re-trained for event streams.

3D Hand Pose Estimation

Paper
Code

High-Fidelity Neural Human Motion Transfer from Monocular Video

1 code implementation • CVPR 2021 • Moritz Kappel, Vladislav Golyanik, Mohamed Elgharib, Jann-Ole Henningson, Hans-Peter Seidel, Susana Castillo, Christian Theobalt, Marcus Magnor

We address these limitations for the first time in the literature and present a new framework which performs high-fidelity and temporally-consistent human motion transfer with natural pose-dependent non-rigid deformations, for several types of loose garments.

Image Generation Vocal Bursts Intensity Prediction

Paper
Code

Non-Rigid Neural Radiance Fields: Reconstruction and Novel View Synthesis of a Dynamic Scene From Monocular Video

2 code implementations • ICCV 2021 • Edgar Tretschk, Ayush Tewari, Vladislav Golyanik, Michael Zollhöfer, Christoph Lassner, Christian Theobalt

We show that a single handheld consumer-grade camera is sufficient to synthesize sophisticated renderings of a dynamic scene from novel virtual camera views, e. g. a `bullet-time' video effect.

Novel View Synthesis Video Editing

353

Paper
Code

Neural Re-Rendering of Humans from a Single Image

no code implementations • ECCV 2020 • Kripasindhu Sarkar, Dushyant Mehta, Weipeng Xu, Vladislav Golyanik, Christian Theobalt

Human re-rendering from a single image is a starkly under-constrained problem, and state-of-the-art algorithms often exhibit undesired artefacts, such as over-smoothing, unrealistic distortions of the body parts and garments, or implausible changes of the texture.

Translation

Paper
Add Code

Quantum Permutation Synchronization

no code implementations • CVPR 2021 • Tolga Birdal, Vladislav Golyanik, Christian Theobalt, Leonidas Guibas

We present QuantumSync, the first quantum algorithm for solving a synchronization problem in the context of computer vision.

Paper
Add Code

Learning Speech-driven 3D Conversational Gestures from Video

no code implementations • 13 Feb 2021 • Ikhsanul Habibie, Weipeng Xu, Dushyant Mehta, Lingjie Liu, Hans-Peter Seidel, Gerard Pons-Moll, Mohamed Elgharib, Christian Theobalt

We propose the first approach to automatically and jointly synthesize both the synchronous 3D conversational body and hand gestures, as well as 3D face and head animations, of a virtual character from speech input.

Ranked #4 on Gesture Generation on BEAT2

3D Face Animation Generative Adversarial Network +2

Paper
Add Code

Style and Pose Control for Image Synthesis of Humans from a Single Monocular View

no code implementations • 22 Feb 2021 • Kripasindhu Sarkar, Vladislav Golyanik, Lingjie Liu, Christian Theobalt

Photo-realistic re-rendering of a human from a single image with explicit control over body pose, shape and appearance enables a wide range of applications, such as human appearance transfer, virtual try-on, motion imitation, and novel view synthesis.

Image Generation Novel View Synthesis +1

Paper
Add Code

HumanGAN: A Generative Model of Humans Images

no code implementations • 11 Mar 2021 • Kripasindhu Sarkar, Lingjie Liu, Vladislav Golyanik, Christian Theobalt

We address these limitations and present a generative model for images of dressed humans offering control over pose, local body part appearance and garment style.

Pose Transfer

Paper
Add Code

PhotoApp: Photorealistic Appearance Editing of Head Portraits

1 code implementation • 13 Mar 2021 • Mallikarjun B R, Ayush Tewari, Abdallah Dib, Tim Weyrich, Bernd Bickel, Hans-Peter Seidel, Hanspeter Pfister, Wojciech Matusik, Louis Chevallier, Mohamed Elgharib, Christian Theobalt

We present an approach for high-quality intuitive editing of the camera viewpoint and scene illumination in a portrait image.

Paper
Code

Synthesis of Compositional Animations from Textual Descriptions

1 code implementation • ICCV 2021 • Anindita Ghosh, Noshaba Cheema, Cennet Oguz, Christian Theobalt, Philipp Slusallek

Our model can generate plausible pose sequences for short sentences describing single actions as well as long compositional sentences describing multiple sequential and superimposed actions.

Motion Synthesis Sentence

Paper
Code

Adaptive Surface Normal Constraint for Depth Estimation

1 code implementation • ICCV 2021 • Xiaoxiao Long, Cheng Lin, Lingjie Liu, Wei Li, Christian Theobalt, Ruigang Yang, Wenping Wang

We present a novel method for single image depth estimation using surface normal constraints.

Depth Estimation

Paper
Code

Towards High Fidelity Monocular Face Reconstruction with Rich Reflectance using Self-supervised Learning and Ray Tracing

1 code implementation • ICCV 2021 • Abdallah Dib, Cedric Thebault, Junghyun Ahn, Philippe-Henri Gosselin, Christian Theobalt, Louis Chevallier

In this paper, we build our work on the aforementioned approaches and propose a new method that greatly improves reconstruction quality and robustness in general scenes.

Ranked #12 on 3D Face Reconstruction on NoW Benchmark

3D Face Reconstruction Monocular Reconstruction +1

Paper
Code

Efficient and Differentiable Shadow Computation for Inverse Problems

no code implementations • ICCV 2021 • Linjie Lyu, Marc Habermann, Lingjie Liu, Mallikarjun B R, Ayush Tewari, Christian Theobalt

Differentiable rendering has received increasing interest for image-based inverse problems.

Paper
Add Code

Estimating Egocentric 3D Human Pose in Global Space

1 code implementation • ICCV 2021 • Jian Wang, Lingjie Liu, Weipeng Xu, Kripasindhu Sarkar, Christian Theobalt

Furthermore, these methods suffer from limited accuracy and temporal instability due to ambiguities caused by the monocular setup and the severe occlusion in a strongly distorted egocentric perspective.

Ranked #4 on Egocentric Pose Estimation on SceneEgo (using extra training data)

Egocentric Pose Estimation

Paper
Code

Differentiable Event Stream Simulator for Non-Rigid 3D Tracking

no code implementations • 30 Apr 2021 • Jalees Nehvi, Vladislav Golyanik, Franziska Mueller, Hans-Peter Seidel, Mohamed Elgharib, Christian Theobalt

This paper introduces the first differentiable simulator of event streams, i. e., streams of asynchronous brightness change signals recorded by event cameras.

Paper
Add Code

Neural Monocular 3D Human Motion Capture with Physical Awareness

no code implementations • 3 May 2021 • Soshi Shimada, Vladislav Golyanik, Weipeng Xu, Patrick Pérez, Christian Theobalt

We present a new trainable system for physically plausible markerless 3D human motion capture, which achieves state-of-the-art results in a broad range of challenging scenarios.

3D Pose Estimation

Paper
Add Code

Real-time Deep Dynamic Characters

no code implementations • 4 May 2021 • Marc Habermann, Lingjie Liu, Weipeng Xu, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt

We propose a deep videorealistic 3D human character model displaying highly realistic shape, motion, and dynamic appearance learned in a new weakly supervised way from multi-view imagery.

Paper
Add Code

Q-Match: Iterative Shape Matching via Quantum Annealing

no code implementations • ICCV 2021 • Marcel Seelbach Benkner, Zorah Lähner, Vladislav Golyanik, Christof Wunderlich, Christian Theobalt, Michael Moeller

Finding shape correspondences can be formulated as an NP-hard quadratic assignment problem (QAP) that becomes infeasible for shapes with high sampling density.

Paper
Add Code

Neural Actor: Neural Free-view Synthesis of Human Actors with Pose Control

no code implementations • 3 Jun 2021 • Lingjie Liu, Marc Habermann, Viktor Rudnev, Kripasindhu Sarkar, Jiatao Gu, Christian Theobalt

To address this problem, we utilize a coarse body model as the proxy to unwarp the surrounding 3D space into a canonical pose.

Paper
Add Code

Real-time Pose and Shape Reconstruction of Two Interacting Hands With a Single Depth Camera

no code implementations • 15 Jun 2021 • Franziska Mueller, Micah Davis, Florian Bernard, Oleksandr Sotnychenko, Mickeal Verschoor, Miguel A. Otaduy, Dan Casas, Christian Theobalt

We present a novel method for real-time pose and shape reconstruction of two strongly interacting hands.

Physical Simulations

Paper
Add Code

NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction

6 code implementations • NeurIPS 2021 • Peng Wang, Lingjie Liu, YuAn Liu, Christian Theobalt, Taku Komura, Wenping Wang

In NeuS, we propose to represent a surface as the zero-level set of a signed distance function (SDF) and develop a new volume rendering method to train a neural SDF representation.

Novel View Synthesis Surface Reconstruction

1,472

Paper
Code

Fast Simultaneous Gravitational Alignment of Multiple Point Sets

no code implementations • 21 Jun 2021 • Vladislav Golyanik, Soshi Shimada, Christian Theobalt

The problem of simultaneous rigid alignment of multiple unordered point sets which is unbiased towards any of the inputs has recently attracted increasing interest, and several reliable methods have been newly proposed.

Paper
Add Code

RGB2Hands: Real-Time Tracking of 3D Hand Interactions from Monocular RGB Video

no code implementations • 22 Jun 2021 • Jiayi Wang, Franziska Mueller, Florian Bernard, Suzanne Sorli, Oleksandr Sotnychenko, Neng Qian, Miguel A. Otaduy, Dan Casas, Christian Theobalt

Moreover, we demonstrate that our approach offers previously unseen two-hand tracking performance from RGB, and quantitatively and qualitatively outperforms existing RGB-based methods that were not explicitly designed for two-hand interactions.

3D Reconstruction Sign Language Recognition

Paper
Add Code

HandVoxNet++: 3D Hand Shape and Pose Estimation using Voxel-Based Neural Networks

no code implementations • 2 Jul 2021 • Jameel Malik, Soshi Shimada, Ahmed Elhayek, Sk Aziz Ali, Christian Theobalt, Vladislav Golyanik, Didier Stricker

To address the limitations of the existing methods, we develop HandVoxNet++, i. e., a voxel-based deep network with 3D and graph convolutions trained in a fully supervised manner.

3D Hand Pose Estimation

Paper
Add Code

NRST: Non-rigid Surface Tracking from Monocular Video

no code implementations • 6 Jul 2021 • Marc Habermann, Weipeng Xu, Helge Rhodin, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt

Our texture term exploits the orientation information in the micro-structures of the objects, e. g., the yarn patterns of fabrics.

Paper
Add Code

Egocentric Videoconferencing

no code implementations • 7 Jul 2021 • Mohamed Elgharib, Mohit Mendiratta, Justus Thies, Matthias Nießner, Hans-Peter Seidel, Ayush Tewari, Vladislav Golyanik, Christian Theobalt

Even holding a mobile phone camera in the front of the face while sitting for a long duration is not convenient.

Face Reenactment Mixed Reality

Paper
Add Code

Self-supervised Outdoor Scene Relighting

no code implementations • ECCV 2020 • Ye Yu, Abhimitra Meka, Mohamed Elgharib, Hans-Peter Seidel, Christian Theobalt, William A. P. Smith

Outdoor scene relighting is a challenging problem that requires good understanding of the scene geometry, illumination and albedo.

Paper
Add Code

Adiabatic Quantum Graph Matching with Permutation Matrix Constraints

no code implementations • 8 Jul 2021 • Marcel Seelbach Benkner, Vladislav Golyanik, Christian Theobalt, Michael Moeller

In this work, we address such problems with emerging quantum computing technology and propose several reformulations of QAPs as unconstrained problems suitable for efficient execution on quantum hardware.

Graph Matching valid

Paper
Add Code

StyleVideoGAN: A Temporal Generative Model using a Pretrained StyleGAN

no code implementations • 15 Jul 2021 • Gereon Fox, Ayush Tewari, Mohamed Elgharib, Christian Theobalt

We demonstrate that it suffices to train our temporal architecture on only 10 minutes of footage of 1 subject for about 6 hours.

Paper
Add Code

Neural Rays for Occlusion-aware Image-based Rendering

1 code implementation • CVPR 2022 • YuAn Liu, Sida Peng, Lingjie Liu, Qianqian Wang, Peng Wang, Christian Theobalt, Xiaowei Zhou, Wenping Wang

On such a 3D point, these generalization methods will include inconsistent image features from invisible views, which interfere with the radiance field construction.

Neural Rendering Novel View Synthesis +1

395

Paper
Code

Gravity-Aware Monocular 3D Human-Object Reconstruction

no code implementations • ICCV 2021 • Rishabh Dabral, Soshi Shimada, Arjun Jain, Christian Theobalt, Vladislav Golyanik

We evaluate GraviCap on a new dataset with ground-truth annotations for persons and different objects undergoing free flights.

Human-Object Interaction Detection Object +1

Paper
Add Code

StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis

1 code implementation • ICLR 2022 • Jiatao Gu, Lingjie Liu, Peng Wang, Christian Theobalt

We perform volume rendering only to produce a low-resolution feature map and progressively apply upsampling in 2D to address the first issue.

Image Generation

953

Paper
Code

A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis

1 code implementation • NeurIPS 2021 • Xingang Pan, Xudong Xu, Chen Change Loy, Christian Theobalt, Bo Dai

Motivated by the observation that a 3D object should look realistic from multiple viewpoints, these methods introduce a multi-view constraint as regularization to learn valid 3D radiance fields from 2D images.

3D-Aware Image Synthesis 3D Shape Reconstruction +2

145

Paper
Code

Advances in Neural Rendering

1 code implementation • 10 Nov 2021 • Ayush Tewari, Justus Thies, Ben Mildenhall, Pratul Srinivasan, Edgar Tretschk, Yifan Wang, Christoph Lassner, Vincent Sitzmann, Ricardo Martin-Brualla, Stephen Lombardi, Tomas Simon, Christian Theobalt, Matthias Niessner, Jonathan T. Barron, Gordon Wetzstein, Michael Zollhoefer, Vladislav Golyanik

The reconstruction of such a scene representation from observations using differentiable rendering losses is known as inverse graphics or inverse rendering.

Inverse Rendering Neural Rendering

Paper
Code

A Deeper Look into DeepCap

no code implementations • 20 Nov 2021 • Marc Habermann, Weipeng Xu, Michael Zollhoefer, Gerard Pons-Moll, Christian Theobalt

Human performance capture is a highly important computer vision problem with many applications in movie production and virtual/augmented reality.

Pose Estimation

Paper
Add Code

EgoRenderer: Rendering Human Avatars from Egocentric Camera Images

no code implementations • ICCV 2021 • Tao Hu, Kripasindhu Sarkar, Lingjie Liu, Matthias Zwicker, Christian Theobalt

We next combine the target pose image and the textures into a combined feature image, which is transformed into the output color image using a neural image translation network.

Texture Synthesis Translation

Paper
Add Code

NeRF for Outdoor Scene Relighting

no code implementations • 9 Dec 2021 • Viktor Rudnev, Mohamed Elgharib, William Smith, Lingjie Liu, Vladislav Golyanik, Christian Theobalt

Photorealistic editing of outdoor scenes from photographs requires a profound understanding of the image formation process and an accurate estimation of the scene geometry, reflectance and illumination.

Paper
Add Code

BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering

no code implementations • 10 Dec 2021 • Yuanbo Xiangli, Linning Xu, Xingang Pan, Nanxuan Zhao, Anyi Rao, Christian Theobalt, Bo Dai, Dahua Lin

The wide span of viewing positions within these scenes yields multi-scale renderings with very different levels of detail, which poses great challenges to neural radiance field and biases it towards compromised results.

Paper
Add Code

Multimodal Image Synthesis and Editing: The Generative AI Era

2 code implementations • 27 Dec 2021 • Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Shijian Lu, Lingjie Liu, Adam Kortylewski, Christian Theobalt, Eric Xing

With superb power in modeling the interaction among multimodal information, multimodal image synthesis and editing has become a hot research topic in recent years.

Image Generation

748

Paper
Code

f-SfT: Shape-From-Template With a Physics-Based Deformation Model

no code implementations • CVPR 2022 • Navami Kairanda, Edith Tretschk, Mohamed Elgharib, Christian Theobalt, Vladislav Golyanik

In contrast to previous works, this paper proposes a new SfT approach explaining 2D observations through physical simulations accounting for forces and material properties.

3D Reconstruction Physical Simulations

Paper
Add Code

Estimating Egocentric 3D Human Pose in the Wild with External Weak Supervision

no code implementations • CVPR 2022 • Jian Wang, Lingjie Liu, Weipeng Xu, Kripasindhu Sarkar, Diogo Luvizon, Christian Theobalt

Specifically, we first generate pseudo labels for the EgoPW dataset with a spatio-temporal optimization method by incorporating the external-view supervision.

Ranked #4 on Egocentric Pose Estimation on GlobalEgoMocap Test Dataset (using extra training data)

Egocentric Pose Estimation

Paper
Add Code

Playable Environments: Video Manipulation in Space and Time

1 code implementation • CVPR 2022 • Willi Menapace, Stéphane Lathuilière, Aliaksandr Siarohin, Christian Theobalt, Sergey Tulyakov, Vladislav Golyanik, Elisa Ricci

We present Playable Environments - a new representation for interactive video generation and manipulation in space and time.

Video Generation

Paper
Code

φ-SfT: Shape-from-Template with a Physics-Based Deformation Model

no code implementations • 22 Mar 2022 • Navami Kairanda, Edith Tretschk, Mohamed Elgharib, Christian Theobalt, Vladislav Golyanik

In contrast to previous works, this paper proposes a new SfT approach explaining 2D observations through physical simulations accounting for forces and material properties.

3D Reconstruction Physical Simulations

Paper
Add Code

Disentangled3D: Learning a 3D Generative Model with Disentangled Geometry and Appearance from Monocular Images

no code implementations • CVPR 2022 • Ayush Tewari, Mallikarjun B R, Xingang Pan, Ohad Fried, Maneesh Agrawala, Christian Theobalt

Our model can disentangle the geometry and appearance variations in the scene, i. e., we can independently sample from the geometry and appearance spaces of the generative model.

Disentanglement

Paper
Add Code

FIRe: Fast Inverse Rendering using Directional and Signed Distance Functions

no code implementations • 30 Mar 2022 • Tarun Yenamandra, Ayush Tewari, Nan Yang, Florian Bernard, Christian Theobalt, Daniel Cremers

To this end, we learn a signed distance function (SDF) along with our DDF model to represent a class of shapes.

3D Reconstruction Inverse Rendering

Paper
Add Code

Direct Dense Pose Estimation

no code implementations • 4 Apr 2022 • Liqian Ma, Lingjie Liu, Christian Theobalt, Luc van Gool

In addition, DDP is computationally more efficient than previous dense pose estimation methods, and it reduces jitters when applied to a video sequence, which is a problem plaguing the previous methods.

Action Recognition Pose Estimation +2

Paper
Add Code

BEHAVE: Dataset and Method for Tracking Human Object Interactions

1 code implementation • CVPR 2022 • Bharat Lal Bhatnagar, Xianghui Xie, Ilya A. Petrov, Cristian Sminchisescu, Christian Theobalt, Gerard Pons-Moll

We present BEHAVE dataset, the first full body human- object interaction dataset with multi-view RGBD frames and corresponding 3D SMPL and object fits along with the annotated contacts between them.

Human-Object Interaction Detection Mixed Reality +1

133

Paper
Code

HULC: 3D Human Motion Capture with Pose Manifold Sampling and Dense Contact Guidance

no code implementations • 11 May 2022 • Soshi Shimada, Vladislav Golyanik, Zhi Li, Patrick Pérez, Weipeng Xu, Christian Theobalt

Marker-less monocular 3D human motion capture (MoCap) with scene interactions is a challenging research topic relevant for extended reality, robotics and virtual avatar generation.

Paper
Add Code

Physics Informed Neural Fields for Smoke Reconstruction with Sparse Data

no code implementations • 14 Jun 2022 • Mengyu Chu, Lingjie Liu, Quan Zheng, Erik Franz, Hans-Peter Seidel, Christian Theobalt, Rhaleb Zayer

With a hybrid architecture that separates static and dynamic contents, fluid interactions with static obstacles are reconstructed for the first time without additional geometry input or human labeling.

Paper
Add Code

Unbiased 4D: Monocular 4D Reconstruction with a Neural Deformation Model

no code implementations • 16 Jun 2022 • Erik C. M. Johnson, Marc Habermann, Soshi Shimada, Vladislav Golyanik, Christian Theobalt

Capturing general deforming scenes from monocular RGB video is crucial for many computer graphics and vision applications.

3D Reconstruction 4D reconstruction +1

Paper
Add Code

GAN2X: Non-Lambertian Inverse Rendering of Image GANs

no code implementations • 18 Jun 2022 • Xingang Pan, Ayush Tewari, Lingjie Liu, Christian Theobalt

2D images are observations of the 3D physical world depicted with the geometry, material, and illumination components.

3D Face Reconstruction Inverse Rendering

Paper
Add Code

EventNeRF: Neural Radiance Fields from a Single Colour Event Camera

no code implementations • CVPR 2023 • Viktor Rudnev, Mohamed Elgharib, Christian Theobalt, Vladislav Golyanik

Asynchronously operating event cameras find many applications due to their high dynamic range, vanishingly low motion blur, low latency and low data bandwidth.

3D Reconstruction Novel View Synthesis +1

Paper
Add Code

Learn to Predict How Humans Manipulate Large-sized Objects from Interactive Motions

no code implementations • 25 Jun 2022 • Weilin Wan, Lei Yang, Lingjie Liu, Zhuoying Zhang, Ruixing Jia, Yi-King Choi, Jia Pan, Christian Theobalt, Taku Komura, Wenping Wang

We also observe that an object's intrinsic physical properties are useful for the object motion prediction, and thus design a set of object dynamic descriptors to encode such intrinsic properties.

Human-Object Interaction Detection motion prediction +1

Paper
Add Code

NeuRIS: Neural Reconstruction of Indoor Scenes Using Normal Priors

no code implementations • 27 Jun 2022 • Jiepeng Wang, Peng Wang, Xiaoxiao Long, Christian Theobalt, Taku Komura, Lingjie Liu, Wenping Wang

The key idea of NeuRIS is to integrate estimated normal of indoor scenes as a prior in a neural rendering framework for reconstructing large texture-less shapes and, importantly, to do this in an adaptive manner to also enable the reconstruction of irregular shapes with fine details.

3D Reconstruction Neural Rendering

Paper
Add Code

Neural Radiance Transfer Fields for Relightable Novel-view Synthesis with Global Illumination

no code implementations • 27 Jul 2022 • Linjie Lyu, Ayush Tewari, Thomas Leimkuehler, Marc Habermann, Christian Theobalt

Given a set of images of a scene, the re-rendering of this scene from novel views and lighting conditions is an important and challenging problem in Computer Vision and Graphics.

Disentanglement Novel View Synthesis

Paper
Add Code

UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture

no code implementations • 2 Aug 2022 • Hiroyasu Akada, Jian Wang, Soshi Shimada, Masaki Takahashi, Christian Theobalt, Vladislav Golyanik

We present UnrealEgo, i. e., a new large-scale naturalistic dataset for egocentric 3D human pose estimation.

Ranked #2 on Egocentric Pose Estimation on UnrealEgo

Egocentric Pose Estimation Keypoint Estimation

Paper
Add Code

MoCapDeform: Monocular 3D Human Motion Capture in Deformable Scenes

1 code implementation • 17 Aug 2022 • Zhi Li, Soshi Shimada, Bernt Schiele, Christian Theobalt, Vladislav Golyanik

3D human motion capture from monocular RGB images respecting interactions of a subject with complex and possibly deformable environments is a very challenging, ill-posed and under-explored problem.

3D Human Pose Estimation

Paper
Code

Neural Novel Actor: Learning a Generalized Animatable Neural Representation for Human Actors

no code implementations • 25 Aug 2022 • Yiming Wang, Qingzhe Gao, Libin Liu, Lingjie Liu, Christian Theobalt, Baoquan Chen

The learned representation can be used to synthesize novel view images of an arbitrary person from a sparse set of cameras, and further animate them with the user's pose control.

Attribute

Paper
Add Code

Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction

1 code implementation • 26 Aug 2022 • Tong Wu, Jiaqi Wang, Xingang Pan, Xudong Xu, Christian Theobalt, Ziwei Liu, Dahua Lin

Previous methods based on neural volume rendering mostly train a fully implicit model with MLPs, which typically require hours of training for a single scene.

Surface Reconstruction

400

Paper
Code

HandFlow: Quantifying View-Dependent 3D Ambiguity in Two-Hand Reconstruction with Normalizing Flow

no code implementations • 4 Oct 2022 • Jiayi Wang, Diogo Luvizon, Franziska Mueller, Florian Bernard, Adam Kortylewski, Dan Casas, Christian Theobalt

Through this, we demonstrate the quality of our probabilistic reconstruction and show that explicit ambiguity modeling is better-suited for this challenging problem.

valid

Paper
Add Code

HiFECap: Monocular High-Fidelity and Expressive Capture of Human Performances

no code implementations • 11 Oct 2022 • Yue Jiang, Marc Habermann, Vladislav Golyanik, Christian Theobalt

Furthermore, we show that HiFECap outperforms the state-of-the-art human performance capture approaches qualitatively and quantitatively while for the first time capturing all aspects of the human.

Vocal Bursts Intensity Prediction

Paper
Add Code

HDHumans: A Hybrid Approach for High-fidelity Digital Humans

no code implementations • 21 Oct 2022 • Marc Habermann, Lingjie Liu, Weipeng Xu, Gerard Pons-Moll, Michael Zollhoefer, Christian Theobalt

Photo-real digital human avatars are of enormous importance in graphics, as they enable immersive communication over the globe, improve gaming and entertainment experiences, and can be particularly beneficial for AR and VR settings.

Novel View Synthesis Surface Reconstruction +1

Paper
Add Code

State of the Art in Dense Monocular Non-Rigid 3D Reconstruction

no code implementations • 27 Oct 2022 • Edith Tretschk, Navami Kairanda, Mallikarjun B R, Rishabh Dabral, Adam Kortylewski, Bernhard Egger, Marc Habermann, Pascal Fua, Christian Theobalt, Vladislav Golyanik

3D reconstruction of deformable (or non-rigid) scenes from a set of monocular 2D image observations is a long-standing and actively researched area of computer vision and graphics.

3D Reconstruction

Paper
Add Code

gCoRF: Generative Compositional Radiance Fields

no code implementations • 31 Oct 2022 • Mallikarjun BR, Ayush Tewari, Xingang Pan, Mohamed Elgharib, Christian Theobalt

We start with a global generative model (GAN) and learn to decompose it into different semantic parts using supervision from 2D segmentation masks.

Image Generation

Paper
Add Code

Batch-based Model Registration for Fast 3D Sherd Reconstruction

no code implementations • ICCV 2023 • Jiepeng Wang, Congyi Zhang, Peng Wang, Xin Li, Peter J. Cobb, Christian Theobalt, Wenping Wang

In this work, we aim to develop a portable, high-throughput, and accurate reconstruction system for efficient digitization of fragments excavated in archaeological sites.

3D Reconstruction

Paper
Add Code

An Implicit Parametric Morphable Dental Model

no code implementations • 21 Nov 2022 • Congyi Zhang, Mohamed Elgharib, Gereon Fox, Min Gu, Christian Theobalt, Wenping Wang

Current dental models use an explicit mesh scene representation and model only the teeth, ignoring the gum.

Paper
Add Code

GlowGAN: Unsupervised Learning of HDR Images from LDR Images in the Wild

no code implementations • ICCV 2023 • Chao Wang, Ana Serrano, Xingang Pan, Bin Chen, Hans-Peter Seidel, Christian Theobalt, Karol Myszkowski, Thomas Leimkuehler

Most in-the-wild images are stored in Low Dynamic Range (LDR) form, serving as a partial observation of the High Dynamic Range (HDR) visual world.

Generative Adversarial Network inverse tone mapping +2

Paper
Add Code

NeuralUDF: Learning Unsigned Distance Fields for Multi-view Reconstruction of Surfaces with Arbitrary Topologies

no code implementations • CVPR 2023 • Xiaoxiao Long, Cheng Lin, Lingjie Liu, YuAn Liu, Peng Wang, Christian Theobalt, Taku Komura, Wenping Wang

In this paper, we propose to represent surfaces as the Unsigned Distance Function (UDF) and develop a new volume rendering scheme to learn the neural UDF representation.

Neural Rendering

Paper
Add Code

Fast Non-Rigid Radiance Fields from Monocularized Data

no code implementations • 2 Dec 2022 • Moritz Kappel, Vladislav Golyanik, Susana Castillo, Christian Theobalt, Marcus Magnor

The reconstruction and novel view synthesis of dynamic scenes recently gained increased attention.

3D Reconstruction Novel View Synthesis

Paper
Add Code

MoFusion: A Framework for Denoising-Diffusion-based Motion Synthesis

no code implementations • CVPR 2023 • Rishabh Dabral, Muhammad Hamza Mughal, Vladislav Golyanik, Christian Theobalt

Conventional methods for human motion synthesis are either deterministic or struggle with the trade-off between motion diversity and motion quality.

Ranked #4 on Motion Synthesis on AIST++

Denoising Motion Synthesis

Paper
Add Code

NeuS2: Fast Learning of Neural Implicit Surfaces for Multi-view Reconstruction

1 code implementation • ICCV 2023 • Yiming Wang, Qin Han, Marc Habermann, Kostas Daniilidis, Christian Theobalt, Lingjie Liu

Recent methods for neural surface representation and rendering, for example NeuS, have demonstrated the remarkably high-quality reconstruction of static scenes.

Surface Reconstruction

563

Paper
Code

IMos: Intent-Driven Full-Body Motion Synthesis for Human-Object Interactions

1 code implementation • 14 Dec 2022 • Anindita Ghosh, Rishabh Dabral, Vladislav Golyanik, Christian Theobalt, Philipp Slusallek

Is it possible to synthesize such motion plausibly with a diverse set of objects and instructions?

Human-Object Interaction Detection Motion Synthesis

Paper
Code

Scene-aware Egocentric 3D Human Pose Estimation

1 code implementation • CVPR 2023 • Jian Wang, Lingjie Liu, Weipeng Xu, Kripasindhu Sarkar, Diogo Luvizon, Christian Theobalt

To this end, we propose an egocentric depth estimation network to predict the scene depth map from a wide-view egocentric fisheye camera while mitigating the occlusion of the human body with a depth-inpainting network.

Ranked #3 on Egocentric Pose Estimation on GlobalEgoMocap Test Dataset (using extra training data)

Depth Estimation Egocentric Pose Estimation

Paper
Code

Imitator: Personalized Speech-driven 3D Facial Animation

no code implementations • ICCV 2023 • Balamurugan Thambiraja, Ikhsanul Habibie, Sadegh Aliakbarian, Darren Cosker, Christian Theobalt, Justus Thies

To address this, we present Imitator, a speech-driven facial expression synthesis method, which learns identity-specific details from a short input video and produces novel facial expressions matching the identity-specific speaking style and facial idiosyncrasies of the target actor.

Paper
Add Code

F2-NeRF: Fast Neural Radiance Field Training With Free Camera Trajectories

no code implementations • CVPR 2023 • Peng Wang, YuAn Liu, Zhaoxi Chen, Lingjie Liu, Ziwei Liu, Taku Komura, Christian Theobalt, Wenping Wang

Existing fast grid-based NeRF training frameworks, like Instant-NGP, Plenoxels, DVGO, or TensoRF, are mainly designed for bounded scenes and rely on space warping to handle unbounded scenes.

Novel View Synthesis

Paper
Add Code

Scene-Aware 3D Multi-Human Motion Capture from a Single Camera

1 code implementation • 12 Jan 2023 • Diogo Luvizon, Marc Habermann, Vladislav Golyanik, Adam Kortylewski, Christian Theobalt

In this work, we consider the problem of estimating the 3D position of multiple humans in a scene as well as their body shape and articulation from a single RGB video recorded with a static camera.

Position

111

Paper
Code

NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion

no code implementations • 20 Feb 2023 • Jiatao Gu, Alex Trevithick, Kai-En Lin, Josh Susskind, Christian Theobalt, Lingjie Liu, Ravi Ramamoorthi

Novel view synthesis from a single image requires inferring occluded regions of objects and scenes whilst simultaneously maintaining semantic and physical consistency with the input.

Novel View Synthesis

Paper
Add Code

Regularized Vector Quantization for Tokenized Image Synthesis

no code implementations • CVPR 2023 • Jiahui Zhang, Fangneng Zhan, Christian Theobalt, Shijian Lu

The first is a prior distribution regularization which measures the discrepancy between a prior token distribution and the predicted token distribution to avoid codebook collapse and low codebook utilization.

Image Generation Quantization

Paper
Add Code

Grid-guided Neural Radiance Fields for Large Urban Scenes

no code implementations • CVPR 2023 • Linning Xu, Yuanbo Xiangli, Sida Peng, Xingang Pan, Nanxuan Zhao, Christian Theobalt, Bo Dai, Dahua Lin

An alternative solution is to use a feature grid representation, which is computationally efficient and can naturally scale to a large scene with increased grid resolutions.

Paper
Add Code

HQ3DAvatar: High Quality Controllable 3D Head Avatar

no code implementations • 25 Mar 2023 • Kartik Teotia, Mallikarjun B R, Xingang Pan, Hyeongwoo Kim, Pablo Garrido, Mohamed Elgharib, Christian Theobalt

This paper presents a novel approach to building highly photorealistic digital head avatars.

Optical Flow Estimation Vocal Bursts Intensity Prediction

Paper
Add Code

CCuantuMM: Cycle-Consistent Quantum-Hybrid Matching of Multiple Shapes

no code implementations • CVPR 2023 • Harshil Bhatia, Edith Tretschk, Zorah Lähner, Marcel Seelbach Benkner, Michael Moeller, Christian Theobalt, Vladislav Golyanik

Jointly matching multiple, non-rigidly deformed 3D shapes is a challenging, $\mathcal{NP}$-hard problem.

Paper
Add Code

F$^{2}$-NeRF: Fast Neural Radiance Field Training with Free Camera Trajectories

1 code implementation • 28 Mar 2023 • Peng Wang, YuAn Liu, Zhaoxi Chen, Lingjie Liu, Ziwei Liu, Taku Komura, Christian Theobalt, Wenping Wang

Based on our analysis, we further propose a novel space-warping method called perspective warping, which allows us to handle arbitrary trajectories in the grid-based NeRF framework.

Novel View Synthesis

893

Paper
Code

GVP: Generative Volumetric Primitives

no code implementations • 31 Mar 2023 • Mallikarjun B R, Xingang Pan, Mohamed Elgharib, Christian Theobalt

Advances in 3D-aware generative models have pushed the boundary of image synthesis with explicit camera control.

Image Generation Knowledge Distillation

Paper
Add Code

OOD-CV-v2: An extended Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images

no code implementations • 17 Apr 2023 • Bingchen Zhao, Jiahao Wang, Wufei Ma, Artur Jesslen, Siwei Yang, Shaozuo Yu, Oliver Zendel, Christian Theobalt, Alan Yuille, Adam Kortylewski

Enhancing the robustness of vision algorithms in real-world scenarios is challenging.

3D Pose Estimation Benchmarking +4

Paper
Add Code

EgoLocate: Real-time Motion Capture, Localization, and Mapping with Sparse Body-mounted Sensors

no code implementations • 2 May 2023 • Xinyu Yi, Yuxiao Zhou, Marc Habermann, Vladislav Golyanik, Shaohua Pan, Christian Theobalt, Feng Xu

We integrate the two techniques together in EgoLocate, a system that simultaneously performs human motion capture (mocap), localization, and mapping in real time from sparse body-mounted sensors, including 6 inertial measurement units (IMUs) and a monocular phone camera.

Simultaneous Localization and Mapping

Paper
Add Code

General Neural Gauge Fields

1 code implementation • 5 May 2023 • Fangneng Zhan, Lingjie Liu, Adam Kortylewski, Christian Theobalt

In this work, we extend this problem to a general paradigm with a taxonomy of discrete \& continuous cases, and develop a learning framework to jointly optimize gauge transformations and neural fields.

Representation Learning

151

Paper
Code

Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold

5 code implementations • 18 May 2023 • Xingang Pan, Ayush Tewari, Thomas Leimkühler, Lingjie Liu, Abhimitra Meka, Christian Theobalt

Synthesizing visual content that meets users' needs often requires flexible and precise controllability of the pose, shape, expression, and layout of the generated objects.

Image Manipulation Point Tracking +1

34,901

Paper
Code

Weakly Supervised 3D Open-vocabulary Segmentation

1 code implementation • NeurIPS 2023 • Kunhao Liu, Fangneng Zhan, Jiahui Zhang, Muyu Xu, Yingchen Yu, Abdulmotaleb El Saddik, Christian Theobalt, Eric Xing, Shijian Lu

Open-vocabulary segmentation of 3D scenes is a fundamental function of human perception and thus a crucial objective in computer vision research.

Segmentation

Paper
Code

AvatarStudio: Text-driven Editing of 3D Dynamic Human Head Avatars

no code implementations • 1 Jun 2023 • Mohit Mendiratta, Xingang Pan, Mohamed Elgharib, Kartik Teotia, Mallikarjun B R, Ayush Tewari, Vladislav Golyanik, Adam Kortylewski, Christian Theobalt

Our method edits the full head in a canonical space, and then propagates these edits to remaining time steps via a pretrained deformation network.

Paper
Add Code

VINECS: Video-based Neural Character Skinning

no code implementations • 3 Jul 2023 • Zhouyingcheng Liao, Vladislav Golyanik, Marc Habermann, Christian Theobalt

However, the former methods typically predict solely static skinning weights, which perform poorly for highly articulated poses, and the latter ones either require dense 3D character scans in different poses or cannot generate an explicit mesh with vertex correspondence over time.

Paper
Add Code

WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields

no code implementations • ICCV 2023 • Muyu Xu, Fangneng Zhan, Jiahui Zhang, Yingchen Yu, Xiaoqin Zhang, Christian Theobalt, Ling Shao, Shijian Lu

Neural Radiance Field (NeRF) has shown impressive performance in novel view synthesis via implicit scene representation.

Novel View Synthesis

Paper
Add Code

SceNeRFlow: Time-Consistent Reconstruction of General Dynamic Scenes

no code implementations • 16 Aug 2023 • Edith Tretschk, Vladislav Golyanik, Michael Zollhoefer, Aljaz Bozic, Christoph Lassner, Christian Theobalt

We propose SceNeRFlow to reconstruct a general, non-rigid scene in a time-consistent manner.

4D reconstruction Novel View Synthesis

Paper
Add Code

ROAM: Robust and Object-Aware Motion Generation Using Neural Pose Descriptors

no code implementations • 24 Aug 2023 • Wanyue Zhang, Rishabh Dabral, Thomas Leimkühler, Vladislav Golyanik, Marc Habermann, Christian Theobalt

Given an unseen object and a reference pose-object pair, we optimise for the object-aware pose that is closest in the feature space to the reference pose.

Motion Synthesis Object

Paper
Add Code

NeuralClothSim: Neural Deformation Fields Meet the Kirchhoff-Love Thin Shell Theory

no code implementations • 24 Aug 2023 • Navami Kairanda, Marc Habermann, Christian Theobalt, Vladislav Golyanik

Cloth simulation is an extensively studied problem, with a plethora of solutions available in computer graphics literature.

Paper
Add Code

Decaf: Monocular Deformation Capture for Face and Hand Interactions

no code implementations • 28 Sep 2023 • Soshi Shimada, Vladislav Golyanik, Patrick Pérez, Christian Theobalt

At the core of our neural approach are a variational auto-encoder supplying the hand-face depth prior and modules that guide the 3D tracking by estimating the contacts and the deformations.

Paper
Add Code

Diffusion Posterior Illumination for Ambiguity-aware Inverse Rendering

1 code implementation • 30 Sep 2023 • Linjie Lyu, Ayush Tewari, Marc Habermann, Shunsuke Saito, Michael Zollhöfer, Thomas Leimkühler, Christian Theobalt

We further conduct an extensive comparative study of different priors on illumination used in previous work on inverse rendering.

Denoising Inverse Rendering

Paper
Code

State of the Art on Diffusion Models for Visual Computing

no code implementations • 11 Oct 2023 • Ryan Po, Wang Yifan, Vladislav Golyanik, Kfir Aberman, Jonathan T. Barron, Amit H. Bermano, Eric Ryan Chan, Tali Dekel, Aleksander Holynski, Angjoo Kanazawa, C. Karen Liu, Lingjie Liu, Ben Mildenhall, Matthias Nießner, Björn Ommer, Christian Theobalt, Peter Wonka, Gordon Wetzstein

The field of visual computing is rapidly advancing due to the emergence of generative artificial intelligence (AI), which unlocks unprecedented capabilities for the generation, editing, and reconstruction of images, videos, and 3D scenes.

Paper
Add Code

Wonder3D: Single Image to 3D using Cross-Domain Diffusion

no code implementations • 23 Oct 2023 • Xiaoxiao Long, Yuan-Chen Guo, Cheng Lin, YuAn Liu, Zhiyang Dou, Lingjie Liu, Yuexin Ma, Song-Hai Zhang, Marc Habermann, Christian Theobalt, Wenping Wang

In this work, we introduce Wonder3D, a novel method for efficiently generating high-fidelity textured meshes from single-view images. Recent methods based on Score Distillation Sampling (SDS) have shown the potential to recover 3D geometry from 2D diffusion priors, but they typically suffer from time-consuming per-shape optimization and inconsistent geometry.

Image to 3D

Paper
Add Code

3D-QAE: Fully Quantum Auto-Encoding of 3D Point Clouds

no code implementations • 9 Nov 2023 • Lakshika Rathi, Edith Tretschk, Christian Theobalt, Rishabh Dabral, Vladislav Golyanik

It is trained on collections of 3D point clouds to produce their compressed representations.

Quantum Machine Learning

Paper
Add Code

DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields

no code implementations • 18 Nov 2023 • Yu Chi, Fangneng Zhan, Sibo Wu, Christian Theobalt, Adam Kortylewski

The generated data is applicable across various computer vision tasks, including video segmentation and 3D point cloud segmentation.

Point Cloud Segmentation Segmentation +2

Paper
Add Code

Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based Motion Refinement

no code implementations • 28 Nov 2023 • Jian Wang, Zhe Cao, Diogo Luvizon, Lingjie Liu, Kripasindhu Sarkar, Danhang Tang, Thabo Beeler, Christian Theobalt

In this work, we explore egocentric whole-body motion capture using a single fisheye camera, which simultaneously estimates human body and hand motion.

Ranked #1 on Egocentric Pose Estimation on GlobalEgoMocap Test Dataset (using extra training data)

Egocentric Pose Estimation Hand Detection +2

Paper
Add Code

ReMoS: 3D Motion-Conditioned Reaction Synthesis for Two-Person Interactions

no code implementations • 28 Nov 2023 • Anindita Ghosh, Rishabh Dabral, Vladislav Golyanik, Christian Theobalt, Philipp Slusallek

Current approaches for 3D human motion synthesis generate high-quality animations of digital humans performing a wide variety of actions and gestures.

Denoising Motion Synthesis

Paper
Add Code

Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using Diffusion Models

no code implementations • 28 Nov 2023 • Zhengming Yu, Zhiyang Dou, Xiaoxiao Long, Cheng Lin, Zekun Li, YuAn Liu, Norman Müller, Taku Komura, Marc Habermann, Christian Theobalt, Xin Li, Wenping Wang

The experiments demonstrate the superior performance of Surf-D in shape generation across multiple modalities as conditions.

3D Reconstruction

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.