Search Results for author: Tomas Simon

Found 27 papers, 14 papers with code

Photogeometric Scene Flow for High-Detail Dynamic 3D Reconstruction

no code implementations • ICCV 2015 • Paulo F. U. Gotardo, Tomas Simon, Yaser Sheikh, Iain Matthews

This paper proposes photogeometric scene flow (PGSF) for high-quality dynamic 3D reconstruction.

3D Reconstruction Optical Flow Estimation +1


Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields

61 code implementations • CVPR 2017 • Zhe Cao, Tomas Simon, Shih-En Wei, Yaser Sheikh

We present an approach to efficiently detect the 2D pose of multiple people in an image.

Ranked #6 on Keypoint Detection on MPII Multi-Person

2D Human Pose Estimation 2D Pose Estimation +3

29,805


Panoptic Studio: A Massively Multiview System for Social Interaction Capture

1 code implementation • 9 Dec 2016 • Hanbyul Joo, Tomas Simon, Xulong Li, Hao Liu, Lei Tan, Lin Gui, Sean Banerjee, Timothy Godisart, Bart Nabbe, Iain Matthews, Takeo Kanade, Shohei Nobuhara, Yaser Sheikh

The core challenges in capturing social interactions are: (1) occlusion is functional and frequent; (2) subtle motion needs to be measured over a space large enough to host a social group; (3) human appearance and configuration variation is immense; and (4) attaching markers to the body may prime the nature of interactions.

390


Hand Keypoint Detection in Single Images using Multiview Bootstrapping

39 code implementations • CVPR 2017 • Tomas Simon, Hanbyul Joo, Iain Matthews, Yaser Sheikh

The method is used to train a hand keypoint detector for single images.

Keypoint Detection

29,805


Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies

no code implementations • CVPR 2018 • Hanbyul Joo, Tomas Simon, Yaser Sheikh

We present a unified deformation model for the markerless capture of multiple scales of human movement, including facial expressions, body motion, and hand gestures.


Deep Appearance Models for Face Rendering

1 code implementation • 1 Aug 2018 • Stephen Lombardi, Jason Saragih, Tomas Simon, Yaser Sheikh

At inference time, we condition the decoding network on the viewpoint of the camera in order to generate the appropriate texture for rendering.

702


OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields

50 code implementations • 18 Dec 2018 • Zhe Cao, Gines Hidalgo, Tomas Simon, Shih-En Wei, Yaser Sheikh

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

Ranked #4 on Pose Estimation on MPII Single Person

2D Pose Estimation Keypoint Detection +1

29,805


LBS Autoencoder: Self-supervised Fitting of Articulated Meshes to Point Clouds

no code implementations • CVPR 2019 • Chun-Liang Li, Tomas Simon, Jason Saragih, Barnabás Póczos, Yaser Sheikh

As input, we take a sequence of point clouds to be registered as well as an artist-rigged mesh, i.e., a template mesh equipped with a linear-blend skinning (LBS) deformation space parameterized by a skeleton hierarchy.


Towards Social Artificial Intelligence: Nonverbal Social Signal Prediction in A Triadic Interaction

1 code implementation • CVPR 2019 • Hanbyul Joo, Tomas Simon, Mina Cikara, Yaser Sheikh

We present a new research task and a dataset to understand human social interactions via computational methods, to ultimately endow machines with the ability to encode and decode a broad channel of social signals humans use.


Neural Volumes: Learning Dynamic Renderable Volumes from Images

1 code implementation • 18 Jun 2019 • Stephen Lombardi, Tomas Simon, Jason Saragih, Gabriel Schwartz, Andreas Lehrmann, Yaser Sheikh

Modeling and rendering of dynamic scenes is challenging, as natural scenes often contain complex phenomena such as thin structures, evolving topology, translucency, scattering, occlusion, and biological motion.

418


Single-Network Whole-Body Pose Estimation

2 code implementations • ICCV 2019 • Gines Hidalgo, Yaadhav Raaj, Haroon Idrees, Donglai Xiang, Hanbyul Joo, Tomas Simon, Yaser Sheikh

We present the first single-network approach for 2D whole-body pose estimation, which entails simultaneous localization of body, face, hands, and feet keypoints.

Multi-Task Learning Pose Estimation

564


PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization

3 code implementations • CVPR 2020 • Shunsuke Saito, Tomas Simon, Jason Saragih, Hanbyul Joo

Although current approaches have demonstrated the potential in real world settings, they still fail to produce reconstructions with the level of detail often present in the input images.

Ranked #1 on 3D Object Reconstruction From A Single Image on BUFF

3D Human Pose Estimation 3D Human Reconstruction +3

9,414


State of the Art on Neural Rendering

no code implementations • 8 Apr 2020 • Ayush Tewari, Ohad Fried, Justus Thies, Vincent Sitzmann, Stephen Lombardi, Kalyan Sunkavalli, Ricardo Martin-Brualla, Tomas Simon, Jason Saragih, Matthias Nießner, Rohit Pandey, Sean Fanello, Gordon Wetzstein, Jun-Yan Zhu, Christian Theobalt, Maneesh Agrawala, Eli Shechtman, Dan B. Goldman, Michael Zollhöfer

Neural rendering is a new and rapidly emerging field that combines generative machine learning techniques with physical knowledge from computer graphics, e.g., by the integration of differentiable rendering into network training.

BIG-bench Machine Learning Image Generation +2


Learning Compositional Radiance Fields of Dynamic Human Heads

1 code implementation • CVPR 2021 • Ziyan Wang, Timur Bagautdinov, Stephen Lombardi, Tomas Simon, Jason Saragih, Jessica Hodgins, Michael Zollhöfer

In addition, we show that the learned dynamic radiance field can be used to synthesize novel unseen expressions based on a global animation code.

Neural Rendering Synthetic Data Generation

702


PVA: Pixel-aligned Volumetric Avatars

no code implementations • 7 Jan 2021 • Amit Raj, Michael Zollhoefer, Tomas Simon, Jason Saragih, Shunsuke Saito, James Hays, Stephen Lombardi

Volumetric models typically employ a global code to represent facial expressions, such that they can be driven by a small set of animation parameters.


Mixture of Volumetric Primitives for Efficient Neural Rendering

1 code implementation • 2 Mar 2021 • Stephen Lombardi, Tomas Simon, Gabriel Schwartz, Michael Zollhoefer, Yaser Sheikh, Jason Saragih

Real-time rendering and animation of humans is a core function in games, movies, and telepresence applications.

Neural Rendering

702


SimPoE: Simulated Character Control for 3D Human Pose Estimation

no code implementations • CVPR 2021 • Ye Yuan, Shih-En Wei, Tomas Simon, Kris Kitani, Jason Saragih

Based on this refined kinematic pose, the policy learns to compute dynamics-based control (e.g., joint torques) of the character to advance the current-frame pose estimate to the pose estimate of the next frame.

Ranked #229 on 3D Human Pose Estimation on Human3.6M

3D Human Pose Estimation


Pixel Codec Avatars

1 code implementation • CVPR 2021 • Shugao Ma, Tomas Simon, Jason Saragih, Dawei Wang, Yuecheng Li, Fernando de la Torre, Yaser Sheikh

Telecommunication with photorealistic avatars in virtual or augmented reality is a promising path for achieving authentic face-to-face communication in 3D over remote physical distances.

702


Driving-Signal Aware Full-Body Avatars

no code implementations • 21 May 2021 • Timur Bagautdinov, Chenglei Wu, Tomas Simon, Fabian Prada, Takaaki Shiratori, Shih-En Wei, Weipeng Xu, Yaser Sheikh, Jason Saragih

The core intuition behind our method is that better drivability and generalization can be achieved by disentangling the driving signals and remaining generative factors, which are not available during animation.

Imputation


Pixel-Aligned Volumetric Avatars

no code implementations • CVPR 2021 • Amit Raj, Michael Zollhofer, Tomas Simon, Jason Saragih, Shunsuke Saito, James Hays, Stephen Lombardi

Volumetric models typically employ a global code to represent facial expressions, such that they can be driven by a small set of animation parameters.

Ranked #5 on Generalizable Novel View Synthesis on ZJU-MoCap

Generalizable Novel View Synthesis


Advances in Neural Rendering

1 code implementation • 10 Nov 2021 • Ayush Tewari, Justus Thies, Ben Mildenhall, Pratul Srinivasan, Edgar Tretschk, Yifan Wang, Christoph Lassner, Vincent Sitzmann, Ricardo Martin-Brualla, Stephen Lombardi, Tomas Simon, Christian Theobalt, Matthias Niessner, Jonathan T. Barron, Gordon Wetzstein, Michael Zollhoefer, Vladislav Golyanik

The reconstruction of such a scene representation from observations using differentiable rendering losses is known as inverse graphics or inverse rendering.

Inverse Rendering Neural Rendering


Drivable Volumetric Avatars using Texel-Aligned Features

no code implementations • 20 Jul 2022 • Edoardo Remelli, Timur Bagautdinov, Shunsuke Saito, Tomas Simon, Chenglei Wu, Shih-En Wei, Kaiwen Guo, Zhe Cao, Fabian Prada, Jason Saragih, Yaser Sheikh

To circumvent this, we propose a novel volumetric avatar representation by extending mixtures of volumetric primitives to articulated objects.


Multiface: A Dataset for Neural Face Rendering

1 code implementation • 22 Jul 2022 • Cheng-hsin Wuu, Ningyuan Zheng, Scott Ardisson, Rohan Bali, Danielle Belko, Eric Brockmeyer, Lucas Evans, Timothy Godisart, Hyowon Ha, Xuhua Huang, Alexander Hypes, Taylor Koska, Steven Krenn, Stephen Lombardi, Xiaomin Luo, Kevyn McPhail, Laura Millerschoen, Michal Perdoch, Mark Pitts, Alexander Richard, Jason Saragih, Junko Saragih, Takaaki Shiratori, Tomas Simon, Matt Stewart, Autumn Trimble, Xinshuo Weng, David Whitewolf, Chenglei Wu, Shoou-I Yu, Yaser Sheikh

Along with the release of the dataset, we conduct ablation studies on the influence of different model architectures toward the model's interpolation capacity of novel viewpoint and expressions.

Novel View Synthesis

702


MEGANE: Morphable Eyeglass and Avatar Network

no code implementations • CVPR 2023 • Junxuan Li, Shunsuke Saito, Tomas Simon, Stephen Lombardi, Hongdong Li, Jason Saragih

However, modeling the geometric and appearance interactions of glasses and the face of virtual representations of humans is challenging.

Image Generation Inverse Rendering


RelightableHands: Efficient Neural Relighting of Articulated Hand Models

no code implementations • CVPR 2023 • Shun Iwase, Shunsuke Saito, Tomas Simon, Stephen Lombardi, Timur Bagautdinov, Rohan Joshi, Fabian Prada, Takaaki Shiratori, Yaser Sheikh, Jason Saragih

To achieve generalization, we condition the student model with physics-inspired illumination features such as visibility, diffuse shading, and specular reflections computed on a coarse proxy geometry, maintaining a small computational overhead.


Relightable Gaussian Codec Avatars

no code implementations • 6 Dec 2023 • Shunsuke Saito, Gabriel Schwartz, Tomas Simon, Junxuan Li, Giljoo Nam

The fidelity of relighting is bounded by both geometry and appearance representations.


URHand: Universal Relightable Hands

no code implementations • 10 Jan 2024 • Zhaoxi Chen, Gyeongsik Moon, Kaiwen Guo, Chen Cao, Stanislav Pidhorskyi, Tomas Simon, Rohan Joshi, Yuan Dong, Yichen Xu, Bernardo Pires, He Wen, Lucas Evans, Bo Peng, Julia Buffalini, Autumn Trimble, Kevyn McPhail, Melissa Schoeller, Shoou-I Yu, Javier Romero, Michael Zollhöfer, Yaser Sheikh, Ziwei Liu, Shunsuke Saito

To simplify the personalization process while retaining photorealism, we build a powerful universal relightable prior based on neural relighting from multi-view images of hands captured in a light stage with hundreds of identities.

