About

Benchmarks

No evaluation results yet. Help compare methods by submit evaluation metrics.

Subtasks

Datasets

Greatest papers with code

KeyPose: Multi-View 3D Labeling and Keypoint Estimation for Transparent Objects

CVPR 2020 google-research/google-research

We address two problems: first, we establish an easy method for capturing and labeling 3D keypoints on desktop objects with an RGB camera; and second, we develop a deep neural network, called $KeyPose$, that learns to accurately predict object poses using 3D keypoints, from stereo input, and works even for transparent objects.

3D POSE ESTIMATION MOTION CAPTURE

XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera

1 Jul 2019rwightman/pytorch-image-models

The first stage is a convolutional neural network (CNN) that estimates 2D and 3D pose features along with identity assignments for all visible joints of all individuals. We contribute a new architecture for this CNN, called SelecSLS Net, that uses novel selective long and short range skip connections to improve the information flow allowing for a drastically faster network without compromising accuracy.

MONOCULAR 3D HUMAN POSE ESTIMATION MOTION CAPTURE

Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation

3 Feb 2021makcedward/nlpaug

In this paper, we apply multiscale area attention in a deep convolutional neural network to attend emotional characteristics with varied granularities and therefore the classifier can benefit from an ensemble of attentions with different scales.

DATA AUGMENTATION MOTION CAPTURE SPEECH EMOTION RECOGNITION

Unsupervised learning with sparse space-and-time autoencoders

26 Nov 2018facebookresearch/SparseConvNet

We use spatially-sparse two, three and four dimensional convolutional autoencoder networks to model sparse structures in 2D space, 3D space, and 3+1=4 dimensional space-time.

4 HANDWRITING RECOGNITION MOTION CAPTURE

FrankMocap: Fast Monocular 3D Hand and Body Motion Capture by Regression and Integration

19 Aug 2020facebookresearch/frankmocap

To construct FrankMocap, we build the state-of-the-art monocular 3D "hand" motion capture method by taking the hand part of the whole body parametric model (SMPL-X).

3D HAND POSE ESTIMATION 3D HUMAN RECONSTRUCTION 3D POSE ESTIMATION MOTION CAPTURE

Scalable Gradients for Stochastic Differential Equations

5 Jan 2020google-research/torchsde

The adjoint sensitivity method scalably computes gradients of solutions to ordinary differential equations.

MOTION CAPTURE VARIATIONAL INFERENCE VIDEO PREDICTION

Skeleton-Aware Networks for Deep Motion Retargeting

12 May 2020DeepMotionEditing/deep-motion-editing

In other words, our operators form the building blocks of a new deep motion processing framework that embeds the motion into a common latent space, shared by a collection of homeomorphic skeletons.

HIERARCHICAL STRUCTURE MOTION CAPTURE MOTION RETARGETING MOTION SYNTHESIS

Monocular Real-time Hand Shape and Motion Capture using Multi-modal Data

CVPR 2020 CalciferZh/minimal-hand

We present a novel method for monocular hand shape and pose estimation at unprecedented runtime performance of 100fps and at state-of-the-art accuracy.

MOTION CAPTURE POSE ESTIMATION