TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Motion Synthesis	HumanML3D	MotionDiffuse	FID	0.630	# 18
Motion Synthesis	HumanML3D	MotionDiffuse	Diversity	9.410	# 15
Motion Synthesis	HumanML3D	MotionDiffuse	Multimodality	1.553	# 16
Motion Synthesis	HumanML3D	MotionDiffuse	R Precision Top3	0.782	# 13
Motion Synthesis	KIT Motion-Language	MotionDiffuse	FID	1.954	# 17
Motion Synthesis	KIT Motion-Language	MotionDiffuse	R Precision Top3	0.739	# 12
Motion Synthesis	KIT Motion-Language	MotionDiffuse	Diversity	11.10	# 3
Motion Synthesis	KIT Motion-Language	MotionDiffuse	Multimodality	0.730	# 19

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/motiondiffuse-text-driven-human-motion/motion-synthesis-on-kit-motion-language)](https://paperswithcode.com/sota/motion-synthesis-on-kit-motion-language?p=motiondiffuse-text-driven-human-motion)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/motiondiffuse-text-driven-human-motion/motion-synthesis-on-humanml3d)](https://paperswithcode.com/sota/motion-synthesis-on-humanml3d?p=motiondiffuse-text-driven-human-motion)`

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

31 Aug 2022 · Mingyuan Zhang, Zhongang Cai, Liang Pan, Fangzhou Hong, Xinying Guo, Lei Yang, Ziwei Liu ·

Human motion modeling is important for many modern graphics applications, which typically require professional skills. In order to remove the skill barriers for laymen, recent motion generation methods can directly generate human motions conditioned on natural languages. However, it remains challenging to achieve diverse and fine-grained motion generation with various text inputs. To address this problem, we propose MotionDiffuse, the first diffusion model-based text-driven motion generation framework, which demonstrates several desired properties over existing methods. 1) Probabilistic Mapping. Instead of a deterministic language-motion mapping, MotionDiffuse generates motions through a series of denoising steps in which variations are injected. 2) Realistic Synthesis. MotionDiffuse excels at modeling complicated data distribution and generating vivid motion sequences. 3) Multi-Level Manipulation. MotionDiffuse responds to fine-grained instructions on body parts, and arbitrary-length motion synthesis with time-varied text prompts. Our experiments show MotionDiffuse outperforms existing SoTA methods by convincing margins on text-driven motion generation and action-conditioned motion generation. A qualitative analysis further demonstrates MotionDiffuse's controllability for comprehensive motion generation. Homepage: https://mingyuan-zhang.github.io/projects/MotionDiffuse.html

PDF Abstract

Code

Add Remove Mark official

mingyuan-zhang/MotionDiffuse official

↳ Quickstart in

Colab

Spaces

769

viiika/diffusion-conductor

Tasks

Add Remove

Denoising

Motion Synthesis

Datasets

NeRF

AMASS

HumanML3D KIT Motion-Language

Results from the Paper

Edit

Ranked #17 on Motion Synthesis on KIT Motion-Language

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Motion Synthesis	HumanML3D	MotionDiffuse	FID	0.630	# 18	Compare
			Diversity	9.410	# 15	Compare
			Multimodality	1.553	# 16	Compare
			R Precision Top3	0.782	# 13	Compare
Motion Synthesis	KIT Motion-Language	MotionDiffuse	FID	1.954	# 17	Compare
			R Precision Top3	0.739	# 12	Compare
			Diversity	11.10	# 3	Compare
			Multimodality	0.730	# 19	Compare

Methods

Add Remove

Diffusion

Edit Social Preview

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove