About

Talking head generation is the task of generating a talking face from a set of images of a person.

( Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models )

Benchmarks

TREND DATASET BEST METHOD PAPER TITLE PAPER CODE COMPARE

Subtasks

Datasets

Greatest papers with code

A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild

23 Aug 2020Rudrabha/Wav2Lip

However, they fail to accurately morph the lip movements of arbitrary identities in dynamic, unconstrained talking face videos, resulting in significant parts of the video being out-of-sync with the new audio.

UNCONSTRAINED LIP-SYNCHRONIZATION

Fast Bi-layer Neural Synthesis of One-Shot Realistic Head Avatars

ECCV 2020 saic-violet/bilayer-model

The texture image is generated offline, warped and added to the coarse image to ensure a high effective resolution of synthesized head views.

NEURAL RENDERING TALKING HEAD GENERATION

ReenactGAN: Learning to Reenact Faces via Boundary Transfer

ECCV 2018 wywu/ReenactGAN

A transformer is subsequently used to adapt the boundary of source face to the boundary of target face.

FACE REENACTMENT TALKING FACE GENERATION TALKING HEAD GENERATION

Talking-head Generation with Rhythmic Head Motion

16 Jul 2020lelechen63/Talking-head-Generation-with-Rhythmic-Head-Motion

When people deliver a speech, they naturally move heads, and this rhythmic head motion conveys prosodic information.

TALKING HEAD GENERATION

What comprises a good talking-head video generation?: A Survey and Benchmark

7 May 2020lelechen63/talking-head-generation-survey

In this work, we present a carefully-designed benchmark for evaluating talking-head video generation with standardized dataset pre-processing strategies.

TALKING HEAD GENERATION VIDEO GENERATION

MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation

ECCV 2020 uniBruce/Mead

The synthesis of natural emotional reactions is an essentialcriteria in vivid talking-face video generation.

TALKING FACE GENERATION TALKING HEAD GENERATION VIDEO GENERATION

Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation

16 Apr 2021FuxiVirtualHuman/Write-a-Speaker

To be specific, our framework consists of a speaker-independent stage and a speaker-specific stage.

FACE MODEL MOTION CAPTURE TALKING HEAD GENERATION VIDEO GENERATION