About

Talking face generation aims to synthesize a sequence of face images that correspond to given speech semantics.

( Image credit: Talking Face Generation by Adversarially Disentangled Audio-Visual Representation )

Benchmarks

TREND DATASET BEST METHOD PAPER TITLE PAPER CODE COMPARE

Subtasks

Datasets

Greatest papers with code

A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild

23 Aug 2020Rudrabha/Wav2Lip

However, they fail to accurately morph the lip movements of arbitrary identities in dynamic, unconstrained talking face videos, resulting in significant parts of the video being out-of-sync with the new audio.

UNCONSTRAINED LIP-SYNCHRONIZATION

Talking Face Generation by Adversarially Disentangled Audio-Visual Representation

20 Jul 2018Hangz-nju-cuhk/Talking-Face-Generation-DAVS

Talking face generation aims to synthesize a sequence of face images that correspond to a clip of speech.

LIP READING TALKING FACE GENERATION VIDEO RETRIEVAL

Capture, Learning, and Synthesis of 3D Speaking Styles

CVPR 2019 TimoBolkart/voca

To address this, we introduce a unique 4D face dataset with about 29 minutes of 4D scans captured at 60 fps and synchronized audio from 12 speakers.

3D FACE ANIMATION TALKING FACE GENERATION

Hierarchical Cross-Modal Talking Face Generation With Dynamic Pixel-Wise Loss

CVPR 2019 lelechen63/ATVGnet

We devise a cascade GAN approach to generate talking face video, which is robust to different face shapes, view angles, facial characteristics, and noisy audio conditions.

TALKING FACE GENERATION

Talking Face Generation by Conditional Recurrent Adversarial Network

13 Apr 2018susanqq/Talking_Face_Generation

Given an arbitrary face image and an arbitrary speech clip, the proposed work attempts to generating the talking face video with accurate lip synchronization while maintaining smooth transition of both lip and facial movement over the entire video clip.

CONSTRAINED LIP-SYNCHRONIZATION VIDEO GENERATION