Browse > Computer Vision > Face Generation > Talking Face Generation

Talking Face Generation

Talking face generation aims to synthesize a sequence of face images that correspond to given speech semantics.

( Image credit: Talking Face Generation by Adversarially Disentangled Audio-Visual Representation )

Leaderboards

TREND DATASET BEST METHOD PAPER TITLE PAPER CODE COMPARE

Greatest papers with code

Talking Face Generation by Adversarially Disentangled Audio-Visual Representation

20 Jul 2018Hangz-nju-cuhk/Talking-Face-Generation-DAVS

Talking face generation aims to synthesize a sequence of face images that correspond to a clip of speech.

TALKING FACE GENERATION VIDEO RETRIEVAL

Capture, Learning, and Synthesis of 3D Speaking Styles

CVPR 2019 TimoBolkart/voca

To address this, we introduce a unique 4D face dataset with about 29 minutes of 4D scans captured at 60 fps and synchronized audio from 12 speakers.

3D FACE ANIMATION TALKING FACE GENERATION

Hierarchical Cross-Modal Talking Face Generation With Dynamic Pixel-Wise Loss

CVPR 2019 lelechen63/ATVGnet

We devise a cascade GAN approach to generate talking face video, which is robust to different face shapes, view angles, facial characteristics, and noisy audio conditions.

TALKING FACE GENERATION

Towards Automatic Face-to-Face Translation

ACM Multimedia, 2019 2019 Rudrabha/LipGAN

In light of the recent breakthroughs in automatic machine translation systems, we propose a novel approach that we term as "Face-to-Face Translation".

 SOTA for Talking Face Generation on LRW (using extra training data)

FACE TO FACE TRANSLATION MACHINE TRANSLATION TALKING FACE GENERATION