TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Unconstrained Lip-synchronization	LRS2	Wav2Lip	LSE-D	6.386	# 3
Unconstrained Lip-synchronization	LRS2	Wav2Lip	LSE-C	7.781	# 1
Unconstrained Lip-synchronization	LRS2	Wav2Lip	FID	4.887	# 3
Unconstrained Lip-synchronization	LRS2	Wav2Lip + GAN	LSE-D	6.469	# 2
Unconstrained Lip-synchronization	LRS2	Wav2Lip + GAN	FID	4.446	# 2
Unconstrained Lip-synchronization	LRS3	Wav2Lip + GAN	LSE-D	6.986	# 1
Unconstrained Lip-synchronization	LRS3	Wav2Lip + GAN	LSE-C	7.574	# 2
Unconstrained Lip-synchronization	LRS3	Wav2Lip + GAN	FID	4.35	# 1
Unconstrained Lip-synchronization	LRS3	Wav2Lip	LSE-D	6.652	# 2
Unconstrained Lip-synchronization	LRS3	Wav2Lip	LSE-C	7.887	# 1
Unconstrained Lip-synchronization	LRS3	Wav2Lip	FID	4.844	# 2
Unconstrained Lip-synchronization	LRW	Wav2Lip + GAN	LSE-D	6.774	# 1
Unconstrained Lip-synchronization	LRW	Wav2Lip + GAN	LSE-C	7.263	# 2
Unconstrained Lip-synchronization	LRW	Wav2Lip + GAN	FID	2.475	# 1
Unconstrained Lip-synchronization	LRW	Wav2Lip	LSE-D	6.512	# 2
Unconstrained Lip-synchronization	LRW	Wav2Lip	LSE-C	7.49	# 1
Unconstrained Lip-synchronization	LRW	Wav2Lip	FID	3.189	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-lip-sync-expert-is-all-you-need-for-speech/lip-sync-on-lrs3)](https://paperswithcode.com/sota/lip-sync-on-lrs3?p=a-lip-sync-expert-is-all-you-need-for-speech)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-lip-sync-expert-is-all-you-need-for-speech/lip-sync-on-lrw)](https://paperswithcode.com/sota/lip-sync-on-lrw?p=a-lip-sync-expert-is-all-you-need-for-speech)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-lip-sync-expert-is-all-you-need-for-speech/lip-sync-on-lrs2)](https://paperswithcode.com/sota/lip-sync-on-lrs2?p=a-lip-sync-expert-is-all-you-need-for-speech)`

A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild

23 Aug 2020 · K R Prajwal, Rudrabha Mukhopadhyay, Vinay Namboodiri, C. V. Jawahar ·

In this work, we investigate the problem of lip-syncing a talking face video of an arbitrary identity to match a target speech segment. Current works excel at producing accurate lip movements on a static image or videos of specific people seen during the training phase. However, they fail to accurately morph the lip movements of arbitrary identities in dynamic, unconstrained talking face videos, resulting in significant parts of the video being out-of-sync with the new audio. We identify key reasons pertaining to this and hence resolve them by learning from a powerful lip-sync discriminator. Next, we propose new, rigorous evaluation benchmarks and metrics to accurately measure lip synchronization in unconstrained videos. Extensive quantitative evaluations on our challenging benchmarks show that the lip-sync accuracy of the videos generated by our Wav2Lip model is almost as good as real synced videos. We provide a demo video clearly showing the substantial impact of our Wav2Lip model and evaluation benchmarks on our website: \url{cvit.iiit.ac.in/research/projects/cvit-projects/a-lip-sync-expert-is-all-you-need-for-speech-to-lip-generation-in-the-wild}. The code and models are released at this GitHub repository: \url{github.com/Rudrabha/Wav2Lip}. You can also try out the interactive demo at this link: \url{bhaasha.iiit.ac.in/lipsync}.

PDF Abstract

Code

Add Remove Mark official

Rudrabha/Wav2Lip official

↳ Quickstart in

Colab

9,244

mowshon/lipsync

PrashanthaTP/wav2mov

rockstar-0000/lip_sync_test

↳ Quickstart in

Colab

Tasks

Add Remove

MORPH

Talking Face Generation

Talking Head Generation

Unconstrained Lip-synchronization

Datasets

LRW

LRS2

Results from the Paper

Edit

Ranked #1 on Unconstrained Lip-synchronization on LRS3 (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Unconstrained Lip-synchronization	LRS2	Wav2Lip	LSE-D	6.386	# 3	Compare
			LSE-C	7.781	# 1	Compare
			FID	4.887	# 3	Compare
Unconstrained Lip-synchronization	LRS2	Wav2Lip + GAN	LSE-D	6.469	# 2	Compare
Unconstrained Lip-synchronization	LRS2	Wav2Lip + GAN	FID	4.446	# 2	Compare
Unconstrained Lip-synchronization	LRS3	Wav2Lip + GAN	LSE-D	6.986	# 1	Compare
			LSE-C	7.574	# 2	Compare
			FID	4.35	# 1	Compare
Unconstrained Lip-synchronization	LRS3	Wav2Lip	LSE-D	6.652	# 2	Compare
			LSE-C	7.887	# 1	Compare
			FID	4.844	# 2	Compare
Unconstrained Lip-synchronization	LRW	Wav2Lip + GAN	LSE-D	6.774	# 1	Compare
			LSE-C	7.263	# 2	Compare
			FID	2.475	# 1	Compare
Unconstrained Lip-synchronization	LRW	Wav2Lip	LSE-D	6.512	# 2	Compare
			LSE-C	7.49	# 1	Compare
			FID	3.189	# 2	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove