Lip to Speech Synthesis

Given a silent video of a speaker, generate speech that matches the speaker's lip movements.


Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis

Rudrabha/Lip2Wav CVPR 2020

In this work, we explore the task of lip to speech synthesis, i.e., learning to generate natural speech given only the lip movements of a speaker.

Lip to Speech Synthesis with Visual Context Attentional GAN

ms-dot-k/Visual-Context-Attentional-GAN NeurIPS 2021

In this paper, we propose a novel lip-to-speech generative adversarial network, Visual Context Attentional GAN (VCA-GAN), which can jointly model local and global lip movements during speech synthesis.

Show Me Your Face, And I'll Tell You How You Speak

chris10m/lip2speech 28 Jun 2022

When we speak, the prosody and content of the speech can be inferred from the movement of our lips.