Talking Head Generation
40 papers with code • 7 benchmarks • 3 datasets
Talking head generation is the task of generating a talking face from a set of images of a person, typically driven by a signal such as audio, text, or a driving video.
(Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models)
Most implemented papers
Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation
To be specific, our framework consists of a speaker-independent stage and a speaker-specific stage.
Txt2Vid: Ultra-Low Bitrate Compression of Talking-Head Videos via Text
Video represents the majority of internet traffic today, driving a continual race between the generation of higher-quality content, the transmission of ever-larger files, and the development of network infrastructure.
Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion
As this keypoint based representation models the motions of facial regions, head, and backgrounds integrally, our method can better constrain the spatial and temporal consistency of the generated videos.
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation
The first stage is a deep neural network that extracts deep audio features along with a manifold projection to project the features to the target person's speech space.
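As a rough illustration of this idea, a manifold projection can pull an arbitrary audio feature back onto the target speaker's feature space by replacing it with a combination of its nearest neighbours in a database of features extracted from that speaker's own speech. The function name, uniform neighbour weighting, and toy data below are illustrative assumptions, not the paper's implementation:

```python
import numpy as np

def manifold_project(query, database, k=8):
    """Hypothetical sketch: project an audio feature onto the target
    speaker's feature manifold by averaging its k nearest neighbours
    from a database of that speaker's own audio features."""
    dists = np.linalg.norm(database - query, axis=1)
    idx = np.argsort(dists)[:k]
    # Uniform weighting for simplicity; barycentric (LLE-style)
    # weights are a common alternative.
    return database[idx].mean(axis=0)

rng = np.random.default_rng(0)
db = rng.normal(size=(100, 16))   # stand-in for target-speaker features
q = rng.normal(size=16) * 3.0     # out-of-distribution query feature
proj = manifold_project(q, db, k=8)
```

The projected feature always lies inside the target speaker's observed feature distribution, which is what keeps the downstream animation speaker-specific.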
AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment
We present a novel Animation CelebHeads dataset (AnimeCeleb) to address animation head reenactment.
AI-generated characters for supporting personalized learning and well-being
Advancements in machine learning have recently enabled the hyper-realistic synthesis of prose, images, audio and video data, in what is referred to as artificial intelligence (AI)-generated media.
Depth-Aware Generative Adversarial Network for Talking Head Video Generation
The depth is further utilized to learn dense 3D-aware cross-modal (i.e., appearance and depth) attention that guides the generation of motion fields for warping source-image representations.
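The cross-modal attention described above can be sketched as standard scaled dot-product attention in which appearance features form the queries and depth features form the keys and values, so that geometry steers where the motion-field generator looks. This is a minimal NumPy sketch under that assumption (learned projection matrices are omitted), not the paper's actual architecture:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))  # numerically stable
    return e / e.sum(axis=axis, keepdims=True)

def cross_modal_attention(appearance, depth):
    """Hypothetical sketch of 3D-aware cross-modal attention.

    appearance, depth: (N, d) flattened per-pixel feature maps.
    Appearance queries attend over depth-derived keys/values, so the
    output mixes geometric cues into each appearance location.
    """
    d = appearance.shape[-1]
    q = appearance        # queries from appearance features
    k = v = depth         # keys/values from the predicted depth map
    attn = softmax(q @ k.T / np.sqrt(d))
    return attn @ v

rng = np.random.default_rng(0)
app = rng.normal(size=(64, 32))
dep = rng.normal(size=(64, 32))
out = cross_modal_attention(app, dep)
```

Each output row is a convex combination of depth features, weighted by how strongly the corresponding appearance location matches each depth location.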
Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer
This paper reports our solution for ACM Multimedia ViCo 2022 Conversational Head Generation Challenge, which aims to generate vivid face-to-face conversation videos based on audio and reference images.
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis
Thus the facial radiance field can be flexibly adjusted to the new identity with few reference images.
Compressing Video Calls using Synthetic Talking Heads
We use a state-of-the-art face reenactment network to detect key points in the non-pivot frames and transmit them to the receiver.
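The bandwidth saving in such a scheme comes from sending occasional full "pivot" frames and only a handful of facial keypoints for every other frame, which the receiver uses to warp the last pivot with a reenactment network. The toy encoder below (function names, keypoint count, and precision are illustrative assumptions) shows the size asymmetry between the two payload types:

```python
import numpy as np

def encode_frame(frame, detect_keypoints, is_pivot):
    """Hypothetical sketch of a keypoint-based call encoder.

    Pivot frames are sent in full; for non-pivot frames only a small
    set of facial keypoints is transmitted, and the receiver
    reconstructs the frame by reenacting the last pivot.
    """
    if is_pivot:
        return ("pivot", frame.astype(np.uint8).tobytes())
    kps = detect_keypoints(frame).astype(np.float16)  # e.g. 10 (x, y) points
    return ("keypoints", kps.tobytes())

# Toy demo: a 256x256 RGB frame vs. 10 two-dimensional keypoints.
frame = np.zeros((256, 256, 3), dtype=np.uint8)
detect = lambda f: np.zeros((10, 2), dtype=np.float32)  # stand-in detector
_, full_payload = encode_frame(frame, detect, is_pivot=True)
_, kp_payload = encode_frame(frame, detect, is_pivot=False)
```

Even before conventional video coding, the keypoint payload is several orders of magnitude smaller than the raw frame, which is where the compression gain originates.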