Talking Head Generation

40 papers with code • 7 benchmarks • 3 datasets

Talking head generation is the task of generating a talking face from a set of images of a person.

(Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models)

Benchmarks

These leaderboards track progress in talking head generation.

Dataset	Best Model
VoxCeleb2 - 1-shot learning	Fast Bi-layer Avatars (medium size)
VoxCeleb1 - 1-shot learning	Few-shot Adversarial Model
VoxCeleb1 - 8-shot learning	Few-shot Adversarial Model
VoxCeleb1 - 32-shot learning	Few-shot Adversarial Model
VoxCeleb2 - 8-shot learning	CainGAN
VoxCeleb2 - 32-shot learning	Few-shot Adversarial Model
100 sleep nights of 8 caregivers	Ashok

Datasets

Subtasks

Unconstrained Lip-synchronization

Latest papers

MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation

KU-CVLAB/MoDiTalker • 28 Mar 2024

AToM excels in capturing subtle lip movements by leveraging an audio attention mechanism.

Adaptive Super Resolution For One-Shot Talking-Head Generation

songluchuan/adasr-talkinghead • 23 Mar 2024

In this work, we propose an adaptive high-quality talking-head video generation method, which synthesizes high-resolution video without additional pre-trained modules.

A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos

zwx8981/adth-qa • 11 Mar 2024

However, performance evaluation research lags behind the development of talking head generation techniques.

SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis

ZiqiaoPeng/SyncTalk • 29 Nov 2023

A lifelike talking head requires synchronized coordination of subject identity, lip movements, facial expressions, and head poses.

Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation

yuangan/eat_code • ICCV 2023

Audio-driven talking-head synthesis is a popular research topic for virtual human-related applications.

10 Sep 2023

Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head Generation

zhichaowang970201/text-to-video • 12 Aug 2023

In the second stage, an audio-driven talking head generation method is employed to produce compelling videos given the audio generated in the first stage.

Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation

harlanhong/iccv2023-mcnet • ICCV 2023

Talking head video generation aims to animate a human face in a still image with dynamic poses and expressions using motion information derived from a target-driving video, while maintaining the person's identity in the source image.

19 Jul 2023

A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation

louisbearing/hmo-audio • 4 Jul 2023

Animating still face images with deep generative models using a speech input signal is an active research topic and has seen important recent progress.

Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation

fedenoce/s2l-s2d • 2 Jun 2023

This paper presents a novel approach for generating 3D talking heads from raw audio inputs.

RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars

renderme-360/renderme-360 • NeurIPS 2023

It is a large-scale digital library for head avatars with three key attributes: 1) High Fidelity: all subjects are captured by 60 synchronized, high-resolution 2K cameras in 360 degrees.

22 May 2023
