Talking Head Generation

40 papers with code • 7 benchmarks • 3 datasets

Talking head generation is the task of generating a talking face from a set of images of a person.

( Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models )

MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation

KU-CVLAB/MoDiTalker 28 Mar 2024

AToM excels in capturing subtle lip movements by leveraging an audio attention mechanism.

89
28 Mar 2024

Adaptive Super Resolution For One-Shot Talking-Head Generation

songluchuan/adasr-talkinghead 23 Mar 2024

In this work, we propose an adaptive high-quality talking-head video generation method, which synthesizes high-resolution video without additional pre-trained modules.

120
23 Mar 2024

A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos

zwx8981/adth-qa 11 Mar 2024

However, performance evaluation research lags behind the development of talking head generation techniques.

5
11 Mar 2024

SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis

ZiqiaoPeng/SyncTalk 29 Nov 2023

A lifelike talking head requires synchronized coordination of subject identity, lip movements, facial expressions, and head poses.

764
29 Nov 2023

Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation

yuangan/eat_code ICCV 2023

Audio-driven talking-head synthesis is a popular research topic for virtual human-related applications.

199
10 Sep 2023

Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head Generation

zhichaowang970201/text-to-video 12 Aug 2023

In the second stage, an audio-driven talking head generation method is employed to produce compelling videos privided the audio generated in the first stage.

5
12 Aug 2023

Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation

harlanhong/iccv2023-mcnet ICCV 2023

Talking head video generation aims to animate a human face in a still image with dynamic poses and expressions using motion information derived from a target-driving video, while maintaining the person's identity in the source image.

212
19 Jul 2023

A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation

louisbearing/hmo-audio 4 Jul 2023

Animating still face images with deep generative models using a speech input signal is an active research topic and has seen important recent progress.

3
04 Jul 2023

Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation

fedenoce/s2l-s2d 2 Jun 2023

This paper presents a novel approach for generating 3D talking heads from raw audio inputs.

54
02 Jun 2023

RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars

renderme-360/renderme-360 NeurIPS 2023

It is a large-scale digital library for head avatars with three key attributes: 1) High Fidelity: all subjects are captured by 60 synchronized, high-resolution 2K cameras in 360 degrees.

213
22 May 2023