no code implementations • 18 Sep 2023 • Zheng-Yan Sheng, Yang Ai, Yan-Nian Chen, Zhen-Hua Ling
This paper presents a novel task, zero-shot voice conversion based on face images (zero-shot FaceVC), which aims to convert the voice characteristics of an utterance from any source speaker to those of a previously unseen target speaker, relying solely on a single face image of the target speaker.
no code implementations • 9 May 2023 • Zheng-Yan Sheng, Yang Ai, Zhen-Hua Ling
In this paper, we propose a zero-shot personalized Lip2Speech synthesis method, in which face images control speaker identities.