Lip Sync Multimodal Video Data

Voice and matching lip language video filmed with 250 people by multi-devices simultaneously, aligned precisely by pulse signal, with high accuracy. It can be used in multi-modal learning algorithms research in speech and image fields.

Papers


Paper Code Results Date Stars

Dataset Loaders


Tasks


Modalities


Languages