1 code implementation • Research Square Preprint 2023 • Jason Abohwo
In recent years, generative models have made impressive advancements towards realistic output; in particular, models working in the modalities of text and audio have reached a level of quality at which generated text or audio cannot be easily distinguished from real text or audio Despite these revolutionary advancements, the synthesis of realistic and temporally consistent videos is still in its inception.
Ranked #1 on Text-to-Video Generation on UCF-101