no code implementations • 31 Dec 2023 • Wan-Duo Kurt Ma, J. P. Lewis, W. Bastiaan Kleijn
Within recent approaches to text-to-video (T2V) generation, achieving controllability in the synthesized video is often a challenge.
no code implementations • 26 Sep 2023 • Wan-Duo Kurt Ma, Muhammad Ghifary, J. P. Lewis, Byungkuk Choi, Haekwang Eom
This paper describes FDLS (Facial Deep Learning Solver), which is Weta Digital's solution to these challenges.
no code implementations • 25 Feb 2023 • Wan-Duo Kurt Ma, J. P. Lewis, Avisek Lahiri, Thomas Leung, W. Bastiaan Kleijn
Text-guided diffusion models such as DALLE-2, Imagen, eDiff-I, and Stable Diffusion are able to generate an effectively endless variety of images given only a short text prompt describing the desired image content.
3 code implementations • 5 Aug 2019 • Wan-Duo Kurt Ma, J. P. Lewis, W. Bastiaan Kleijn
We introduce the HSIC (Hilbert-Schmidt independence criterion) bottleneck for training deep neural networks.