1 code implementation • 4 Jun 2024 • Tero Karras, Miika Aittala, Tuomas Kynkäänniemi, Jaakko Lehtinen, Timo Aila, Samuli Laine
The primary axes of interest in image-generating diffusion models are image quality, the amount of variation in the results, and how well the results align with a given condition, e. g., a class label or a text prompt.
Ranked #1 on
Image Generation
on ImageNet 512x512
1 code implementation • 11 Apr 2024 • Tuomas Kynkäänniemi, Miika Aittala, Tero Karras, Samuli Laine, Timo Aila, Jaakko Lehtinen
We show that guidance is clearly harmful toward the beginning of the chain (high noise levels), largely unnecessary toward the end (low noise levels), and only beneficial in the middle.
Ranked #6 on
Image Generation
on ImageNet 512x512
6 code implementations • CVPR 2024 • Tero Karras, Miika Aittala, Jaakko Lehtinen, Janne Hellsten, Timo Aila, Samuli Laine
Diffusion models currently dominate the field of data-driven image synthesis with their unparalleled scaling to large datasets.
Ranked #17 on
Image Generation
on ImageNet 512x512
no code implementations • 14 Dec 2022 • Onni Kosomaa, Samuli Laine, Tero Karras, Miika Aittala, Jaakko Lehtinen
We propose a deep learning method for 3D volumetric reconstruction in low-dose helical cone-beam computed tomography.
1 code implementation • 27 Oct 2022 • Eloi Moliner, Jaakko Lehtinen, Vesa Välimäki
This paper presents CQT-Diff, a data-driven generative audio model that can, once trained, be used for solving various different audio inverse problems in a problem-agnostic setting.
1 code implementation • 4 Jul 2022 • Erik Härkönen, Miika Aittala, Tuomas Kynkäänniemi, Samuli Laine, Timo Aila, Jaakko Lehtinen
We introduce the problem of disentangling time-lapse sequences in a way that allows separate, after-the-fact control of overall trends, cyclic effects, and random effects in the images, and describe a technique based on data-driven generative models that achieves this goal.
1 code implementation • 7 Jun 2022 • Tim Brooks, Janne Hellsten, Miika Aittala, Ting-Chun Wang, Timo Aila, Jaakko Lehtinen, Ming-Yu Liu, Alexei A. Efros, Tero Karras
Existing video generation methods often fail to produce new content as a function of time while maintaining consistencies expected in real environments, such as plausible dynamics and object persistence.
2 code implementations • 11 Mar 2022 • Tuomas Kynkäänniemi, Tero Karras, Miika Aittala, Timo Aila, Jaakko Lehtinen
Fr\'echet Inception Distance (FID) is the primary metric for ranking models in data-driven generative modeling.
7 code implementations • NeurIPS 2021 • Tero Karras, Miika Aittala, Samuli Laine, Erik Härkönen, Janne Hellsten, Jaakko Lehtinen, Timo Aila
We observe that despite their hierarchical convolutional nature, the synthesis process of typical generative adversarial networks depends on absolute pixel coordinates in an unhealthy manner.
Ranked #1 on
Image Generation
on FFHQ-U
1 code implementation • 6 Nov 2020 • Samuli Laine, Janne Hellsten, Tero Karras, Yeongho Seol, Jaakko Lehtinen, Timo Aila
We present a modular differentiable renderer design that yields performance superior to previous methods by leveraging existing, highly optimized hardware graphics pipelines.
28 code implementations • NeurIPS 2020 • Tero Karras, Miika Aittala, Janne Hellsten, Samuli Laine, Jaakko Lehtinen, Timo Aila
We also find that the widely used CIFAR-10 is, in fact, a limited data benchmark, and improve the record FID from 5. 59 to 2. 42.
Ranked #1 on
Conditional Image Generation
on ArtBench-10 (32x32)
2 code implementations • NeurIPS 2020 • Erik Härkönen, Aaron Hertzmann, Jaakko Lehtinen, Sylvain Paris
This paper describes a simple technique to analyze Generative Adversarial Networks (GANs) and create interpretable controls for image synthesis, such as change of viewpoint, aging, lighting, and time of day.
126 code implementations • CVPR 2020 • Tero Karras, Samuli Laine, Miika Aittala, Janne Hellsten, Jaakko Lehtinen, Timo Aila
Overall, our improved model redefines the state of the art in unconditional image modeling, both in terms of existing distribution quality metrics as well as perceived image quality.
Ranked #1 on
Image Generation
on LSUN Car 256 x 256
1 code implementation • NeurIPS 2019 • Wenzheng Chen, Jun Gao, Huan Ling, Edward J. Smith, Jaakko Lehtinen, Alec Jacobson, Sanja Fidler
Many machine learning models operate on images, but ignore the fact that images are 2D projections formed by 3D geometry interacting with light, in a process called rendering.
Ranked #4 on
Single-View 3D Reconstruction
on ShapeNet
1 code implementation • 10 Jun 2019 • Markus Kettunen, Erik Härkönen, Jaakko Lehtinen
It has been recently shown that the hidden variables of convolutional neural networks make for an efficient perceptual similarity metric that accurately predicts human judgment on relative image similarity assessment.
10 code implementations • ICCV 2019 • Ming-Yu Liu, Xun Huang, Arun Mallya, Tero Karras, Timo Aila, Jaakko Lehtinen, Jan Kautz
Unsupervised image-to-image translation methods learn to map images in a given class to an analogous image in a different class, drawing on unstructured (non-registered) datasets of images.
10 code implementations • NeurIPS 2019 • Tuomas Kynkäänniemi, Tero Karras, Samuli Laine, Jaakko Lehtinen, Timo Aila
The ability to automatically estimate the quality and coverage of the samples produced by a generative model is a vital requirement for driving algorithm research.
Ranked #4 on
Image Generation
on FFHQ
no code implementations • ICLR Workshop LLD 2019 • Samuli Laine, Jaakko Lehtinen, Timo Aila
We describe techniques for training high-quality image denoising models that require only single instances of corrupted images as training data.
2 code implementations • NeurIPS 2019 • Samuli Laine, Tero Karras, Jaakko Lehtinen, Timo Aila
We describe a novel method for training high-quality image denoising models based on unorganized collections of corrupted images.
1 code implementation • 5 Oct 2018 • Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, Jaakko Lehtinen
Proximal Policy Optimization (PPO) is a highly popular model-free reinforcement learning (RL) approach.
1 code implementation • SIGGRAPH 2018 • Tzu-Mao Li, Miika Aittala, Frédo Durand, Jaakko Lehtinen
We introduce a general-purpose differentiable ray tracer, which, to our knowledge, is the first comprehensive solution that is able to compute derivatives of scalar functions over a rendered image with respect to arbitrary scene parameters such as camera pose, scene geometry, materials, and lighting parameters.
21 code implementations • ICML 2018 • Jaakko Lehtinen, Jacob Munkberg, Jon Hasselgren, Samuli Laine, Tero Karras, Miika Aittala, Timo Aila
We apply basic statistical reasoning to signal reconstruction by machine learning -- learning to map corrupted observations to clean signals -- with a simple and powerful conclusion: it is possible to learn to restore images by only looking at corrupted examples, at performance at and sometimes exceeding training using clean data, without explicit image priors or likelihood models of the corruption.
114 code implementations • ICLR 2018 • Tero Karras, Timo Aila, Samuli Laine, Jaakko Lehtinen
We describe a new training methodology for generative adversarial networks.
Ranked #4 on
Image Generation
on LSUN Horse 256 x 256
(Clean-FID (trainfull) metric)
no code implementations • SIGGRAPH 2017 • Tero Karras, Timo Aila, Samuli Laine, Antti Herva, Jaakko Lehtinen
Our deep neural network learns a mapping from input waveforms to the 3D vertex coordinates of a face model, and simultaneously discovers a compact, latent code that disambiguates the variations in facial expression that cannot be explained by the audio alone.
1 code implementation • 21 Sep 2016 • Samuli Laine, Tero Karras, Timo Aila, Antti Herva, Shunsuke Saito, Ronald Yu, Hao Li, Jaakko Lehtinen
We present a real-time deep learning framework for video-based facial performance capture -- the dense 3D tracking of an actor's face given a monocular video.