no code implementations • 28 Aug 2024 • Nikita Kister, István Sárándi, Anna Khoreva, Gerard Pons-Moll
The estimation of 3D human poses from images has progressed tremendously over the last few years as measured on standard benchmarks.
no code implementations • 3 Jul 2024 • Ugur Ali Kaplan, Margret Keuper, Anna Khoreva, Dan Zhang, Yumeng Li
Foundation models (FMs) have revolutionized computer vision, enabling effective learning across different domains.
no code implementations • 1 Jul 2024 • Jiayi Wang, Kevin Alexander Laube, Yumeng Li, Jan Hendrik Metzen, Shin-I Cheng, Julio Borges, Anna Khoreva
Neural layouts are advantageous as they provide rich descriptions of the desired image, containing both semantics and detailed geometry of the scene.
1 code implementation • 30 May 2024 • Massimo Bini, Karsten Roth, Zeynep Akata, Anna Khoreva
Parameter-efficient finetuning (PEFT) has become ubiquitous to adapt foundation models to downstream task requirements while retaining their generalization ability.
1 code implementation • 20 Mar 2024 • Yumeng Li, William Beluch, Margret Keuper, Dan Zhang, Anna Khoreva
Despite tremendous progress in the field of text-to-video (T2V) synthesis, open-sourced T2V diffusion models struggle to generate longer videos with dynamically varying and evolving content.
1 code implementation • 16 Jan 2024 • Yumeng Li, Margret Keuper, Dan Zhang, Anna Khoreva
Current L2I models either suffer from poor editability via text or weak alignment between the generated image and the input layout.
1 code implementation • 20 Jul 2023 • Yumeng Li, Margret Keuper, Dan Zhang, Anna Khoreva
To address the challenges posed by complex prompts or scenarios involving multiple entities and to achieve improved attribute binding, we propose Divide & Bind.
1 code implementation • 2 Jul 2023 • Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva
Using the proposed masked noise encoder to randomize style and content combinations in the training set, i. e., intra-source style augmentation (ISSA) effectively increases the diversity of training data and reduces spurious correlation.
no code implementations • 2 Dec 2022 • Edgar Schönfeld, Julio Borges, Vadim Sushko, Bernt Schiele, Anna Khoreva
Prior work has extensively studied the latent space structure of GANs for unconditional image synthesis, enabling global editing of generated images by the unsupervised discovery of interpretable latent directions.
1 code implementation • 18 Oct 2022 • Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva
Using the proposed masked noise encoder to randomize style and content combinations in the training set, ISSA effectively increases the diversity of training data and reduces spurious correlation.
1 code implementation • 15 Sep 2022 • Vadim Sushko, Dan Zhang, Juergen Gall, Anna Khoreva
To this end, inspired by the recent architectural developments of single-image GANs, we introduce our OSMIS model which enables the synthesis of segmentation masks that are precisely aligned to the generated images in the one-shot regime.
1 code implementation • 12 May 2021 • Vadim Sushko, Juergen Gall, Anna Khoreva
Training GANs in low-data regimes remains a challenge, as overfitting often leads to memorization or training divergence.
1 code implementation • 24 Mar 2021 • Vadim Sushko, Dan Zhang, Juergen Gall, Anna Khoreva
In this work, we introduce SIV-GAN, an unconditional generative model that can generate new scene compositions from a single training image or a single video clip.
1 code implementation • ICLR 2021 • Vadim Sushko, Edgar Schönfeld, Dan Zhang, Juergen Gall, Bernt Schiele, Anna Khoreva
By providing stronger supervision to the discriminator as well as to the generator through spatially- and semantically-aware discriminator feedback, we are able to synthesize images of higher fidelity with better alignment to their input label maps, making the use of the perceptual loss superfluous.
no code implementations • 25 Nov 2020 • Prateek Katiyar, Anna Khoreva
We therefore propose to improve the established semantic image synthesis evaluation scheme by analyzing separately the performance of generated images on the biased and unbiased classes for the given segmentation network.
Ranked #5 on
Image-to-Image Translation
on ADE20K Labels-to-Photos
(Accuracy metric)
3 code implementations • 28 Feb 2020 • Edgar Schönfeld, Bernt Schiele, Anna Khoreva
The novel discriminator improves over the state of the art in terms of the standard distribution and image quality metrics, enabling the generator to synthesize images with varying structure, appearance and levels of detail, maintaining global and local realism.
Ranked #1 on
Image Generation
on CelebA 128x128
2 code implementations • NeurIPS 2019 • Lukas Hoyer, Mauricio Munoz, Prateek Katiyar, Anna Khoreva, Volker Fischer
Recently, there has been a growing interest in developing saliency methods that provide visual explanations of network predictions.
no code implementations • 21 Mar 2019 • Lukas Hoyer, Patrick Kesper, Anna Khoreva, Volker Fischer
An environment representation (ER) is a substantial part of every autonomous system.
1 code implementation • NeurIPS 2019 • Dan Zhang, Anna Khoreva
Training of Generative Adversarial Networks (GANs) is notoriously fragile, requiring to maintain a careful balance between the generator and the discriminator in order to perform well.
Ranked #3 on
Image Generation
on CelebA-HQ 128x128
no code implementations • 27 Sep 2018 • Dan Zhang, Anna Khoreva
Despite recent progress, Generative Adversarial Networks (GANs) still suffer from training instability, requiring careful consideration of architecture design choices and hyper-parameter tuning.
no code implementations • 21 Apr 2018 • Mihai Fieraru, Anna Khoreva, Leonid Pishchulin, Bernt Schiele
Multi-person pose estimation in images and videos is an important yet challenging task with many applications.
no code implementations • 21 Mar 2018 • Anna Khoreva, Anna Rohrbach, Bernt Schiele
We show that our language-supervised approach performs on par with the methods which have access to a pixel-level mask of the target object on DAVIS'16 and is competitive to methods using scribbles on the challenging DAVIS'17 dataset.
Ranked #1 on
Video Object Segmentation
on DAVIS 2017
(mIoU metric)
4 code implementations • 28 Mar 2017 • Anna Khoreva, Rodrigo Benenson, Eddy Ilg, Thomas Brox, Bernt Schiele
Our approach is suitable for both single and multiple object segmentation.
no code implementations • CVPR 2017 • Seong Joon Oh, Rodrigo Benenson, Anna Khoreva, Zeynep Akata, Mario Fritz, Bernt Schiele
We show how to combine both information sources in order to recover 80% of the fully supervised performance - which is the new state of the art in weakly supervised training for pixel-wise semantic labelling.
Ranked #26 on
Semantic Segmentation
on PASCAL VOC 2012 val
2 code implementations • CVPR 2017 • Anna Khoreva, Federico Perazzi, Rodrigo Benenson, Bernt Schiele, Alexander Sorkine-Hornung
Inspired by recent advances of deep learning in instance segmentation and object tracking, we introduce video object segmentation problem as a concept of guided instance segmentation.
Ranked #6 on
Semi-Supervised Video Object Segmentation
on YouTube
no code implementations • 12 May 2016 • Anna Khoreva, Rodrigo Benenson, Fabio Galasso, Matthias Hein, Bernt Schiele
Graph-based video segmentation methods rely on superpixels as starting point.
no code implementations • CVPR 2017 • Anna Khoreva, Rodrigo Benenson, Jan Hosang, Matthias Hein, Bernt Schiele
Semantic labelling and instance segmentation are two tasks that require particularly costly annotations.
Ranked #1 on
Semantic Segmentation
on PASCAL VOC 2012 val
(Mean IoU metric)
no code implementations • CVPR 2016 • Anna Khoreva, Rodrigo Benenson, Mohamed Omran, Matthias Hein, Bernt Schiele
State-of-the-art learning based boundary detection methods require extensive training data.
Ranked #2 on
Edge Detection
on SBD
no code implementations • CVPR 2015 • Anna Khoreva, Fabio Galasso, Matthias Hein, Bernt Schiele
Video segmentation has become an important and active research area with a large diversity of proposed approaches.