Search Results for author: Stanislav Frolov

Found 25 papers, 11 papers with code

Spherical Dense Text-to-Image Synthesis

no code implementations18 Feb 2025 Timon Winter, Stanislav Frolov, Brian Bernhard Moser, Andreas Dengel

Specifically, we propose MultiStitchDiffusion (MSTD) and MultiPanFusion (MPF) by integrating MultiDiffusion into StitchDiffusion and PanFusion, respectively.

Image Generation

Multi-Label Scene Classification in Remote Sensing Benefits from Image Super-Resolution

no code implementations12 Jan 2025 Ashitha Mudraje, Brian B. Moser, Stanislav Frolov, Andreas Dengel

Satellite imagery is a cornerstone for numerous Remote Sensing (RS) applications; however, limited spatial resolution frequently hinders the precision of such systems, especially in multi-label scene classification tasks as it requires a higher level of detail and feature differentiation.

Classification Image Super-Resolution +1

TKG-DM: Training-free Chroma Key Content Generation Diffusion Model

1 code implementation23 Nov 2024 Ryugo Morita, Stanislav Frolov, Brian Bernhard Moser, Takahiro Shirakawa, Ko Watanabe, Andreas Dengel, Jinjia Zhou

To address this limitation, we present a novel Training-Free Chroma Key Content Generation Diffusion Model (TKG-DM), which optimizes the initial random noise to produce images with foreground objects on a specifiable color background.

Just Leaf It: Accelerating Diffusion Classifiers with Hierarchical Class Pruning

1 code implementation18 Nov 2024 Arundhati S. Shanbhag, Brian B. Moser, Tobias C. Nauen, Stanislav Frolov, Federico Raue, Andreas Dengel

Diffusion models, known for their generative capabilities, have recently shown unexpected potential in image classification tasks by using Bayes' theorem.

Classification Image Classification

Distill the Best, Ignore the Rest: Improving Dataset Distillation with Loss-Value-Based Pruning

1 code implementation18 Nov 2024 Brian B. Moser, Federico Raue, Tobias C. Nauen, Stanislav Frolov, Andreas Dengel

Dataset distillation has gained significant interest in recent years, yet existing approaches typically distill from the entire dataset, potentially including non-beneficial samples.

Dataset Distillation

Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution

1 code implementation18 Nov 2024 Brian B. Moser, Stanislav Frolov, Tobias C. Nauen, Federico Raue, Andreas Dengel

Large-scale, pre-trained Text-to-Image (T2I) diffusion models have gained significant popularity in image generation tasks and have shown unexpected potential in image Super-Resolution (SR).

2k 4k +3

A Low-Resolution Image is Worth 1x1 Words: Enabling Fine Image Super-Resolution with Transformers and TaylorShift

no code implementations15 Nov 2024 Sanath Budakegowdanadoddi Nagaraju, Brian Bernhard Moser, Tobias Christian Nauen, Stanislav Frolov, Federico Raue, Andreas Dengel

Transformer-based Super-Resolution (SR) models have recently advanced image reconstruction quality, yet challenges remain due to computational complexity and an over-reliance on large patch sizes, which constrain fine-grained detail enhancement.

Image Reconstruction Image Super-Resolution

SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time

1 code implementation22 Jul 2024 Stanislav Frolov, Brian B. Moser, Andreas Dengel

Generating high-resolution images with generative models has recently been made widely accessible by leveraging diffusion models pre-trained on large-scale datasets.

Computational Efficiency Denoising +1

Federated Learning for Blind Image Super-Resolution

no code implementations26 Apr 2024 Brian B. Moser, Ahmed Anwar, Federico Raue, Stanislav Frolov, Andreas Dengel

This distinction is crucial as it circumvents the need to precisely model real-world degradations, which limits contemporary blind image SR research.

Federated Learning Image Super-Resolution

ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation

no code implementations11 Apr 2024 Stanislav Frolov, Brian B. Moser, Sebastian Palacio, Andreas Dengel

We present ObjBlur, a novel curriculum learning approach to improve layout-to-image generation models, where the task is to produce realistic images from layouts composed of boxes and labels.

Layout-to-Image Generation

Latent Dataset Distillation with Diffusion Models

no code implementations6 Mar 2024 Brian B. Moser, Federico Raue, Sebastian Palacio, Stanislav Frolov, Andreas Dengel

To address both challenges, this paper proposes Latent Dataset Distillation with Diffusion Models (LD3M) that combine diffusion in latent space with dataset distillation.

Dataset Distillation

Diffusion Models, Image Super-Resolution And Everything: A Survey

no code implementations1 Jan 2024 Brian B. Moser, Arundhati S. Shanbhag, Federico Raue, Stanislav Frolov, Sebastian Palacio, Andreas Dengel

Diffusion Models (DMs) have disrupted the image Super-Resolution (SR) field and further closed the gap between image quality and human perceptual preferences.

Computational Efficiency Image Super-Resolution +2

Dynamic Attention-Guided Diffusion for Image Super-Resolution

no code implementations15 Aug 2023 Brian B. Moser, Stanislav Frolov, Federico Raue, Sebastian Palacio, Andreas Dengel

To address this, we propose ``You Only Diffuse Areas'' (YODA), a dynamic attention-guided diffusion process for image SR. YODA selectively focuses on spatial regions defined by attention maps derived from the low-resolution images and the current denoising time step.

Denoising Image Super-Resolution +1

DWA: Differential Wavelet Amplifier for Image Super-Resolution

1 code implementation10 Jul 2023 Brian B. Moser, Stanislav Frolov, Federico Raue, Sebastian Palacio, Andreas Dengel

This work introduces Differential Wavelet Amplifier (DWA), a drop-in module for wavelet-based image Super-Resolution (SR).

Image Super-Resolution

Are Visual Recognition Models Robust to Image Compression?

no code implementations10 Apr 2023 João Maria Janeiro, Stanislav Frolov, Alaaeldin El-Nouby, Jakob Verbeek

For example, for segmentation mIoU is reduced from 44. 5 to 30. 5 mIoU when compressing to 0. 1 bpp using the best compression model we evaluated.

Image Classification Image Compression +4

DT2I: Dense Text-to-Image Generation from Region Descriptions

no code implementations5 Apr 2022 Stanislav Frolov, Prateek Bansal, Jörn Hees, Andreas Dengel

Our results demonstrate the capability of our approach to generate plausible images of complex scenes using region captions.

Conditional Image Generation Image-text matching +2

Combining Transformer Generators with Convolutional Discriminators

no code implementations21 May 2021 Ricard Durall, Stanislav Frolov, Jörn Hees, Federico Raue, Franz-Josef Pfreundt, Andreas Dengel, Janis Keupe

Transformer models have recently attracted much interest from computer vision researchers and have since been successfully employed for several problems traditionally addressed with convolutional neural networks.

Data Augmentation Image Generation +1

AttrLostGAN: Attribute Controlled Image Synthesis from Reconfigurable Layout and Style

1 code implementation25 Mar 2021 Stanislav Frolov, Avneesh Sharma, Jörn Hees, Tushar Karayil, Federico Raue, Andreas Dengel

In this paper, we propose a method for attribute controlled image synthesis from layout which allows to specify the appearance of individual objects without affecting the rest of the image.

Attribute Layout-to-Image Generation

Adversarial Text-to-Image Synthesis: A Review

no code implementations25 Jan 2021 Stanislav Frolov, Tobias Hinz, Federico Raue, Jörn Hees, Andreas Dengel

With the advent of generative adversarial networks, synthesizing images from textual descriptions has recently become an active research area.

Adversarial Text Conditional Image Generation +1

Leveraging Visual Question Answering to Improve Text-to-Image Synthesis

no code implementations LANTERN (COLING) 2020 Stanislav Frolov, Shailza Jolly, Jörn Hees, Andreas Dengel

We create additional training samples by concatenating question and answer (QA) pairs and employ a standard VQA model to provide the T2I model with an auxiliary learning signal.

Auxiliary Learning Image Generation +2

Hybrid-S2S: Video Object Segmentation with Recurrent Networks and Correspondence Matching

1 code implementation10 Oct 2020 Fatemeh Azimi, Stanislav Frolov, Federico Raue, Joern Hees, Andreas Dengel

In this work, we study an RNN-based architecture and address some of these issues by proposing a hybrid sequence-to-sequence architecture named HS2S, utilizing a dual mask propagation strategy that allows incorporating the information obtained from correspondence matching.

One-shot visual object segmentation Segmentation +3

Cannot find the paper you are looking for? You can Submit a new open access paper.