Search Results for author: Konstantin Dobler

Found 3 papers, 3 papers with code

Efficient Parallelization Layouts for Large-Scale Distributed Model Training

1 code implementation9 Nov 2023 Johannes Hagemann, Samuel Weinbach, Konstantin Dobler, Maximilian Schall, Gerard de Melo

In this work, we conduct a comprehensive ablation study of possible training configurations for large language models.

FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models

2 code implementations23 May 2023 Konstantin Dobler, Gerard de Melo

However, if we want to use a new tokenizer specialized for the target language, we cannot transfer the source model's embedding matrix.

NER Semantic Similarity +2

Art Creation with Multi-Conditional StyleGANs

1 code implementation23 Feb 2022 Konstantin Dobler, Florian Hübscher, Jan Westphal, Alejandro Sierra-Múnera, Gerard de Melo, Ralf Krestel

Our approach is based on the StyleGAN neural network architecture, but incorporates a custom multi-conditional control mechanism that provides fine-granular control over characteristics of the generated paintings, e. g., with regard to the perceived emotion evoked in a spectator.

Generative Adversarial Network

Cannot find the paper you are looking for? You can Submit a new open access paper.