1 code implementation • 10 Jul 2023 • Fabian Paischer, Markus Hofmarcher, Sepp Hochreiter, Thomas Adler
We propose a more efficient training protocol that fits a linear mapping between image and text embeddings of CLIP via a closed-form solution.
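A minimal sketch of the core step, assuming paired CLIP image and text embeddings: the mapping can be fit in closed form via ordinary least squares rather than gradient-based training. The variable names and the random stand-in data below are illustrative, not the paper's exact protocol.

```python
import numpy as np

# Paired CLIP embeddings: X holds image embeddings, Y the matching text
# embeddings (n pairs, d-dimensional each). Random stand-ins for illustration.
n, d = 1024, 512
rng = np.random.default_rng(0)
X = rng.standard_normal((n, d))  # image embeddings
Y = rng.standard_normal((n, d))  # text embeddings

# Closed-form least-squares fit: W = argmin ||X W - Y||_F^2,
# solved directly instead of by iterative training.
W, *_ = np.linalg.lstsq(X, Y, rcond=None)

# Map a new image embedding into the text embedding space.
mapped = X[:1] @ W
print(mapped.shape)  # (1, 512)
```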
1 code implementation • NeurIPS 2023 • Thomas Schmied, Markus Hofmarcher, Fabian Paischer, Razvan Pascanu, Sepp Hochreiter
That is, performance on the pre-training tasks deteriorates when the model is fine-tuned on new tasks.
1 code implementation • NeurIPS 2023 • Fabian Paischer, Thomas Adler, Markus Hofmarcher, Sepp Hochreiter
Then we feed these tokens to a pretrained language model that serves as memory for the agent, providing it with a coherent and human-readable representation of the past.
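The mechanism can be sketched as follows, with a frozen language model compressing a token history into a memory vector; GPT-2 and the toy token history below are illustrative assumptions standing in for the paper's model choice and its observation-to-token retrieval step.

```python
import torch
from transformers import AutoModel, AutoTokenizer

# Frozen pretrained LM acting as the agent's memory module.
tok = AutoTokenizer.from_pretrained("gpt2")
lm = AutoModel.from_pretrained("gpt2").eval()
for p in lm.parameters():
    p.requires_grad = False  # the LM stays frozen; only the policy trains

# Toy token history: in the paper, tokens describing past observations
# would be retrieved from visual inputs and appended step by step.
history = "key door ladder key"
ids = tok(history, return_tensors="pt").input_ids

with torch.no_grad():
    out = lm(ids)

# The last hidden state is a compressed, human-readable-by-construction
# representation of the past for a downstream policy to condition on.
memory = out.last_hidden_state[:, -1]
print(memory.shape)  # torch.Size([1, 768])
```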
1 code implementation • 12 Jul 2022 • Christian Steinparz, Thomas Schmied, Fabian Paischer, Marius-Constantin Dinu, Vihang Patil, Angela Bitto-Nemling, Hamid Eghbal-zadeh, Sepp Hochreiter
Therefore, exploration strategies and learning methods are required that can track these steady domain shifts and adapt to them.
2 code implementations • 24 May 2022 • Fabian Paischer, Thomas Adler, Vihang Patil, Angela Bitto-Nemling, Markus Holzleitner, Sebastian Lehner, Hamid Eghbal-zadeh, Sepp Hochreiter
We propose to utilize a frozen Pretrained Language Transformer (PLT) for history representation and compression to improve sample efficiency.
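One way to picture the pipeline, as a sketch under assumptions: observations are projected into the token-embedding space of a frozen language model, which then compresses the history into a single vector. The random projection with a softmax over vocabulary embeddings below is a simplified stand-in for the paper's actual mechanism, and GPT-2 is an illustrative model choice.

```python
import torch
from transformers import AutoModel

# Frozen pretrained language transformer.
lm = AutoModel.from_pretrained("gpt2").eval()
for p in lm.parameters():
    p.requires_grad = False

E = lm.wte.weight  # (vocab_size, 768) frozen token-embedding matrix

def to_token_space(obs, proj, beta=8.0):
    # Simplified stand-in: a fixed random map into embedding space, then a
    # softmax-weighted average over the frozen vocabulary embeddings keeps
    # the projected observations on the token-embedding manifold.
    q = obs @ proj                      # (T, 768)
    attn = torch.softmax(beta * q @ E.T, dim=-1)
    return attn @ E                     # (T, 768)

obs_dim, T = 64, 5                      # toy observation size, history length
proj = torch.randn(obs_dim, E.shape[1]) / obs_dim**0.5
obs_history = torch.randn(T, obs_dim)   # stand-in for environment observations

with torch.no_grad():
    emb = to_token_space(obs_history, proj).unsqueeze(0)
    out = lm(inputs_embeds=emb)

# Last hidden state: a compressed summary of the observation history.
print(out.last_hidden_state[:, -1].shape)  # torch.Size([1, 768])
```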
1 code implementation • NAACL 2022 • Benjamin Minixhofer, Fabian Paischer, Navid Rekabsaz
Our method makes training large language models for new languages more accessible and less damaging to the environment.