Search Results for author: Simon Holk

Found 1 papers, 0 papers with code

PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning

no code implementations • 23 Feb 2024 • Simon Holk, Daniel Marta, Iolanda Leite

In this work, we approach the sample-efficiency challenge by expanding the information collected per query to contain both preferences and optional text prompting.

Language Modelling Large Language Model +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.