1 code implementation • 25 May 2023 • Cevahir Koprulu, Ufuk Topcu
Self-paced reinforcement learning (RL) aims to improve the data efficiency of learning by automatically creating sequences, namely curricula, of probability distributions over contexts.
no code implementations • 20 Apr 2022 • Christos Verginis, Cevahir Koprulu, Sandeep Chinchali, Ufuk Topcu
We develop a reinforcement-learning algorithm that infers a reward machine that encodes the underlying task while learning how to execute it, despite the uncertainties of the propositions' truth values.