no code implementations • 15 Oct 2023 • Julien Pourcel, Cédric Colas, Pierre-Yves Oudeyer, Laetitia Teodorescu
We here study automated problem generation in the context of the open-ended space of python programming puzzles.
no code implementations • 21 May 2023 • Cédric Colas, Laetitia Teodorescu, Pierre-Yves Oudeyer, Xingdi Yuan, Marc-Alexandre Côté
Without relying on any hand-coded goal representations, reward functions or curriculum, we show that LMA3 agents learn to master a large diversity of skills in a task-agnostic text-based environment.
no code implementations • 10 Feb 2023 • Laetitia Teodorescu, Xingdi Yuan, Marc-Alexandre Côté, Pierre-Yves Oudeyer
We show the importance of selectivity from the social peer's feedback; that experience replay needs to over-sample examples of rare goals; and that following self-generated goal sequences where the agent's competence is intermediate leads to significant improvements in final performance.
no code implementations • NAACL (Wordplay) 2022 • Laetitia Teodorescu, Eric Yuan, Marc-Alexandre Côté, Pierre-Yves Oudeyer
In this extended abstract we discuss the opportunities and challenges of studying intrinsically-motivated agents for exploration in textual environments.
1 code implementation • NeurIPS 2021 • Tristan Karch, Laetitia Teodorescu, Katja Hofmann, Clément Moulin-Frier, Pierre-Yves Oudeyer
While there is an extended literature studying how machines can learn grounded language, the topic of how to learn spatio-temporal linguistic concepts is still largely uncharted.
no code implementations • 9 Apr 2020 • Laetitia Teodorescu, Katja Hofmann, Pierre-Yves Oudeyer
Recognizing precise geometrical configurations of groups of objects is a key capability of human spatial cognition, yet little studied in the deep learning literature so far.
no code implementations • 20 Mar 2020 • Tristan Karch, Cédric Colas, Laetitia Teodorescu, Clément Moulin-Frier, Pierre-Yves Oudeyer
This paper investigates the idea of encoding object-centered representations in the design of the reward function and policy architectures of a language-guided reinforcement learning agent.