no code implementations • 27 Feb 2024 • Leon Lang, Davis Foote, Stuart Russell, Anca Dragan, Erik Jenner, Scott Emmons
Past analyses of reinforcement learning from human feedback (RLHF) assume that the human fully observes the environment.
1 code implementation • 3 Jul 2023 • Teun van der Weij, Simon Lermen, Leon Lang
Recently, there has been an increase in interest in evaluating large language models for emergent and dangerous capabilities.
no code implementations • ICLR 2022 • Gabriele Cesa, Leon Lang, Maurice Weiler
This enables us to directly parameterize filters in terms of a band-limited basis on the base space, but also to easily implement steerable CNNs equivariant to a large number of groups.
no code implementations • ICLR 2021 • Leon Lang, Maurice Weiler
Group equivariant convolutional networks (GCNNs) endow classical convolutional networks with additional symmetry priors, which can lead to a considerably improved performance.
no code implementations • 11 Dec 2019 • Benjamin Kolb, Leon Lang, Henning Bartsch, Arwin Gansekoele, Raymond Koopmanschap, Leonardo Romor, David Speck, Mathijs Mul, Elia Bruni
Previous research into agent communication has shown that a pre-trained guide can speed up the learning process of an imitation learning agent.
no code implementations • WS 2019 • Benjamin Kolb, Leon Lang, Henning Bartsch, Arwin Gansekoele, Raymond Koopmanschap, Leonardo Romor, David Speck, Mathijs Mul, Elia Bruni
Previous research into agent communication has shown that a pre-trained guide can speed up the learning process of an imitation learning agent.