Search Results for author: Leon Lang

Found 6 papers, 1 paper with code

When Your AIs Deceive You: Challenges with Partial Observability of Human Evaluators in Reward Learning

no code implementations • 27 Feb 2024 • Leon Lang, Davis Foote, Stuart Russell, Anca Dragan, Erik Jenner, Scott Emmons

Past analyses of reinforcement learning from human feedback (RLHF) assume that the human fully observes the environment.

Evaluating Shutdown Avoidance of Language Models in Textual Scenarios

1 code implementation • 3 Jul 2023 • Teun van der Weij, Simon Lermen, Leon Lang

Recently, there has been an increase in interest in evaluating large language models for emergent and dangerous capabilities.

A Program to Build E(N)-Equivariant Steerable CNNs

no code implementations • ICLR 2022 • Gabriele Cesa, Leon Lang, Maurice Weiler

This enables us to directly parameterize filters in terms of a band-limited basis on the base space, but also to easily implement steerable CNNs equivariant to a large number of groups.

A Wigner-Eckart Theorem for Group Equivariant Convolution Kernels

no code implementations • ICLR 2021 • Leon Lang, Maurice Weiler

Group equivariant convolutional networks (GCNNs) endow classical convolutional networks with additional symmetry priors, which can lead to a considerably improved performance.

Learning to Request Guidance in Emergent Communication

no code implementations • 11 Dec 2019 • Benjamin Kolb, Leon Lang, Henning Bartsch, Arwin Gansekoele, Raymond Koopmanschap, Leonardo Romor, David Speck, Mathijs Mul, Elia Bruni

Previous research into agent communication has shown that a pre-trained guide can speed up the learning process of an imitation learning agent.

Imitation Learning

Learning to request guidance in emergent language

no code implementations • WS 2019 • Benjamin Kolb, Leon Lang, Henning Bartsch, Arwin Gansekoele, Raymond Koopmanschap, Leonardo Romor, David Speck, Mathijs Mul, Elia Bruni

Previous research into agent communication has shown that a pre-trained guide can speed up the learning process of an imitation learning agent.

Imitation Learning
