1 code implementation • 20 Oct 2023 • Henning Bartsch, Ole Jorgensen, Domenic Rosati, Jason Hoelscher-Obermaier, Jacob Pfau
Using this test, we find that despite increases in self-consistency, models usually place significant weight on alternative, inconsistent answers.
no code implementations • 11 Dec 2019 • Benjamin Kolb, Leon Lang, Henning Bartsch, Arwin Gansekoele, Raymond Koopmanschap, Leonardo Romor, David Speck, Mathijs Mul, Elia Bruni
Previous research into agent communication has shown that a pre-trained guide can speed up the learning process of an imitation learning agent.
no code implementations • WS 2019 • Benjamin Kolb, Leon Lang, Henning Bartsch, Arwin Gansekoele, Raymond Koopmanschap, Leonardo Romor, David Speck, Mathijs Mul, Elia Bruni
Previous research into agent communication has shown that a pre-trained guide can speed up the learning process of an imitation learning agent.