2 code implementations • 19 Nov 2024 • Laura Ruis, Maximilian Mozes, Juhan Bae, Siddhartha Rao Kamalakara, Dwarak Talupuru, Acyr Locatelli, Robert Kirk, Tim Rocktäschel, Edward Grefenstette, Max Bartolo
We find that, while the models rely on mostly distinct sets of data for each factual question, a document often has a similar influence across different reasoning questions within the same task, indicating the presence of procedural knowledge.
no code implementations • 22 Oct 2024 • Itamar Pres, Laura Ruis, Ekdeep Singh Lubana, David Krueger
Representation engineering methods have recently shown promise for enabling efficient steering of model behavior.
1 code implementation • 9 Feb 2024 • Akbir Khan, John Hughes, Dan Valentine, Laura Ruis, Kshitij Sachan, Ansh Radhakrishnan, Edward Grefenstette, Samuel R. Bowman, Tim Rocktäschel, Ethan Perez
In anticipation of this, we ask: can weaker models assess the correctness of stronger models?
1 code implementation • NeurIPS 2023 • Laura Ruis, Akbir Khan, Stella Biderman, Sara Hooker, Tim Rocktäschel, Edward Grefenstette
We present our findings as the starting point for further research into evaluating how LLMs interpret language in context and to drive the development of more pragmatic and useful models of human discourse.
1 code implementation • 22 Feb 2022 • Laura Ruis, Brenden Lake
After training on an augmented dataset with almost forty times more adverbs than the original problem, a non-modular baseline is not able to systematically generalize to a novel combination of a known verb and adverb.
4 code implementations • NeurIPS 2020 • Laura Ruis, Jacob Andreas, Marco Baroni, Diane Bouchacourt, Brenden M. Lake
In this paper, we introduce a new benchmark, gSCAN, for evaluating compositional generalization in situated language understanding.
no code implementations • 15 Jan 2020 • Laura Ruis, Mitchell Stern, Julia Proskurnia, William Chan
We propose the Insertion-Deletion Transformer, a novel transformer-based neural architecture and training method for sequence generation.