Search Results for author: James L. McClelland

Found 16 papers, 8 papers with code

Causal interventions expose implicit situation models for commonsense language understanding

1 code implementation • 6 Jun 2023 • Takateru Yamakoshi, James L. McClelland, Adele E. Goldberg, Robert D. Hawkins

Accounts of human language processing have long appealed to implicit "situation models" that enrich comprehension with relevant but unstated world knowledge.

World Knowledge

Achieving and Understanding Out-of-Distribution Generalization in Systematic Reasoning in Small-Scale Transformers

no code implementations • 7 Oct 2022 • Andrew J. Nam, Mustafa Abdool, Trevor Maxfield, James L. McClelland

As a step toward understanding how transformer-based systems generalize, we explore the question of out-of-distribution generalization (OODG) in small-scale transformers trained with examples from a known distribution.

Out-of-Distribution Generalization • Systematic Generalization

Learning to Reason With Relational Abstractions

no code implementations • 6 Oct 2022 • Andrew J. Nam, Mengye Ren, Chelsea Finn, James L. McClelland

Large language models have recently shown promising progress in mathematical reasoning when fine-tuned with human-generated sequences that walk through the steps of a solution.

Mathematical Reasoning

Systematic Generalization and Emergent Structures in Transformers Trained on Structured Tasks

no code implementations • 2 Oct 2022 • YuXuan Li, James L. McClelland

Transformer networks have seen great success in natural language processing and machine vision, where task objectives such as next word prediction and image classification benefit from nuanced context sensitivity across high-dimensional inputs.

Image Classification • Systematic Generalization

Language models show human-like content effects on reasoning tasks

1 code implementation • 14 Jul 2022 • Ishita Dasgupta, Andrew K. Lampinen, Stephanie C. Y. Chan, Hannah R. Sheahan, Antonia Creswell, Dharshan Kumaran, James L. McClelland, Felix Hill

We evaluate state-of-the-art large language models, as well as humans, and find that the language models reflect many of the same patterns observed in humans across these tasks: like humans, models answer more accurately when the semantic content of a task supports the logical inferences.

Language Modelling • Logical Reasoning • +2

Systematic human learning and generalization from a brief tutorial with explanatory feedback

no code implementations • 10 Jul 2021 • Andrew J. Nam, James L. McClelland

We also find that most of those who master the task can describe a valid solution strategy, and such participants perform better on transfer puzzles than those whose strategy descriptions are vague or incomplete.

High School Mathematics • Systematic Generalization • +1

Transforming task representations to perform novel tasks

3 code implementations • 8 May 2020 • Andrew K. Lampinen, James L. McClelland

We demonstrate the effectiveness of this framework across a wide variety of tasks and computational paradigms, ranging from regression to image classification and reinforcement learning.

Image Classification • Zero-Shot Learning

Extending Machine Language Models toward Human-Level Language Understanding

no code implementations • 12 Dec 2019 • James L. McClelland, Felix Hill, Maja Rudolph, Jason Baldridge, Hinrich Schütze

We take language to be a part of a system for understanding and communicating about situations.

Environmental drivers of systematicity and generalization in a situated agent

no code implementations • ICLR 2020 • Felix Hill, Andrew Lampinen, Rosalia Schneider, Stephen Clark, Matthew Botvinick, James L. McClelland, Adam Santoro

The question of whether deep neural networks are good at generalising beyond their immediate training experience is of critical importance for learning-based approaches to AI.

Unity

A mathematical theory of semantic development in deep neural networks

1 code implementation • 23 Oct 2018 • Andrew M. Saxe, James L. McClelland, Surya Ganguli

An extensive body of empirical research has revealed remarkable regularities in the acquisition, organization, deployment, and neural representation of human semantic knowledge, thereby raising a fundamental conceptual question: what are the theoretical principles governing the ability of neural networks to acquire, organize, and deploy abstract knowledge by integrating across many individual experiences?

Semantic Similarity • Semantic Textual Similarity
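
For orientation, this paper (like the 2013 paper below) analyzes deep linear networks, whose gradient-descent learning dynamics admit closed-form solutions. A minimal sketch of the kind of result derived there, in notation I am assuming rather than quoting from the paper: each input-output singular value $s_\alpha$ of the training data is learned along a sigmoidal trajectory

$a_\alpha(t) = \dfrac{s_\alpha\, e^{2 s_\alpha t/\tau}}{e^{2 s_\alpha t/\tau} - 1 + s_\alpha / a_\alpha^0},$

where $a_\alpha^0$ is the small initial strength of mode $\alpha$ and $\tau$ sets the learning timescale. Stronger modes (larger $s_\alpha$) are learned earlier, which is the theory's account of stage-like progressions in semantic development.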

One-shot and few-shot learning of word embeddings

no code implementations • 27 Oct 2017 • Andrew K. Lampinen, James L. McClelland

Standard deep learning systems require thousands or millions of examples to learn a concept, and cannot integrate new concepts easily.

Few-Shot Learning • Sentence • +1

Exact solutions to the nonlinear dynamics of learning in deep linear neural networks

3 code implementations • 20 Dec 2013 • Andrew M. Saxe, James L. McClelland, Surya Ganguli

We further exhibit a new class of random orthogonal initial conditions on weights that, like unsupervised pre-training, enjoys depth independent learning times.

Unsupervised Pre-training
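
As a reading aid, the "random orthogonal initial conditions" mentioned above can be sketched in a few lines of NumPy. This is an illustrative reconstruction of a standard QR-based orthogonal initializer, not code from the paper; the function name orthogonal_init is my own.

import numpy as np

def orthogonal_init(fan_out, fan_in, gain=1.0, seed=None):
    # Illustrative sketch: draw a Gaussian matrix and orthogonalize it
    # via QR decomposition (assumed to match the spirit, not the letter,
    # of the paper's initialization scheme).
    rng = np.random.default_rng(seed)
    a = rng.standard_normal((fan_out, fan_in))
    q, r = np.linalg.qr(a if fan_out >= fan_in else a.T)
    # Fix column signs using r's diagonal so the sampling is unbiased.
    q *= np.sign(np.diag(r))
    if fan_out < fan_in:
        q = q.T
    return gain * q

# Usage: a 256x512 weight matrix with orthonormal rows.
W = orthogonal_init(256, 512)
assert np.allclose(W @ W.T, np.eye(256), atol=1e-6)

Initializing every layer this way preserves the norm of signals propagated through a deep linear network, which is how such schemes can achieve learning times that do not grow with depth.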
