Search Results for author: James B. Simon

Found 13 papers, 5 papers with code

More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory

no code implementations • 24 Nov 2023 • James B. Simon, Dhruva Karkada, Nikhil Ghosh, Mikhail Belkin

In our era of enormous neural networks, empirical progress has been driven by the philosophy that more is better.

A Spectral Condition for Feature Learning

no code implementations • 26 Oct 2023 • Greg Yang, James B. Simon, Jeremy Bernstein

The push to train ever larger neural networks has motivated the study of initialization and training at large network width.

Les Houches Lectures on Deep Learning at Large & Infinite Width

no code implementations • 4 Sep 2023 • Yasaman Bahri, Boris Hanin, Antonin Brossollet, Vittorio Erba, Christian Keup, Rosalba Pacelli, James B. Simon

These lectures, presented at the 2022 Les Houches Summer School on Statistical Physics and Machine Learning, focus on the infinite-width limit and large-width regime of deep neural networks.

Gaussian Processes

An Agnostic View on the Cost of Overfitting in (Kernel) Ridge Regression

no code implementations • 22 Jun 2023 • Lijia Zhou, James B. Simon, Gal Vardi, Nathan Srebro

We study the cost of overfitting in noisy kernel ridge regression (KRR), which we define as the ratio between the test error of the interpolating ridgeless model and the test error of the optimally-tuned model.

regression

Tune As You Scale: Hyperparameter Optimization For Compute Efficient Training

no code implementations • 13 Jun 2023 • Abraham J. Fetterman, Ellie Kitanidis, Joshua Albrecht, Zachary Polizzi, Bryden Fogelman, Maksis Knutins, Bartosz Wróblewski, James B. Simon, Kanjun Qiu

Hyperparameter tuning of deep learning models can lead to order-of-magnitude performance gains for the same amount of compute.

Bayesian Optimization, Hyperparameter Optimization

On the Stepwise Nature of Self-Supervised Learning

1 code implementation • 27 Mar 2023 • James B. Simon, Maksis Knutins, Liu Ziyin, Daniel Geisz, Abraham J. Fetterman, Joshua Albrecht

We present a simple picture of the training process of joint embedding self-supervised learning methods.

Self-Supervised Learning

Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds

1 code implementation • 24 Oct 2022 • Joshua Albrecht, Abraham J. Fetterman, Bryden Fogelman, Ellie Kitanidis, Bartosz Wróblewski, Nicole Seo, Michael Rosenthal, Maksis Knutins, Zachary Polizzi, James B. Simon, Kanjun Qiu

As a benchmark tailored for studying RL generalization, we introduce Avalon, a set of tasks in which embodied agents in highly diverse procedural 3D worlds must survive by navigating terrain, hunting or gathering food, and avoiding hazards.

Navigate, Reinforcement Learning (RL)

On Kernel Regression with Data-Dependent Kernels

no code implementations • 4 Sep 2022 • James B. Simon

The primary hyperparameter in kernel regression (KR) is the choice of kernel.

regression

Benign, Tempered, or Catastrophic: A Taxonomy of Overfitting

no code implementations • 14 Jul 2022 • Neil Mallinar, James B. Simon, Amirhesam Abedsoltan, Parthe Pandit, Mikhail Belkin, Preetum Nakkiran

In this work we argue that while benign overfitting has been instructive and fruitful to study, many real interpolating methods like neural networks do not fit benignly: modest noise in the training set causes nonzero (but non-infinite) excess risk at test time, implying these models are neither benign nor catastrophic but rather fall in an intermediate regime.

Learning Theory