Search Results for author: Eric Vanden-Eijnden

Found 30 papers, 11 papers with code

Trainability and Accuracy of Neural Networks: An Interacting Particle System Approach

no code implementations 2 May 2018 Grant M. Rotskoff, Eric Vanden-Eijnden

We show that, when the number $n$ of units is large, the empirical distribution of the particles descends on a convex landscape towards the global minimum at a rate independent of $n$, with a resulting approximation error that universally scales as $O(n^{-1})$.

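For orientation, a minimal sketch of the interacting-particle view used in this line of work (notation and code are illustrative, not the paper's): each hidden unit is a "particle", and the network output is an empirical average over them, so that as n grows it approaches an integral against the particles' distribution with an O(1/n) approximation error.

    import numpy as np

    # Width-n network in the mean-field (1/n) scaling:
    # f_n(x) = (1/n) * sum_i c_i * relu(w_i . x).
    rng = np.random.default_rng(0)
    n, d = 1000, 5
    c = rng.normal(size=n)        # output weights ("particle" charges)
    w = rng.normal(size=(n, d))   # input weights ("particle" positions)

    def mean_field_net(x):
        return (c * np.maximum(w @ x, 0.0)).mean()

    print(mean_field_net(rng.normal(size=d)))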

Dynamical computation of the density of states and Bayes factors using nonequilibrium importance sampling

2 code implementations 28 Sep 2018 Grant M. Rotskoff, Eric Vanden-Eijnden

Nonequilibrium sampling is potentially much more versatile than its equilibrium counterpart, but it comes with challenges because the invariant distribution is not typically known when the dynamics breaks detailed balance.

Statistical Mechanics
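
A hedged sketch of the general mechanism (a textbook annealed-importance-sampling variant in 1-D, not the paper's estimator): walkers are driven out of equilibrium through a sequence of interpolating distributions and carry Jarzynski-style weights that correct for the lag behind equilibrium, yielding ratios of normalizing constants.

    import numpy as np

    rng = np.random.default_rng(0)
    U0 = lambda x: 0.5 * x**2                  # base: N(0, 1), Z0 = sqrt(2*pi)
    U1 = lambda x: 0.5 * (x - 2.0)**2 / 0.25   # target: N(2, 0.5^2), Z1 = 0.5*sqrt(2*pi)
    U = lambda x, b: (1 - b) * U0(x) + b * U1(x)

    n_walkers, n_steps = 5000, 200
    betas = np.linspace(0.0, 1.0, n_steps + 1)
    x = rng.normal(size=n_walkers)             # exact samples from the base
    logw = np.zeros(n_walkers)

    for k in range(1, n_steps + 1):
        logw += U(x, betas[k - 1]) - U(x, betas[k])     # accumulate minus the "work"
        prop = x + 0.3 * rng.normal(size=n_walkers)     # one Metropolis step at beta_k
        accept = np.log(rng.random(n_walkers)) < U(x, betas[k]) - U(prop, betas[k])
        x = np.where(accept, prop, x)

    print("estimated Z1/Z0:", np.exp(logw).mean(), "(exact: 0.5)")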

Parameters as interacting particles: long time convergence and asymptotic error scaling of neural networks

no code implementations NeurIPS 2018 Grant Rotskoff, Eric Vanden-Eijnden

The performance of neural networks on high-dimensional data distributions suggests that it may be possible to parameterize a representation of a given high-dimensional function with controllably small errors, potentially outperforming standard interpolation methods.

Global convergence of neuron birth-death dynamics

no code implementations5 Feb 2019 Grant Rotskoff, Samy Jelassi, Joan Bruna, Eric Vanden-Eijnden

Neural networks with a large number of parameters admit a mean-field description, which has recently served as a theoretical explanation for the favorable training properties of "overparameterized" models.

Optimization and Generalization of Shallow Neural Networks with Quadratic Activation Functions

no code implementations NeurIPS 2020 Stefano Sarao Mannelli, Eric Vanden-Eijnden, Lenka Zdeborová

We consider a teacher-student scenario where the teacher has the same structure as the student with a hidden layer of smaller width $m^*\le m$.
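
A minimal sketch of that teacher-student setup with quadratic activations (normalization and second-layer weights omitted; all names are illustrative):

    import numpy as np

    rng = np.random.default_rng(1)
    d, m_star, m, n = 20, 3, 10, 1000

    W_teacher = rng.normal(size=(m_star, d)) / np.sqrt(d)
    X = rng.normal(size=(n, d))
    y = ((X @ W_teacher.T) ** 2).sum(axis=1)       # teacher labels, quadratic units

    W_student = rng.normal(size=(m, d)) / np.sqrt(d)
    y_hat = ((X @ W_student.T) ** 2).sum(axis=1)   # untrained student prediction
    print("initial mean squared error:", np.mean((y_hat - y) ** 2))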

Active Importance Sampling for Variational Objectives Dominated by Rare Events: Consequences for Optimization and Generalization

1 code implementation 11 Aug 2020 Grant M. Rotskoff, Andrew R. Mitchell, Eric Vanden-Eijnden

Deep neural networks, when optimized with sufficient data, provide accurate representations of high-dimensional functions; in contrast, function approximation techniques that have predominated in scientific computing do not scale well with dimensionality.

Learning Theory

A Dynamical Central Limit Theorem for Shallow Neural Networks

no code implementations NeurIPS 2020 Zhengdao Chen, Grant M. Rotskoff, Joan Bruna, Eric Vanden-Eijnden

Furthermore, if the mean-field dynamics converges to a measure that interpolates the training data, we prove that the asymptotic deviation eventually vanishes in the CLT scaling.

Sharp Asymptotic Estimates for Expectations, Probabilities, and Mean First Passage Times in Stochastic Systems with Small Noise

no code implementations 8 Mar 2021 Tobias Grafke, Tobias Schäfer, Eric Vanden-Eijnden

Freidlin-Wentzell theory of large deviations can be used to compute the likelihood of extreme or rare events in stochastic dynamical systems via the solution of an optimization problem.

Statistical Mechanics, Optimization and Control, Probability, Fluid Dynamics
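
Schematically, for additive noise (notation assumed here, not quoted from the paper), with dX = b(X) dt + sqrt(eps) dW the Freidlin-Wentzell estimate reads

    P\big(X^\varepsilon \in A\big) \asymp \exp\!\Big(-\tfrac{1}{\varepsilon}\,\inf_{\phi \in A} S_T(\phi)\Big),
    \qquad
    S_T(\phi) = \tfrac12 \int_0^T \big|\dot\phi(t) - b(\phi(t))\big|^2 \, dt,

so the rare-event probability is controlled by a single path-space optimization (the instanton), which the paper refines beyond the leading exponential order.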

On Energy-Based Models with Overparametrized Shallow Neural Networks

1 code implementation 15 Apr 2021 Carles Domingo-Enrich, Alberto Bietti, Eric Vanden-Eijnden, Joan Bruna

Energy-based models (EBMs) are a simple yet powerful framework for generative modeling.

Dual Training of Energy-Based Models with Overparametrized Shallow Neural Networks

no code implementations 11 Jul 2021 Carles Domingo-Enrich, Alberto Bietti, Marylou Gabrié, Joan Bruna, Eric Vanden-Eijnden

In the feature-learning regime, this dual formulation justifies using a two time-scale gradient ascent-descent (GDA) training algorithm in which one updates concurrently the particles in the sample space and the neurons in the parameter space of the energy.
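
A toy stand-in for the two time-scale idea (my own simplification, not the paper's dual objective): persistent particles follow Langevin dynamics on the current energy at a fast time scale, while the energy parameters take slower contrastive gradient steps.

    import torch

    torch.manual_seed(0)
    d, width, n_particles = 2, 64, 256
    energy = torch.nn.Sequential(torch.nn.Linear(d, width), torch.nn.Softplus(),
                                 torch.nn.Linear(width, 1))
    data = torch.randn(1024, d) + torch.tensor([2.0, 0.0])   # toy data distribution
    particles = torch.randn(n_particles, d)                  # persistent sample-space particles
    opt = torch.optim.SGD(energy.parameters(), lr=1e-3)      # slow time scale
    eps = 1e-2                                               # fast time scale

    for step in range(500):
        # fast: Langevin update of the particles under the current energy
        particles.requires_grad_(True)
        grad_x = torch.autograd.grad(energy(particles).sum(), particles)[0]
        particles = (particles - eps * grad_x
                     + (2 * eps) ** 0.5 * torch.randn_like(particles)).detach()
        # slow: contrastive update of the energy parameters
        batch = data[torch.randint(len(data), (n_particles,))]
        loss = energy(batch).mean() - energy(particles).mean()
        opt.zero_grad(); loss.backward(); opt.step()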

Efficient Bayesian Sampling Using Normalizing Flows to Assist Markov Chain Monte Carlo Methods

no code implementations ICML Workshop INNF 2021 Marylou Gabrié, Grant M. Rotskoff, Eric Vanden-Eijnden

Normalizing flows can generate complex target distributions and thus show promise in many applications in Bayesian statistics as an alternative or complement to MCMC for sampling posteriors.
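
A hedged sketch of the mechanism (a fitted Gaussian stands in for the trained flow, and the target is a toy 1-D density): the generative model serves as an independence proposal inside Metropolis-Hastings, so the chain remains exact no matter how imperfect the proposal is.

    import numpy as np

    rng = np.random.default_rng(0)
    log_target = lambda x: -0.5 * (x - 3.0) ** 2 / 0.25               # unnormalized log pi
    mu, sig = 2.5, 1.0                                                # "flow" parameters
    log_prop = lambda x: -0.5 * (x - mu) ** 2 / sig**2 - np.log(sig)  # proposal log-density

    x, chain = 0.0, []
    for _ in range(20000):
        y = mu + sig * rng.normal()                                   # independent proposal
        log_alpha = (log_target(y) + log_prop(x)) - (log_target(x) + log_prop(y))
        if np.log(rng.random()) < log_alpha:
            x = y
        chain.append(x)

    print("chain mean:", np.mean(chain[5000:]), "(target mean 3.0)")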

On feature learning in shallow and multi-layer neural networks with global convergence guarantees

no code implementations ICLR 2022 Zhengdao Chen, Eric Vanden-Eijnden, Joan Bruna

We study the optimization of over-parameterized shallow and multi-layer neural networks (NNs) in a regime that allows feature learning while admitting non-asymptotic global convergence guarantees.

Neural Galerkin Schemes with Active Learning for High-Dimensional Evolution Equations

1 code implementation 2 Mar 2022 Joan Bruna, Benjamin Peherstorfer, Eric Vanden-Eijnden

Neural Galerkin schemes build on the Dirac-Frenkel variational principle to train networks by minimizing the residual sequentially over time, which enables adaptively collecting new training data in a self-informed manner that is guided by the dynamics described by the partial differential equations.

Active Learning
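
A compact way to write the idea (notation assumed, not quoted from the paper): with ansatz u(x; theta(t)) for a PDE d/dt u = f(x, u), the Dirac-Frenkel principle yields an ODE for the parameters by least-squares projection of the residual,

    M(\theta)\,\dot\theta = F(\theta), \qquad
    M(\theta) = \mathbb{E}_{x\sim\nu}\!\big[\nabla_\theta u \,\nabla_\theta u^{\top}\big], \qquad
    F(\theta) = \mathbb{E}_{x\sim\nu}\!\big[\nabla_\theta u \; f(x, u)\big],

with the sampling measure nu adapted over time, which is where the self-informed data collection enters.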

On Feature Learning in Neural Networks with Global Convergence Guarantees

no code implementations22 Apr 2022 Zhengdao Chen, Eric Vanden-Eijnden, Joan Bruna

We study the optimization of wide neural networks (NNs) via gradient flow (GF) in setups that allow feature learning while admitting non-asymptotic global convergence guarantees.

Probability flow solution of the Fokker-Planck equation

no code implementations 9 Jun 2022 Nicholas M. Boffi, Eric Vanden-Eijnden

The method of choice for integrating the time-dependent Fokker-Planck equation in high dimension is to generate samples from the solution via integration of the associated stochastic differential equation.
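
A hedged, self-contained illustration of the probability-flow alternative (the paper learns the score with a neural network; here an Ornstein-Uhlenbeck process is used so the score is available in closed form):

    import numpy as np

    # For dx = -x dt + sqrt(2) dW, the probability-flow ODE
    #     dx/dt = -x - d/dx log rho_t(x)
    # transports samples through the same time-dependent density as the SDE.
    rng = np.random.default_rng(0)
    m0, s0 = 4.0, 0.5                           # initial condition: N(m0, s0^2)
    x = m0 + s0 * rng.normal(size=100000)

    T, dt, t = 1.0, 1e-3, 0.0
    while t < T:
        m_t = m0 * np.exp(-t)                          # exact Gaussian mean at time t
        var_t = 1.0 + (s0**2 - 1.0) * np.exp(-2 * t)   # exact Gaussian variance at time t
        score = -(x - m_t) / var_t                     # d/dx log rho_t(x)
        x = x + dt * (-x - score)                      # deterministic Euler step
        t += dt

    m_T = m0 * np.exp(-T)
    var_T = 1.0 + (s0**2 - 1.0) * np.exp(-2 * T)
    print("ODE samples:", x.mean(), x.var(), "| exact:", m_T, var_T)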

Learning Optimal Flows for Non-Equilibrium Importance Sampling

1 code implementation 20 Jun 2022 Yu Cao, Eric Vanden-Eijnden

On the theory side, we discuss how to tailor the velocity field to the target and establish general conditions under which the proposed estimator is a perfect estimator with zero variance.

Learning sparse features can lead to overfitting in neural networks

1 code implementation 24 Jun 2022 Leonardo Petrini, Francesco Cagnetta, Eric Vanden-Eijnden, Matthieu Wyart

It is widely believed that the success of deep networks lies in their ability to learn a meaningful representation of the features of the data.

Building Normalizing Flows with Stochastic Interpolants

1 code implementation 30 Sep 2022 Michael S. Albergo, Eric Vanden-Eijnden

A generative model based on a continuous-time normalizing flow between any pair of base and target probability densities is proposed.

Benchmarking, Density Estimation +1
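
A hedged 1-D toy version of the recipe (the network and names are illustrative, not the paper's code): draw pairs (x0, x1) from base and target, regress a velocity field v(t, x) onto the time derivative of the linear interpolant x_t = (1 - t) x0 + t x1, then integrate dx/dt = v(t, x) to map base samples onto the target.

    import torch

    torch.manual_seed(0)
    base = lambda n: torch.randn(n, 1)                  # rho_0 = N(0, 1)
    target = lambda n: 2.0 + 0.5 * torch.randn(n, 1)    # rho_1 = N(2, 0.5^2)

    v = torch.nn.Sequential(torch.nn.Linear(2, 64), torch.nn.Tanh(),
                            torch.nn.Linear(64, 1))
    opt = torch.optim.Adam(v.parameters(), lr=1e-3)

    for step in range(2000):
        x0, x1 = base(256), target(256)
        t = torch.rand(256, 1)
        xt = (1 - t) * x0 + t * x1
        loss = ((v(torch.cat([t, xt], dim=1)) - (x1 - x0)) ** 2).mean()
        opt.zero_grad(); loss.backward(); opt.step()

    # Sampling: integrate the learned flow from t = 0 to t = 1.
    x, n_steps = base(10000), 100
    with torch.no_grad():
        for k in range(n_steps):
            t = torch.full((10000, 1), k / n_steps)
            x = x + (1.0 / n_steps) * v(torch.cat([t, x], dim=1))
    print("generated mean/std:", x.mean().item(), x.std().item(), "(target 2.0, 0.5)")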

A Functional-Space Mean-Field Theory of Partially-Trained Three-Layer Neural Networks

no code implementations28 Oct 2022 Zhengdao Chen, Eric Vanden-Eijnden, Joan Bruna

To understand the training dynamics of neural networks (NNs), prior studies have considered the infinite-width mean-field (MF) limit of two-layer NNs, establishing theoretical guarantees of its convergence under gradient flow training as well as its approximation and generalization capabilities.

Stochastic Interpolants: A Unifying Framework for Flows and Diffusions

1 code implementation 15 Mar 2023 Michael S. Albergo, Nicholas M. Boffi, Eric Vanden-Eijnden

The time-dependent probability density function of the stochastic interpolant is shown to satisfy a first-order transport equation as well as a family of forward and backward Fokker-Planck equations with tunable diffusion coefficient.

Denoising
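
In symbols (notation assumed), the density rho(t, x) of the interpolant satisfies the transport equation and, for any diffusion coefficient eps >= 0, an equivalent Fokker-Planck equation,

    \partial_t \rho + \nabla\!\cdot\!\big(b\,\rho\big) = 0,
    \qquad
    \partial_t \rho + \nabla\!\cdot\!\big((b + \varepsilon \nabla \log \rho)\,\rho\big) = \varepsilon\,\Delta\rho,

so samples can be generated either with the probability-flow ODE (eps = 0) or with an SDE whose diffusion level eps is a free design choice (with an analogous backward-in-time version).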

Efficient Training of Energy-Based Models Using Jarzynski Equality

1 code implementation NeurIPS 2023 Davide Carbone, Mengjian Hua, Simon Coste, Eric Vanden-Eijnden

Energy-based models (EBMs) are generative models inspired by statistical physics with a wide range of applications in unsupervised learning.

Coupling parameter and particle dynamics for adaptive sampling in Neural Galerkin schemes

no code implementations 27 Jun 2023 Yuxiao Wen, Eric Vanden-Eijnden, Benjamin Peherstorfer

Training nonlinear parametrizations such as deep neural networks to numerically approximate solutions of partial differential equations is often based on minimizing a loss that includes the residual, which is analytically available in limited settings only.

Deep learning probability flows and entropy production rates in active matter

no code implementations 22 Sep 2023 Nicholas M. Boffi, Eric Vanden-Eijnden

We show that a single instance of our network trained on a system of 4096 particles at one packing fraction can generalize to other regions of the phase diagram, including systems with as many as 32768 particles.

Stochastic interpolants with data-dependent couplings

no code implementations 5 Oct 2023 Michael S. Albergo, Mark Goldstein, Nicholas M. Boffi, Rajesh Ranganath, Eric Vanden-Eijnden

In this work, using the framework of stochastic interpolants, we formalize how to couple the base and the target densities, whereby samples from the base are computed conditionally given samples from the target in a way that is different from (but does not preclude) incorporating information about class labels or continuous embeddings.

Super-Resolution
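
A hedged fragment showing how the coupling changes the sampling of pairs in a 1-D toy (super-resolution flavour; everything here is illustrative): the base sample x0 is built from the target sample x1 rather than drawn independently, and the rest of the interpolant training is unchanged.

    import torch

    torch.manual_seed(0)
    x1 = 2.0 + 0.5 * torch.randn(256, 1)        # stand-in for clean data samples
    x0 = x1 + 0.5 * torch.randn_like(x1)        # coupled base: corruption of x1
    t = torch.rand(256, 1)
    xt = (1 - t) * x0 + t * x1                  # regress v(t, xt) on x1 - x0 as usual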

Multimarginal generative modeling with stochastic interpolants

no code implementations 5 Oct 2023 Michael S. Albergo, Nicholas M. Boffi, Michael Lindsey, Eric Vanden-Eijnden

Given a set of $K$ probability densities, we consider the multimarginal generative modeling problem of learning a joint distribution that recovers these densities as marginals.

Fairness, Style Transfer

Analysis of learning a flow-based generative model from limited sample complexity

1 code implementation 5 Oct 2023 Hugo Cui, Florent Krzakala, Eric Vanden-Eijnden, Lenka Zdeborová

We study the problem of training a flow-based generative model, parametrized by a two-layer autoencoder, to sample from a high-dimensional Gaussian mixture.

Denoising

Learning to Sample Better

no code implementations 17 Oct 2023 Michael S. Albergo, Eric Vanden-Eijnden

These lecture notes provide an introduction to recent advances in generative modeling methods based on the dynamical transportation of measures, by means of which samples from a simple base measure are mapped to samples from a target measure of interest.

SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers

1 code implementation 16 Jan 2024 Nanye Ma, Mark Goldstein, Michael S. Albergo, Nicholas M. Boffi, Eric Vanden-Eijnden, Saining Xie

We present Scalable Interpolant Transformers (SiT), a family of generative models built on the backbone of Diffusion Transformers (DiT).

Image Generation

Sequential-in-time training of nonlinear parametrizations for solving time-dependent partial differential equations

no code implementations 1 Apr 2024 Huan Zhang, Yifan Chen, Eric Vanden-Eijnden, Benjamin Peherstorfer

Sequential-in-time methods solve a sequence of training problems to fit nonlinear parametrizations such as neural networks to approximate solution trajectories of partial differential equations over time.
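
As a hedged illustration of one such training problem (an explicit-Euler flavour; the paper treats more general time integrators and parametrizations): given parameters theta_k at time t_k and a PDE d/dt u = f(x, u), the next parameters are obtained by fitting the time-stepped solution,

    \theta_{k+1} \in \arg\min_{\theta}\;
    \big\| u(\cdot;\theta) - u(\cdot;\theta_k) - \Delta t\, f\big(\cdot, u(\cdot;\theta_k)\big) \big\|_{L^2(\nu)}^2,

so the parametrization is re-fit (or warm-started) at every time step along the trajectory.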
