Search Results for author: Belinda Tzen

Found 5 papers, 0 papers with code

Variational Principles for Mirror Descent and Mirror Langevin Dynamics

no code implementations16 Mar 2023 Belinda Tzen, Anant Raj, Maxim Raginsky, Francis Bach

Mirror descent, introduced by Nemirovski and Yudin in the 1970s, is a primal-dual convex optimization method that can be tailored to the geometry of the optimization problem at hand through the choice of a strongly convex potential function.

A mean-field theory of lazy training in two-layer neural nets: entropic regularization and controlled McKean-Vlasov dynamics

no code implementations5 Feb 2020 Belinda Tzen, Maxim Raginsky

We first consider the mean-field limit, where the finite population of neurons in the hidden layer is replaced by a continual ensemble, and show that our problem can be phrased as global minimization of a free-energy functional on the space of probability measures over the weights.

Neural Stochastic Differential Equations: Deep Latent Gaussian Models in the Diffusion Limit

no code implementations23 May 2019 Belinda Tzen, Maxim Raginsky

In deep latent Gaussian models, the latent variable is generated by a time-inhomogeneous Markov chain, where at each time step we pass the current state through a parametric nonlinear map, such as a feedforward neural net, and add a small independent Gaussian perturbation.

Variational Inference

Theoretical guarantees for sampling and inference in generative models with latent diffusions

no code implementations5 Mar 2019 Belinda Tzen, Maxim Raginsky

We introduce and study a class of probabilistic generative models, where the latent object is a finite-dimensional diffusion process on a finite time interval and the observed variable is drawn conditionally on the terminal point of the diffusion.

Variational Inference

Local Optimality and Generalization Guarantees for the Langevin Algorithm via Empirical Metastability

no code implementations18 Feb 2018 Belinda Tzen, Tengyuan Liang, Maxim Raginsky

For a particular local optimum of the empirical risk, with an arbitrary initialization, we show that, with high probability, at least one of the following two events will occur: (1) the Langevin trajectory ends up somewhere outside the $\varepsilon$-neighborhood of this particular optimum within a short recurrence time; (2) it enters this $\varepsilon$-neighborhood by the recurrence time and stays there until a potentially exponentially long escape time.

Cannot find the paper you are looking for? You can Submit a new open access paper.