no code implementations • 22 Nov 2021 • Anna Kerekes, Anna Mészáros, Ferenc Huszár
In gradient descent, changing how we parametrize the model can lead to drastically different optimization trajectories, giving rise to a surprising range of meaningful inductive biases: identifying sparse classifiers or reconstructing low-rank matrices without explicit regularization.
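The effect described above can be illustrated with a small, hypothetical experiment (not code from the paper): on an underdetermined least-squares problem, plain gradient descent on `w` finds a dense minimum-norm solution, while the same algorithm on the "Hadamard" reparametrization `w = u * u`, started near zero, recovers a sparse solution with no explicit regularizer. All names and hyperparameters here are assumptions for illustration.

```python
import numpy as np

# Hypothetical illustration of implicit bias from reparametrization.
rng = np.random.default_rng(0)
n, d = 20, 50                            # fewer equations than unknowns
X = rng.standard_normal((n, d))
w_true = np.zeros(d)
w_true[:3] = 1.0                         # sparse, nonnegative ground truth
y = X @ w_true

def grad(w):
    return X.T @ (X @ w - y) / n         # gradient of 0.5/n * ||Xw - y||^2

# Direct parametrization: converges to the dense minimum-L2-norm interpolant.
w_direct = np.zeros(d)
for _ in range(20000):
    w_direct -= 0.05 * grad(w_direct)

# Hadamard parametrization w = u * u; the small init drives the sparse bias.
u = np.full(d, 1e-3)
for _ in range(20000):
    u -= 0.05 * 2 * u * grad(u * u)      # chain rule through w = u * u
w_hadamard = u * u

# The reparametrized run fits y with far fewer active coordinates.
print(np.sum(np.abs(w_direct) > 1e-2), np.sum(w_hadamard > 1e-2))
```

Both runs interpolate the data exactly; only the parametrization, and hence the optimization trajectory, differs.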
no code implementations • 19 Oct 2022 • Szilvia Ujváry, Zsigmond Telek, Anna Kerekes, Anna Mészáros, Ferenc Huszár
Sharpness-aware minimization (SAM) aims to improve the generalisation of gradient-based learning by seeking out flat minima.
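The SAM update itself is simple to sketch: ascend to an approximate worst-case point within a small L2 ball around the current weights, then descend using the gradient evaluated there. Below is a minimal, hypothetical version on a toy quadratic; the names `rho` and `lr` and the toy loss are assumptions, not taken from the paper above.

```python
import numpy as np

def sam_step(w, grad_fn, lr=0.1, rho=0.05):
    """One step of sharpness-aware minimization (sketch).

    First perturb to the boundary of an L2 ball of radius rho in the
    gradient direction, then apply the gradient from that point."""
    g = grad_fn(w)
    eps = rho * g / (np.linalg.norm(g) + 1e-12)  # ascent step
    g_sam = grad_fn(w + eps)                     # gradient at perturbed point
    return w - lr * g_sam

# Toy loss f(w) = 0.5 * ||w||^2, whose gradient is simply w.
w = np.array([1.0, -2.0])
for _ in range(100):
    w = sam_step(w, lambda w: w)
print(np.linalg.norm(w))  # shrinks toward the minimum at 0
```

In practice `grad_fn` would be a stochastic minibatch gradient, and the perturbation is recomputed on the same batch.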
no code implementations • 16 May 2023 • Francisco Vargas, Teodora Reu, Anna Kerekes
Denoising diffusion models are a class of generative models that have recently achieved state-of-the-art results across many domains.
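As a minimal, hypothetical sketch of how such models sample (DDPM-style ancestral sampling, not the method of the paper above): data is noised forward through a variance schedule, and samples are drawn by iteratively denoising from a Gaussian prior. Choosing the data distribution to be N(0, 1) makes the optimal noise predictor available in closed form, `eps_hat(x_t, t) = sqrt(1 - alpha_bar_t) * x_t`, so no network training is needed for the illustration; the schedule below is an assumption.

```python
import numpy as np

rng = np.random.default_rng(0)
T = 100
betas = np.linspace(1e-4, 0.2, T)        # assumed variance schedule
alphas = 1.0 - betas
alpha_bars = np.cumprod(alphas)

def eps_hat(x, t):
    # Exact E[eps | x_t] when the data law is N(0, 1).
    return np.sqrt(1.0 - alpha_bars[t]) * x

n = 20000
x = rng.standard_normal(n)               # start from the N(0, 1) prior
for t in range(T - 1, -1, -1):           # reverse (denoising) chain
    coef = betas[t] / np.sqrt(1.0 - alpha_bars[t])
    mean = (x - coef * eps_hat(x, t)) / np.sqrt(alphas[t])
    noise = rng.standard_normal(n) if t > 0 else 0.0
    x = mean + np.sqrt(betas[t]) * noise

print(x.mean(), x.std())  # close to 0 and 1: samples match the data law
```

With a learned network in place of `eps_hat`, the same loop generates samples from an arbitrary data distribution.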
no code implementations • 3 May 2024 • Patrik Reizinger, Szilvia Ujváry, Anna Mészáros, Anna Kerekes, Wieland Brendel, Ferenc Huszár
The last decade has seen blossoming research in deep learning theory attempting to answer, "Why does deep learning generalize?"