no code implementations • 6 Feb 2024 • Ossi Räisä, Joonas Jälkö, Antti Honkela
The remaining subsampling-induced variance decreases with larger batch sizes, so large batches reduce the effective total gradient variance.
no code implementations • 6 Feb 2024 • Ossi Räisä, Antti Honkela
We investigate how our theory works in practice by evaluating the performance of an ensemble over many synthetic datasets for several real datasets and downstream predictors.
2 code implementations • 28 May 2022 • Ossi Räisä, Joonas Jälkö, Samuel Kaski, Antti Honkela
For example, confidence intervals become too narrow, which we demonstrate with a simple experiment.