Search Results for author: Luca Biggio

Found 16 papers, 6 papers with code

Understanding Counting in Small Transformers: The Interplay between Attention and Feed-Forward Layers

1 code implementation16 Jul 2024 Freya Behrens, Luca Biggio, Lenka Zdeborová

From a broader perspective, our analysis offers a framework to understand how the interaction of different architectural components of transformer models shapes diverse algorithmic solutions and approximations.

Memorization

Harnessing Synthetic Datasets: The Role of Shape Bias in Deep Neural Network Generalization

no code implementations10 Nov 2023 Elior Benarous, Sotiris Anagnostidis, Luca Biggio, Thomas Hofmann

In this study, we investigate how neural networks exhibit shape bias during training on synthetic datasets, serving as an indicator of the synthetic data quality.

Diversity

Gemtelligence: Accelerating Gemstone classification with Deep Learning

no code implementations31 May 2023 Tommaso Bendinelli, Luca Biggio, Daniel Nyfeler, Abhigyan Ghosh, Peter Tollan, Moritz Alexander Kirschmann, Olga Fink

The value of luxury goods, particularly investment-grade gemstones, is greatly influenced by their origin and authenticity, sometimes resulting in differences worth millions of dollars.

Classification

Uncertainty Quantification in Machine Learning for Engineering Design and Health Prognostics: A Tutorial

1 code implementation7 May 2023 Venkat Nemani, Luca Biggio, Xun Huan, Zhen Hu, Olga Fink, Anh Tran, Yan Wang, Xiaoge Zhang, Chao Hu

In this tutorial, we aim to provide a holistic lens on emerging UQ methods for ML models with a particular focus on neural networks and the applications of these UQ methods in tackling engineering design as well as prognostics and health management problems.

Decision Making Management +2

Controllable Neural Symbolic Regression

no code implementations20 Apr 2023 Tommaso Bendinelli, Luca Biggio, Pierre-Alexandre Kamienny

In symbolic regression, the goal is to find an analytical expression that accurately fits experimental data with the minimal use of mathematical symbols such as operators, variables, and constants.

Evolutionary Algorithms regression +1

An SDE for Modeling SAM: Theory and Insights

no code implementations19 Jan 2023 Enea Monzio Compagnoni, Luca Biggio, Antonio Orvieto, Frank Norbert Proske, Hans Kersting, Aurelien Lucchi

We study the SAM (Sharpness-Aware Minimization) optimizer which has recently attracted a lot of interest due to its increased performance over more classical variants of stochastic gradient descent.

Cosmology from Galaxy Redshift Surveys with PointNet

no code implementations22 Nov 2022 Sotiris Anagnostidis, Arne Thomsen, Tomasz Kacprzak, Tilman Tröster, Luca Biggio, Alexandre Refregier, Thomas Hofmann

In this work, we aim to improve upon two-point statistics by employing a \textit{PointNet}-like neural network to regress the values of the cosmological parameters directly from point cloud data.

Signal Propagation in Transformers: Theoretical Perspectives and the Role of Rank Collapse

no code implementations7 Jun 2022 Lorenzo Noci, Sotiris Anagnostidis, Luca Biggio, Antonio Orvieto, Sidak Pal Singh, Aurelien Lucchi

First, we show that rank collapse of the tokens' representations hinders training by causing the gradients of the queries and keys to vanish at initialization.

On the effectiveness of Randomized Signatures as Reservoir for Learning Rough Dynamics

no code implementations2 Jan 2022 Enea Monzio Compagnoni, Anna Scampicchio, Luca Biggio, Antonio Orvieto, Thomas Hofmann, Josef Teichmann

Many finance, physics, and engineering phenomena are modeled by continuous-time dynamical systems driven by highly irregular (stochastic) inputs.

LEMMA Time Series +1

Neural Symbolic Regression that Scales

2 code implementations11 Jun 2021 Luca Biggio, Tommaso Bendinelli, Alexander Neitz, Aurelien Lucchi, Giambattista Parascandolo

We procedurally generate an unbounded set of equations, and simultaneously pre-train a Transformer to predict the symbolic equation from a corresponding set of input-output-pairs.

regression scientific discovery +1

Uncertainty-aware Remaining Useful Life predictor

no code implementations8 Apr 2021 Luca Biggio, Alexander Wieland, Manuel Arias Chao, Iason Kastanis, Olga Fink

Remaining Useful Life (RUL) estimation is the problem of inferring how long a certain industrial asset can be expected to operate within its defined specifications.

Gaussian Processes

Cannot find the paper you are looking for? You can Submit a new open access paper.