Search Results for author: Freya Behrens

Found 5 papers, 2 papers with code

A phase transition between positional and semantic learning in a solvable model of dot-product attention

no code implementations 6 Feb 2024 Hugo Cui, Freya Behrens, Florent Krzakala, Lenka Zdeborová

We investigate how a dot-product attention layer learns a positional attention matrix (with tokens attending to each other based on their respective positions) and a semantic attention matrix (with tokens attending to each other based on their meaning).

Unveiling the Hessian's Connection to the Decision Boundary

1 code implementation 12 Jun 2023 Mahalakshmi Sabanayagam, Freya Behrens, Urte Adomaityte, Anna Dawid

Based on this finding, we provide a new and straightforward approach to studying the complexity of a high-dimensional decision boundary; show that this connection naturally inspires a new generalization measure; and finally, we develop a novel margin estimation technique which, in combination with the generalization measure, precisely identifies minima with simple wide-margin boundaries.

Bandits for Learning to Explain from Explanations

no code implementations 7 Feb 2021 Freya Behrens, Stefano Teso, Davide Mottin

We introduce Explearn, an online algorithm that learns to jointly output predictions and explanations for those predictions.

Gaussian Processes, Multi-Armed Bandits

Towards Neurally Augmented ALISTA

no code implementations 23 Oct 2020 Freya Behrens, Jonathan Sauder, Peter Jung

It is well-established that many iterative sparse reconstruction algorithms such as ISTA can be unrolled to yield a learnable neural network for improved empirical performance.

Neurally Augmented ALISTA

1 code implementation ICLR 2021 Freya Behrens, Jonathan Sauder, Peter Jung

A prime example is learned ISTA (LISTA) where weights, step sizes and thresholds are learned from training data.
