Search Results for author: Grzegorz Swirszcz

Found 9 papers, 3 papers with code

Discovering faster matrix multiplication algorithms with reinforcement learning

2 code implementations • Nature 2022 • Alhussein Fawzi, Matej Balog, Aja Huang, Thomas Hubert, Bernardino Romera-Paredes, Mohammadamin Barekatain, Alexander Novikov, Francisco J. R. Ruiz, Julian Schrittwieser, Grzegorz Swirszcz, David Silver, Demis Hassabis, Pushmeet Kohli

Particularly relevant is the case of 4 × 4 matrices in a finite field, where AlphaTensor’s algorithm improves on Strassen’s two-level algorithm for the first time, to our knowledge, since its discovery 50 years ago2.

reinforcement-learning Reinforcement Learning (RL)

8,006

Paper
Code

Rapid training of deep neural networks without skip connections or normalization layers using Deep Kernel Shaping

2 code implementations • 5 Oct 2021 • James Martens, Andy Ballard, Guillaume Desjardins, Grzegorz Swirszcz, Valentin Dalibard, Jascha Sohl-Dickstein, Samuel S. Schoenholz

Using an extended and formalized version of the Q/C map analysis of Poole et al. (2016), along with Neural Tangent Kernel theory, we identify the main pathologies present in deep networks that prevent them from training fast and generalizing to unseen data, and show how these can be avoided by carefully controlling the "shape" of the network's initialization-time kernel function.

Paper
Code

Gradient Forward-Propagation for Large-Scale Temporal Video Modelling

no code implementations • CVPR 2021 • Mateusz Malinowski, Dimitrios Vytiniotis, Grzegorz Swirszcz, Viorica Patraucean, Joao Carreira

How can neural networks be trained on large-volume temporal data efficiently?

Action Recognition Blocking

Paper
Add Code

Sideways: Depth-Parallel Training of Video Models

no code implementations • CVPR 2020 • Mateusz Malinowski, Grzegorz Swirszcz, Joao Carreira, Viorica Patraucean

We propose Sideways, an approximate backpropagation scheme for training video models.

Paper
Add Code

Verification of Non-Linear Specifications for Neural Networks

no code implementations • ICLR 2019 • Chongli Qin, Krishnamurthy, Dvijotham, Brendan O'Donoghue, Rudy Bunel, Robert Stanforth, Sven Gowal, Jonathan Uesato, Grzegorz Swirszcz, Pushmeet Kohli

We show that a number of important properties of interest can be modeled within this class, including conservation of energy in a learned dynamics model of a physical system; semantic consistency of a classifier's output labels under adversarial perturbations and bounding errors in a system that predicts the summation of handwritten digits.

Paper
Add Code

Distilling Policy Distillation

no code implementations • 6 Feb 2019 • Wojciech Marian Czarnecki, Razvan Pascanu, Simon Osindero, Siddhant M. Jayakumar, Grzegorz Swirszcz, Max Jaderberg

The transfer of knowledge from one policy to another is an important tool in Deep Reinforcement Learning.

Paper
Add Code

Strength in Numbers: Trading-off Robustness and Computation via Adversarially-Trained Ensembles

no code implementations • ICLR 2019 • Edward Grefenstette, Robert Stanforth, Brendan O'Donoghue, Jonathan Uesato, Grzegorz Swirszcz, Pushmeet Kohli

We show that increasing the number of parameters in adversarially-trained models increases their robustness, and in particular that ensembling smaller models while adversarially training the entire ensemble as a single model is a more efficient way of spending said budget than simply using a larger single model.

Self-Driving Cars

Paper
Add Code

Local minima in training of neural networks

1 code implementation • 19 Nov 2016 • Grzegorz Swirszcz, Wojciech Marian Czarnecki, Razvan Pascanu

Given that deep networks are highly nonlinear systems optimized by local gradient methods, why do they not seem to be affected by bad local minima?

Paper
Code

Grouped Orthogonal Matching Pursuit for Variable Selection and Prediction

no code implementations • NeurIPS 2009 • Grzegorz Swirszcz, Naoki Abe, Aurelie C. Lozano

We consider the problem of variable group selection for least squares regression, namely, that of selecting groups of variables for best regression performance, leveraging and adhering to a natural grouping structure within the explanatory variables.

feature selection regression +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.