Search Results for author: Ali Razavi

Found 13 papers, 6 papers with code

Aspects of scaling and scalability for flow-based sampling of lattice QCD

no code implementations 14 Nov 2022 Ryan Abbott, Michael S. Albergo, Aleksandar Botev, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Alexander G. D. G. Matthews, Sébastien Racanière, Ali Razavi, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

Recent applications of machine-learned normalizing flows to sampling in lattice field theory suggest that such methods may be able to mitigate critical slowing down and topological freezing.
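Below is a toy sketch, not the paper's method, of the flow-based sampling idea for a one-dimensional scalar lattice theory: proposals are drawn from a tractable "flow" density and corrected with an independence Metropolis step against exp(-S). The action, the fixed rescaling standing in for a trained flow, and all constants are invented for illustration; the gauge-equivariant flows for QCD that the paper studies are far more involved.

```python
# Toy flow-based sampler for a periodic 1D phi^4 lattice (illustrative only).
import numpy as np

rng = np.random.default_rng(0)
N = 16                # lattice sites
M2, LAM = 1.0, 0.5    # toy action parameters

def action(phi):
    kinetic = 0.5 * np.sum((np.roll(phi, -1) - phi) ** 2)
    return kinetic + np.sum(0.5 * M2 * phi**2 + LAM * phi**4)

# Stand-in "flow": a fixed rescaling of Gaussian noise, phi = s * z.
# A trained flow would make log q(phi) track -S(phi) much more closely.
S_SCALE = 0.7

def sample_flow():
    z = rng.standard_normal(N)
    phi = S_SCALE * z
    # exact log density of the pushforward of N(0, I) under phi = s * z
    log_q = -0.5 * np.sum(z**2) - 0.5 * N * np.log(2 * np.pi) - N * np.log(S_SCALE)
    return phi, log_q

# Independence Metropolis: accept with prob min(1, p(x')q(x) / (p(x)q(x'))).
phi, log_q = sample_flow()
accepted = 0
for _ in range(10_000):
    phi_new, log_q_new = sample_flow()
    log_ratio = (log_q - action(phi_new)) - (log_q_new - action(phi))
    if np.log(rng.uniform()) < log_ratio:
        phi, log_q = phi_new, log_q_new
        accepted += 1
print(f"acceptance rate: {accepted / 10_000:.2%}")
```

The acceptance rate is one of the quantities whose behaviour at scale the paper examines: the better the flow matches exp(-S), the higher it stays as the lattice grows.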

Vector Quantized Models for Planning

no code implementations 8 Jun 2021 Sherjil Ozair, Yazhe Li, Ali Razavi, Ioannis Antonoglou, Aäron van den Oord, Oriol Vinyals

Our key insight is to use discrete autoencoders to capture the multiple possible effects of an action in a stochastic environment.
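As a hedged illustration of the discrete-autoencoder bottleneck the abstract refers to, the sketch below shows plain vector quantization: each encoder output snaps to its nearest codebook entry, yielding one discrete token per position. Codebook size, dimensions, and names are invented here; this is not the authors' code.

```python
# Minimal vector-quantization bottleneck (illustrative sizes and names).
import numpy as np

rng = np.random.default_rng(0)
K, D = 8, 4                            # codebook size, embedding dim
codebook = rng.standard_normal((K, D))

def quantize(z):
    """Map encoder outputs z of shape (n, D) to nearest codebook entries."""
    d2 = ((z[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)  # (n, K)
    idx = d2.argmin(axis=1)            # one discrete token per position
    return idx, codebook[idx]

z = rng.standard_normal((5, D))        # stand-in for encoder outputs
tokens, z_q = quantize(z)
print("discrete codes:", tokens)
```

In the planning setting, a finite set of such codes gives a search procedure a finite set of branches per action, one per plausible stochastic outcome, which is how I read the abstract's "multiple possible effects".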

Predicting Video with VQVAE

1 code implementation 2 Mar 2021 Jacob Walker, Ali Razavi, Aäron van den Oord

In recent years, the task of video prediction (forecasting future video given past video frames) has attracted attention in the research community.

Video Generation Video Prediction

Do Transformers Need Deep Long-Range Memory?

1 code implementation 7 Jul 2020 (ACL 2020) Jack W. Rae, Ali Razavi

Deep attention models have advanced the modelling of sequential data across many domains.

Deep Attention Language Modelling

Preventing Posterior Collapse with delta-VAEs

no code implementations ICLR 2019 Ali Razavi, Aäron van den Oord, Ben Poole, Oriol Vinyals

Due to the phenomenon of "posterior collapse," current latent variable generative models pose a challenging design choice: either weaken the capacity of the decoder, or augment the objective so that it does not only maximize the likelihood of the data. (A minimal sketch of the anti-collapse idea follows this entry.)

Ranked #7 on Image Generation on ImageNet 32x32 (bpd metric)

Image Generation Representation Learning
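For the delta-VAE entry above, here is a minimal sketch under the simplifying assumption of scalar Gaussian posteriors and a standard normal prior: constrain the variational family so that the KL to the prior (the rate) cannot fall below some delta > 0, no matter what the encoder does. Fixing the posterior standard deviation away from the prior's is one simple such constraint; the paper's constructions (for example with autoregressive priors) are richer.

```python
# Committed-rate illustration: a posterior family that cannot collapse
# onto the N(0, 1) prior because its std is fixed away from 1.
import numpy as np

SIGMA = 0.5  # fixed posterior std, a design choice (illustrative value)

def kl_to_std_normal(mu, sigma=SIGMA):
    # KL( N(mu, sigma^2) || N(0, 1) ), per dimension
    return 0.5 * (sigma**2 + mu**2 - 1.0 - 2.0 * np.log(sigma))

delta = kl_to_std_normal(mu=0.0)   # minimum achievable rate, > 0 for SIGMA != 1
print(f"committed rate delta = {delta:.3f} nats")
print(f"rate at mu = 1.0: {kl_to_std_normal(1.0):.3f} nats")
```

Because the rate is bounded below by delta even at mu = 0, the optimizer cannot silence the latent channel entirely, which is the collapse mode the abstract describes.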

Hyperbolic Attention Networks

no code implementations ICLR 2019 Caglar Gulcehre, Misha Denil, Mateusz Malinowski, Ali Razavi, Razvan Pascanu, Karl Moritz Hermann, Peter Battaglia, Victor Bapst, David Raposo, Adam Santoro, Nando de Freitas

We introduce hyperbolic attention networks to endow neural networks with enough capacity to match the complexity of data with hierarchical and power-law structure. (A toy sketch of distance-based attention follows this entry.)

Machine Translation Question Answering +2
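A toy sketch of the core move in the hyperbolic attention entry above: score query/key pairs by negative hyperbolic distance rather than a dot product, so that points near the ball's boundary can encode tree-like, power-law structure. Two caveats: this uses the Poincaré-ball distance and an ordinary weighted sum over values, whereas the paper works with hyperboloid-model operations and hyperbolic aggregation; all shapes and constants are illustrative.

```python
# Distance-based "hyperbolic" attention sketch (Poincare ball, illustrative).
import numpy as np

rng = np.random.default_rng(0)

def ball_points(n, d):
    # random points with norm < 0.5, safely inside the unit ball
    x = rng.standard_normal((n, d))
    return 0.5 * x / (1.0 + np.linalg.norm(x, axis=-1, keepdims=True))

def poincare_dist(u, v, eps=1e-9):
    # hyperbolic distance in the Poincare-ball model
    sq = np.sum((u - v) ** 2, axis=-1)
    nu, nv = np.sum(u * u, axis=-1), np.sum(v * v, axis=-1)
    return np.arccosh(1.0 + 2.0 * sq / ((1.0 - nu) * (1.0 - nv) + eps))

def hyperbolic_attention(q, k, v, beta=1.0):
    d = poincare_dist(q[:, None, :], k[None, :, :])   # (n_q, n_k) distances
    w = np.exp(-beta * d)
    w = w / w.sum(axis=1, keepdims=True)              # normalise over keys
    return w @ v                                      # Euclidean aggregation

q, k = ball_points(2, 4), ball_points(5, 4)
v = rng.standard_normal((5, 4))
print(hyperbolic_attention(q, k, v).shape)            # (2, 4)
```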

Population Based Training of Neural Networks

9 code implementations 27 Nov 2017 Max Jaderberg, Valentin Dalibard, Simon Osindero, Wojciech M. Czarnecki, Jeff Donahue, Ali Razavi, Oriol Vinyals, Tim Green, Iain Dunning, Karen Simonyan, Chrisantha Fernando, Koray Kavukcuoglu

Neural networks dominate the modern machine learning landscape, but their training and success still suffer from sensitivity to empirical choices of hyperparameters such as model architecture, loss function, and optimisation algorithm. (A toy sketch of the training loop follows this entry.)

Machine Translation Model Selection
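The population based training loop itself is simple enough to sketch end to end. Below it tunes a single learning rate on a quadratic objective; the population size, the "ready" interval, the truncation fraction, and the perturbation factors are arbitrary illustrative choices, not values from the paper.

```python
# Toy population based training: exploit (copy a top performer) and
# explore (perturb hyperparameters) at regular intervals.
import numpy as np

rng = np.random.default_rng(0)
POP, STEPS, READY = 8, 200, 20

theta = rng.standard_normal(POP) * 5.0       # each worker's "weights"
lr = rng.uniform(1e-3, 0.5, POP)             # each worker's hyperparameter

def loss(t):
    return t ** 2                            # objective: drive theta to 0

for step in range(1, STEPS + 1):
    theta -= lr * 2.0 * theta                # one SGD step per worker
    if step % READY == 0:
        order = np.argsort(loss(theta))      # best workers first
        top, bottom = order[:POP // 4], order[-POP // 4:]
        for b in bottom:
            src = rng.choice(top)
            theta[b] = theta[src]                      # exploit: copy weights
            lr[b] = lr[src] * rng.choice([0.8, 1.2])   # explore: perturb hyper
print(f"best loss: {loss(theta).min():.2e}")
```

Because exploitation copies weights as well as hyperparameters, the hyperparameter schedule is adapted online within a single training run rather than fixed in advance, which is the core contrast with grid or random search.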
