Search Results for author: Aditya Rawal

Found 10 papers, 5 papers with code

From Nodes to Networks: Evolving Recurrent Neural Networks

no code implementations • 12 Mar 2018 • Aditya Rawal, Risto Miikkulainen

Gated recurrent networks such as those composed of Long Short-Term Memory (LSTM) nodes have recently been used to improve the state of the art in many sequential processing tasks such as speech recognition and machine translation.

Language Modelling • Machine Translation • +3
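For reference, a gated recurrent network of the kind this abstract describes can be written in a few lines of PyTorch. The layer sizes, vocabulary, and character-level setup below are illustrative assumptions, not details taken from the paper.

```python
import torch
import torch.nn as nn

class LSTMSequenceModel(nn.Module):
    """Minimal LSTM-based sequence model (illustrative sizes, not from the paper)."""
    def __init__(self, vocab_size=10000, embed_dim=128, hidden_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        # Gated recurrent layer: LSTM cells with input, forget, and output gates.
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.head = nn.Linear(hidden_dim, vocab_size)

    def forward(self, tokens):
        x = self.embed(tokens)      # (batch, seq_len, embed_dim)
        out, _ = self.lstm(x)       # (batch, seq_len, hidden_dim)
        return self.head(out)       # next-token logits

# Usage: next-token logits for a batch of token ids.
model = LSTMSequenceModel()
logits = model(torch.randint(0, 10000, (4, 32)))  # shape (4, 32, 10000)
```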

First-Order Preconditioning via Hypergradient Descent

1 code implementation • 18 Oct 2019 • Ted Moskovitz, Rui Wang, Janice Lan, Sanyam Kapoor, Thomas Miconi, Jason Yosinski, Aditya Rawal

Standard gradient descent methods are susceptible to a range of issues that can impede training, such as high correlations and different scaling in parameter space. These difficulties can be addressed by second-order approaches that apply a pre-conditioning matrix to the gradient to improve convergence.
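As a rough illustration of what applying a pre-conditioning matrix to the gradient means, the NumPy sketch below takes preconditioned gradient steps on a toy ill-conditioned quadratic and adapts the preconditioner with a generic hypergradient-style update. The objective, step sizes, and update rule are illustrative assumptions, not the paper's algorithm.

```python
import numpy as np

# Toy quadratic with badly scaled parameters: plain gradient descent is slow
# along the flat direction, which is exactly what preconditioning addresses.
A = np.diag([100.0, 1.0])           # ill-conditioned Hessian (assumed example)
loss = lambda w: 0.5 * w @ A @ w
grad = lambda w: A @ w

w = np.array([1.0, 1.0])
P = np.eye(2)                       # preconditioning matrix, adapted online
lr, hyper_lr = 1e-2, 1e-2           # illustrative step sizes

for _ in range(200):
    g = grad(w)
    w_next = w - lr * (P @ g)       # preconditioned gradient step
    # Hypergradient-style update of P: since w_next = w - lr * P @ g,
    # dL(w_next)/dP = -lr * outer(grad(w_next), g); descend that hypergradient.
    P += hyper_lr * lr * np.outer(grad(w_next), g)
    w = w_next

print(loss(w))                      # decreases toward 0 on this toy problem
```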

Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions

1 code implementation • ICML 2020 • Rui Wang, Joel Lehman, Aditya Rawal, Jiale Zhi, Yulun Li, Jeff Clune, Kenneth O. Stanley

Creating open-ended algorithms, which generate their own never-ending stream of novel and appropriately challenging learning opportunities, could help to automate and accelerate progress in machine learning.

Reinforcement Learning (RL)

Synthetic Petri Dish: A Novel Surrogate Model for Rapid Architecture Search

1 code implementation • 27 May 2020 • Aditya Rawal, Joel Lehman, Felipe Petroski Such, Jeff Clune, Kenneth O. Stanley

Neural Architecture Search (NAS) explores a large space of architectural motifs -- a compute-intensive process that often involves ground-truth evaluation of each motif by instantiating it within a large network, and training and evaluating the network with thousands of domain-specific data samples.

Neural Architecture Search
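The ground-truth evaluation cost described above can be made concrete with a toy sketch: each candidate motif is instantiated inside a host network, which is then trained and scored on domain data. The helper names (`instantiate`, `ground_truth_score`), host size, and stand-in data are assumptions for illustration, not code from the paper.

```python
import torch
import torch.nn as nn

def instantiate(motif, width=64):
    """Place a candidate motif inside a larger host network
    (tiny illustrative host; real ground-truth networks are far bigger)."""
    return nn.Sequential(nn.Linear(16, width), motif(), nn.Linear(width, 2))

def ground_truth_score(motif, steps=500):
    """Compute-intensive 'ground truth' evaluation: train the host network on
    data and report its loss. This is the cost a surrogate is meant to avoid."""
    torch.manual_seed(0)
    x, y = torch.randn(2000, 16), torch.randint(0, 2, (2000,))  # stand-in data
    net = instantiate(motif)
    opt = torch.optim.SGD(net.parameters(), lr=0.1)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(steps):
        opt.zero_grad()
        loss = loss_fn(net(x), y)
        loss.backward()
        opt.step()
    return loss.item()

# Compare two architectural motifs by full (expensive) evaluation.
for motif in (nn.ReLU, nn.Tanh):
    print(motif.__name__, ground_truth_score(motif))
```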

Memory Efficient Continual Learning with Transformers

no code implementations • 9 Mar 2022 • Beyza Ermis, Giovanni Zappella, Martin Wistuba, Aditya Rawal, Cedric Archambeau

Moreover, applications increasingly rely on large pre-trained neural networks, such as pre-trained Transformers, since the resources or data may not be available to practitioners in sufficiently large quantities to train a model from scratch.

Continual Learning • text-classification • +1

Continual Learning with Transformers for Image Classification

no code implementations • 28 Jun 2022 • Beyza Ermis, Giovanni Zappella, Martin Wistuba, Aditya Rawal, Cedric Archambeau

This phenomenon is known as catastrophic forgetting, and it is often difficult to prevent due to practical constraints, such as the amount of data that can be stored or the limited computational resources that can be used.

Continual Learning • Image Classification • +2
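One common way to work within the storage constraint mentioned above is a memory-bounded rehearsal buffer. The sketch below uses reservoir sampling and is a generic baseline for mitigating forgetting, not the method proposed in this paper.

```python
import random

class RehearsalBuffer:
    """Generic memory-bounded replay buffer (reservoir sampling), a common
    baseline against catastrophic forgetting; not this paper's method."""
    def __init__(self, capacity=1000):
        self.capacity = capacity
        self.seen = 0
        self.data = []

    def add(self, example):
        self.seen += 1
        if len(self.data) < self.capacity:
            self.data.append(example)
        else:
            # Reservoir sampling keeps a uniform sample of everything seen
            # while respecting the fixed storage budget.
            j = random.randrange(self.seen)
            if j < self.capacity:
                self.data[j] = example

    def sample(self, k):
        return random.sample(self.data, min(k, len(self.data)))

# Usage: mix a few stored old-task examples into each new-task batch.
buffer = RehearsalBuffer(capacity=500)
for example in range(10000):        # stand-in stream of training examples
    buffer.add(example)
replay_batch = buffer.sample(32)
```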

Extreme Miscalibration and the Illusion of Adversarial Robustness

no code implementations • 27 Feb 2024 • Vyas Raina, Samson Tan, Volkan Cevher, Aditya Rawal, Sheng Zha, George Karypis

Deep learning-based Natural Language Processing (NLP) models are vulnerable to adversarial attacks, where small perturbations can cause a model to misclassify.

Adversarial Attack • Adversarial Robustness
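A minimal sketch of the kind of small perturbation the abstract refers to: greedy word substitution against a stand-in classifier until the predicted label flips. The synonym table and `toy_predict` function are hypothetical, purely for illustration.

```python
# Hypothetical near-synonym table used to generate small text perturbations.
SYNONYMS = {"good": ["decent", "fine"], "great": ["okay", "passable"]}

def word_substitution_attack(sentence, predict):
    """Try single near-synonym swaps until the predicted label changes."""
    tokens = sentence.split()
    original = predict(sentence)
    for i, tok in enumerate(tokens):
        for sub in SYNONYMS.get(tok.lower(), []):
            perturbed = " ".join(tokens[:i] + [sub] + tokens[i + 1:])
            if predict(perturbed) != original:
                return perturbed      # small perturbation flipped the label
    return None                       # no successful perturbation found

# Stand-in classifier: a brittle keyword rule, purely for demonstration.
toy_predict = lambda s: "positive" if "good" in s or "great" in s else "negative"
print(word_substitution_attack("the movie was good", toy_predict))
```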
