Search Results for author: Joel Lehman

Found 28 papers, 20 papers with code

Quality-Diversity through AI Feedback

no code implementations • 19 Oct 2023 • Herbie Bradley, Andrew Dai, Hannah Teufel, Jenny Zhang, Koen Oostermeijer, Marco Bellagente, Jeff Clune, Kenneth Stanley, Grégory Schott, Joel Lehman

In many text-generation problems, users may prefer not only a single response, but a diverse range of high-quality outputs from which to choose.

Text Generation

Paper
Add Code

Quality Diversity through Human Feedback

1 code implementation • 18 Oct 2023 • Li Ding, Jenny Zhang, Jeff Clune, Lee Spector, Joel Lehman

Meanwhile, Quality Diversity (QD) algorithms excel at identifying diverse and high-quality solutions but often rely on manually crafted diversity metrics.

Image Generation reinforcement-learning +2

Paper
Code

OMNI: Open-endedness via Models of human Notions of Interestingness

1 code implementation • 2 Jun 2023 • Jenny Zhang, Joel Lehman, Kenneth Stanley, Jeff Clune

An Achilles Heel of open-endedness research is the inability to quantify (and thus prioritize) tasks that are not just learnable, but also $\textit{interesting}$ (e. g., worthwhile and novel).

Paper
Code

Language Model Crossover: Variation through Few-Shot Prompting

1 code implementation • 23 Feb 2023 • Elliot Meyerson, Mark J. Nelson, Herbie Bradley, Adam Gaier, Arash Moradi, Amy K. Hoover, Joel Lehman

The promise of such language model crossover (which is simple to implement and can leverage many different open-source language models) is that it enables a simple mechanism to evolve semantically-rich text representations (with few domain-specific tweaks), and naturally benefits from current progress in language models.

In-Context Learning Language Modelling

507

Paper
Code

Machine Love

no code implementations • 18 Feb 2023 • Joel Lehman

While ML generates much economic value, many of us have problematic relationships with social media and other ML-powered applications.

Artificial Life Philosophy

Paper
Add Code

Evolution through Large Models

no code implementations • 17 Jun 2022 • Joel Lehman, Jonathan Gordon, Shawn Jain, Kamal Ndousse, Cathy Yeh, Kenneth O. Stanley

This paper pursues the insight that large language models (LLMs) trained to generate code can vastly improve the effectiveness of mutation operators applied to programs in genetic programming (GP).

Language Modelling

Paper
Add Code

Ideas for Improving the Field of Machine Learning: Summarizing Discussion from the NeurIPS 2019 Retrospectives Workshop

no code implementations • 21 Jul 2020 • Shagun Sodhani, Mayoore S. Jaiswal, Lauren Baker, Koustuv Sinha, Carl Shneider, Peter Henderson, Joel Lehman, Ryan Lowe

This report documents ideas for improving the field of machine learning, which arose from discussions at the ML Retrospectives workshop at NeurIPS 2019.

BIG-bench Machine Learning

Paper
Add Code

Open Questions in Creating Safe Open-ended AI: Tensions Between Control and Creativity

1 code implementation • 12 Jun 2020 • Adrien Ecoffet, Jeff Clune, Joel Lehman

This paper proposes that open-ended evolution and artificial life have much to contribute towards the understanding of open-ended AI, focusing here in particular on the safety of open-ended search.

Artificial Life

Paper
Code

Reinforcement Learning Under Moral Uncertainty

1 code implementation • 8 Jun 2020 • Adrien Ecoffet, Joel Lehman

An ambitious goal for machine learning is to create agents that behave ethically: The capacity to abide by human moral norms would greatly expand the context in which autonomous agents could be practically and safely deployed, e. g. fully autonomous vehicles will encounter charged moral decisions that complicate their deployment.

Autonomous Vehicles BIG-bench Machine Learning +3

Paper
Code

Synthetic Petri Dish: A Novel Surrogate Model for Rapid Architecture Search

1 code implementation • 27 May 2020 • Aditya Rawal, Joel Lehman, Felipe Petroski Such, Jeff Clune, Kenneth O. Stanley

Neural Architecture Search (NAS) explores a large space of architectural motifs -- a compute-intensive process that often involves ground-truth evaluation of each motif by instantiating it within a large network, and training and evaluating the network with thousands of domain-specific data samples.

Neural Architecture Search

Paper
Code

First return, then explore

2 code implementations • 27 Apr 2020 • Adrien Ecoffet, Joost Huizinga, Joel Lehman, Kenneth O. Stanley, Jeff Clune

The promise of reinforcement learning is to solve complex sequential decision problems autonomously by specifying a high-level reward function only.

Ranked #1 on Atari Games on Atari 2600 Montezuma's Revenge

Montezuma's Revenge reinforcement-learning +1

547

Paper
Code

Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions

1 code implementation • ICML 2020 • Rui Wang, Joel Lehman, Aditya Rawal, Jiale Zhi, Yulun Li, Jeff Clune, Kenneth O. Stanley

Creating open-ended algorithms, which generate their own never-ending stream of novel and appropriately challenging learning opportunities, could help to automate and accelerate progress in machine learning.

Reinforcement Learning (RL)

236

Paper
Code

Learning to Continually Learn

5 code implementations • 21 Feb 2020 • Shawn Beaulieu, Lapo Frati, Thomas Miconi, Joel Lehman, Kenneth O. Stanley, Jeff Clune, Nick Cheney

Continual lifelong learning requires an agent or model to learn many sequentially ordered tasks, building on previous knowledge without catastrophically forgetting it.

Continual Learning Meta-Learning

113

Paper
Code

Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data

3 code implementations • 17 Dec 2019 • Felipe Petroski Such, Aditya Rawal, Joel Lehman, Kenneth O. Stanley, Jeff Clune

This paper introduces GTNs, discusses their potential, and showcases that they can substantially accelerate learning.

Neural Architecture Search

1,164

Paper
Code

Evolvability ES: Scalable and Direct Optimization of Evolvability

1 code implementation • 13 Jul 2019 • Alexander Gajewski, Jeff Clune, Kenneth O. Stanley, Joel Lehman

Designing evolutionary algorithms capable of uncovering highly evolvable representations is an open challenge; such evolvability is important because it accelerates evolution and enables fast adaptation to changing circumstances.

Evolutionary Algorithms Meta-Learning

Paper
Code

Towards Empathic Deep Q-Learning

1 code implementation • 26 Jun 2019 • Bart Bussmann, Jacqueline Heinerman, Joel Lehman

As reinforcement learning (RL) scales to solve increasingly complex tasks, interest continues to grow in the fields of AI safety and machine ethics.

Ethics Q-Learning +1

Paper
Code

Evolutionary Computation and AI Safety: Research Problems Impeding Routine and Safe Real-world Application of Evolution

no code implementations • 24 Jun 2019 • Joel Lehman

Recent developments in artificial intelligence and machine learning have spurred interest in the growing field of AI safety, which studies how to prevent human-harming accidents when deploying AI systems.

BIG-bench Machine Learning

Paper
Add Code

Learning Belief Representations for Imitation Learning in POMDPs

1 code implementation • 22 Jun 2019 • Tanmay Gangwani, Joel Lehman, Qiang Liu, Jian Peng

We consider the problem of imitation learning from expert demonstrations in partially observable Markov decision processes (POMDPs).

Continuous Control Imitation Learning +1

Paper
Code

Go-Explore: a New Approach for Hard-Exploration Problems

3 code implementations • 30 Jan 2019 • Adrien Ecoffet, Joost Huizinga, Joel Lehman, Kenneth O. Stanley, Jeff Clune

Go-Explore can also harness human-provided domain knowledge and, when augmented with it, scores a mean of over 650k points on Montezuma's Revenge.

Ranked #1 on Atari Games on Atari 2600 Pitfall!

Imitation Learning Montezuma's Revenge

547

Paper
Code

Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions

2 code implementations • 7 Jan 2019 • Rui Wang, Joel Lehman, Jeff Clune, Kenneth O. Stanley

Our results show that POET produces a diverse range of sophisticated behaviors that solve a wide range of environmental challenges, many of which cannot be solved by direct optimization alone, or even through a direct-path curriculum-building control algorithm introduced to highlight the critical role of open-endedness in solving ambitious challenges.

236

Paper
Code

An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents

1 code implementation • 17 Dec 2018 • Felipe Petroski Such, Vashisht Madhavan, Rosanne Liu, Rui Wang, Pablo Samuel Castro, Yulun Li, Jiale Zhi, Ludwig Schubert, Marc G. Bellemare, Jeff Clune, Joel Lehman

We lessen this friction, by (1) training several algorithms at scale and releasing trained models, (2) integrating with a previous Deep RL model release, and (3) releasing code that makes it easy for anyone to load, visualize, and analyze such models.

Atari Games Friction +2

201

Paper
Code

An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution

21 code implementations • NeurIPS 2018 • Rosanne Liu, Joel Lehman, Piero Molino, Felipe Petroski Such, Eric Frank, Alex Sergeev, Jason Yosinski

In this paper we show a striking counterexample to this intuition via the seemingly trivial coordinate transform problem, which simply requires learning a mapping between coordinates in (x, y) Cartesian space and one-hot pixel space.

Ranked #866 on Image Classification on ImageNet

Atari Games Image Classification +1

395

Paper
Code

The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities

no code implementations • 9 Mar 2018 • Joel Lehman, Jeff Clune, Dusan Misevic, Christoph Adami, Lee Altenberg, Julie Beaulieu, Peter J. Bentley, Samuel Bernard, Guillaume Beslon, David M. Bryson, Patryk Chrabaszcz, Nick Cheney, Antoine Cully, Stephane Doncieux, Fred C. Dyer, Kai Olav Ellefsen, Robert Feldt, Stephan Fischer, Stephanie Forrest, Antoine Frénoy, Christian Gagné, Leni Le Goff, Laura M. Grabowski, Babak Hodjat, Frank Hutter, Laurent Keller, Carole Knibbe, Peter Krcah, Richard E. Lenski, Hod Lipson, Robert MacCurdy, Carlos Maestre, Risto Miikkulainen, Sara Mitri, David E. Moriarty, Jean-Baptiste Mouret, Anh Nguyen, Charles Ofria, Marc Parizeau, David Parsons, Robert T. Pennock, William F. Punch, Thomas S. Ray, Marc Schoenauer, Eric Shulte, Karl Sims, Kenneth O. Stanley, François Taddei, Danesh Tarapore, Simon Thibault, Westley Weimer, Richard Watson, Jason Yosinski

Biological evolution provides a creative fount of complex and subtle adaptations, often surprising the scientists who discover them.

Artificial Life

Paper
Add Code

Safe Mutations for Deep and Recurrent Neural Networks through Output Gradients

1 code implementation • 18 Dec 2017 • Joel Lehman, Jay Chen, Jeff Clune, Kenneth O. Stanley

While neuroevolution (evolving neural networks) has a successful track record across a variety of domains from reinforcement learning to artificial life, it is rarely applied to large, deep neural networks.

Artificial Life

143

Paper
Code

Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning

14 code implementations • 18 Dec 2017 • Felipe Petroski Such, Vashisht Madhavan, Edoardo Conti, Joel Lehman, Kenneth O. Stanley, Jeff Clune

Here we demonstrate they can: we evolve the weights of a DNN with a simple, gradient-free, population-based genetic algorithm (GA) and it performs well on hard deep RL problems, including Atari and humanoid locomotion.

Evolutionary Algorithms Q-Learning +1

1,616

Paper
Code

ES Is More Than Just a Traditional Finite-Difference Approximator

no code implementations • 18 Dec 2017 • Joel Lehman, Jay Chen, Jeff Clune, Kenneth O. Stanley

However, this ES optimizes for a different gradient than just reward: It optimizes for the average reward of the entire population, thereby seeking parameters that are robust to perturbation.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents

2 code implementations • NeurIPS 2018 • Edoardo Conti, Vashisht Madhavan, Felipe Petroski Such, Joel Lehman, Kenneth O. Stanley, Jeff Clune

Evolution strategies (ES) are a family of black-box optimization algorithms able to train deep neural networks roughly as well as Q-learning and policy gradient methods on challenging deep reinforcement learning (RL) problems, but are much faster (e. g. hours vs. days) because they parallelize better.

Policy Gradient Methods Q-Learning +2

1,616

Paper
Code

Using Indirect Encoding of Multiple Brains to Produce Multimodal Behavior

no code implementations • 26 Apr 2016 • Jacob Schrum, Joel Lehman, Sebastian Risi

Indirect encodings can potentially answer this challenge.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.