Search Results for author: Stephen Chung

Found 8 papers, 3 papers with code

Thinker: Learning to Plan and Act

1 code implementation NeurIPS 2023 Stephen Chung, Ivan Anokhin, David Krueger

This approach eliminates the need for handcrafted planning algorithms by enabling the agent to learn how to plan autonomously and allows for easy interpretation of the agent's plan with visualization.

Unbiased Weight Maximization

no code implementations25 Jul 2023 Stephen Chung

Notably, to our knowledge, this is the first learning rule for a network of Bernoulli-logistic units that is unbiased and scales well with the number of network's units in terms of learning speed.

Reinforcement Learning (RL)

Structural Credit Assignment with Coordinated Exploration

no code implementations25 Jul 2023 Stephen Chung

The first category includes algorithms that enable coordinated exploration among units, such as MAP propagation.

Reinforcement Learning (RL)

Domain Generalization for Robust Model-Based Offline Reinforcement Learning

no code implementations27 Nov 2022 Alan Clark, Shoaib Ahmed Siddiqui, Robert Kirk, Usman Anwar, Stephen Chung, David Krueger

Existing offline reinforcement learning (RL) algorithms typically assume that training data is either: 1) generated by a known policy, or 2) of entirely unknown origin.

Domain Generalization Offline RL +2

Learning by Competition of Self-Interested Reinforcement Learning Agents

1 code implementation19 Oct 2020 Stephen Chung

The high variance arises from the inefficient structural credit assignment since a single reward signal is used to evaluate the collective action of all units.

reinforcement-learning Reinforcement Learning (RL)

MAP Propagation Algorithm: Faster Learning with a Team of Reinforcement Learning Agents

1 code implementation NeurIPS 2021 Stephen Chung

An alternative way of training an artificial neural network is through treating each unit in the network as a reinforcement learning agent, and thus the network is considered as a team of agents.

reinforcement-learning Reinforcement Learning (RL)

Reinforcement Learning with Feedback-modulated TD-STDP

no code implementations29 Aug 2020 Stephen Chung, Robert Kozma

Spiking neuron networks have been used successfully to solve simple reinforcement learning tasks with continuous action set applying learning rules based on spike-timing-dependent plasticity (STDP).

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.