Search Results for author: Qingfeng Lan

Found 9 papers, 6 papers with code

Elephant Neural Networks: Born to Be a Continual Learner

no code implementations2 Oct 2023 Qingfeng Lan, A. Rupam Mahmood

We show that by simply replacing classical activation functions with elephant activation functions, we can significantly improve the resilience of neural networks to catastrophic forgetting.

Class Incremental Learning Incremental Learning +1

Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo

1 code implementation29 May 2023 Haque Ishfaq, Qingfeng Lan, Pan Xu, A. Rupam Mahmood, Doina Precup, Anima Anandkumar, Kamyar Azizzadenesheli

One of the key shortcomings of existing Thompson sampling algorithms is the need to perform a Gaussian approximation of the posterior distribution, which is not a good surrogate in most practical settings.

Efficient Exploration reinforcement-learning +2

Learning to Optimize for Reinforcement Learning

1 code implementation3 Feb 2023 Qingfeng Lan, A. Rupam Mahmood, Shuicheng Yan, Zhongwen Xu

Reinforcement learning (RL) is essentially different from supervised learning and in practice these learned optimizers do not work well even in simple RL tasks.

Inductive Bias Meta-Learning +2

Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation

1 code implementation22 May 2022 Qingfeng Lan, Yangchen Pan, Jun Luo, A. Rupam Mahmood

The experience replay buffer, a standard component in deep reinforcement learning, is often used to reduce forgetting and improve sample efficiency by storing experiences in a large buffer and using them for training later.

reinforcement-learning Reinforcement Learning (RL)

Variational Quantum Soft Actor-Critic

1 code implementation20 Dec 2021 Qingfeng Lan

In this work, we develop a quantum reinforcement learning algorithm based on soft actor-critic -- one of the state-of-the-art methods for continuous control.

Continuous Control reinforcement-learning +1

Predictive Representation Learning for Language Modeling

no code implementations29 May 2021 Qingfeng Lan, Luke Kumar, Martha White, Alona Fyshe

Correlates of secondary information appear in LSTM representations even though they are not part of an \emph{explicitly} supervised prediction task.

Language Modelling Reinforcement Learning (RL) +1

Model-free Policy Learning with Reward Gradients

1 code implementation9 Mar 2021 Qingfeng Lan, Samuele Tosatto, Homayoon Farrahi, A. Rupam Mahmood

As a key component in reinforcement learning, the reward function is usually devised carefully to guide the agent.

Continuous Control Policy Gradient Methods

Maxmin Q-learning: Controlling the Estimation Bias of Q-learning

1 code implementation ICLR 2020 Qingfeng Lan, Yangchen Pan, Alona Fyshe, Martha White

Q-learning suffers from overestimation bias, because it approximates the maximum action value using the maximum estimated action value.

Q-Learning

Reducing Selection Bias in Counterfactual Reasoning for Individual Treatment Effects Estimation

no code implementations19 Dec 2019 Zichen Zhang, Qingfeng Lan, Lei Ding, Yue Wang, Negar Hassanpour, Russell Greiner

We learn two groups of latent random variables, where one group corresponds to variables that only cause selection bias, and the other group is relevant for outcome prediction.

counterfactual Counterfactual Reasoning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.