Search Results for author: Lifeng Liu

Found 7 papers, 4 papers with code

Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models

1 code implementation28 Mar 2024 Ang Lv, Kaiyi Zhang, Yuhan Chen, Yulong Wang, Lifeng Liu, Ji-Rong Wen, Jian Xie, Rui Yan

In this paper, we deeply explore the mechanisms employed by Transformer-based language models in factual recall tasks.

Knowledge-Guided Exploration in Deep Reinforcement Learning

no code implementations26 Oct 2022 Sahisnu Mazumder, Bing Liu, Shuai Wang, Yingxuan Zhu, Xiaotian Yin, Lifeng Liu, Jian Li

This paper proposes a new method to drastically speed up deep reinforcement learning (deep RL) training for problems that have the property of state-action permissibility (SAP).

reinforcement-learning Reinforcement Learning (RL)

Evaluation Framework For Large-scale Federated Learning

1 code implementation3 Mar 2020 Lifeng Liu, Fengda Zhang, Jun Xiao, Chao Wu

Federated learning is proposed as a machine learning setting to enable distributed edge devices, such as mobile phones, to collaboratively learn a shared prediction model while keeping all the training data on device, which can not only take full advantage of data distributed across millions of nodes to train a good model but also protect data privacy.

Federated Learning

Guided Exploration in Deep Reinforcement Learning

no code implementations27 Sep 2018 Sahisnu Mazumder, Bing Liu, Shuai Wang, Yingxuan Zhu, Xiaotian Yin, Lifeng Liu, Jian Li, Yongbing Huang

This paper proposes a new method to drastically speed up deep reinforcement learning (deep RL) training for problems that have the property of \textit{state-action permissibility} (SAP).

reinforcement-learning Reinforcement Learning (RL)

A novel DDPG method with prioritized experience replay

1 code implementation IEEE International Conference on Systems, Man and Cybernetics (SMC) 2017 Yuenan Hou, Lifeng Liu, Qing Wei, Xudong Xu, Chunlin Chen

Recently, a state-of-the-art algorithm, called deep deterministic policy gradient (DDPG), has achieved good performance in many continuous control tasks in the MuJoCo simulator.

Continuous Control OpenAI Gym

Cannot find the paper you are looking for? You can Submit a new open access paper.