no code implementations • 22 Apr 2024 • Zichuan Liu, Zefan Wang, Linjie Xu, Jinyu Wang, Lei Song, Tianchun Wang, Chunlin Chen, Wei Cheng, Jiang Bian
The advent of large language models (LLMs) has revolutionized the field of natural language processing, yet they might be attacked to produce harmful content.
no code implementations • 15 Apr 2024 • Linjie Xu, Zichuan Liu, Alexander Dockhorn, Diego Perez-Liebana, Jinyu Wang, Lei Song, Jiang Bian
One of the notorious issues for Reinforcement Learning (RL) is poor sample efficiency.
1 code implementation • 6 Jun 2023 • Linjie Xu, Zhengyao Jiang, Jinyu Wang, Lei Song, Jiang Bian
Offline reinforcement learning (RL) methodologies enforce constraints on the policy to adhere closely to the behavior policy, thereby stabilizing value learning and mitigating the selection of out-of-distribution (OOD) actions during test time.
no code implementations • 22 May 2023 • Jinghan Yang, Linjie Xu, Lequan Yu
When facing an unsatisfactory prediction from a machine learning model, users can be interested in investigating the underlying reasons and exploring the potential for reversing the outcome.
1 code implementation • 30 May 2022 • Linjie Xu, Jorge Hurtado-Grueso, Dominic Jeurissen, Diego Perez Liebana, Alexander Dockhorn
In this paper, we propose Elastic MCTS, an algorithm that uses state abstraction to play strategy games.
1 code implementation • 21 Apr 2021 • Alexander Dockhorn, Jorge Hurtado-Grueso, Dominik Jeurissen, Linjie Xu, Diego Perez-Liebana
Portfolio methods represent a simple but efficient type of action abstraction which has shown to improve the performance of search-based agents in a range of strategy games.
no code implementations • 17 Apr 2021 • Diego Perez-Liebana, Cristina Guerrero-Romero, Alexander Dockhorn, Linjie Xu, Jorge Hurtado, Dominik Jeurissen
Designing agents that are able to achieve different play-styles while maintaining a competitive level of play is a difficult task, especially for games for which the research community has not found super-human performance yet, like strategy games.
1 code implementation • 12 Feb 2020 • Pengxin Guo, Chang Deng, Linjie Xu, Xiaonan Huang, Yu Zhang
The proposed feature augmentation strategy can be used in many deep multi-task learning models.