no code implementations • 10 Oct 2024 • Nan Fang, Guiliang Liu, Wei Gong
Reinforcement Learning (RL) applied in healthcare can lead to unsafe medical decisions and treatment, such as excessive dosages or abrupt changes, often due to agents overlooking common-sense constraints.
no code implementations • 24 Sep 2024 • Bo Yue, Jian Li, Guiliang Liu
To obtain the optimal constraints in complex environments, Inverse Constrained Reinforcement Learning (ICRL) seeks to recover these constraints from expert demonstrations in a data-driven manner.
no code implementations • 11 Sep 2024 • Guiliang Liu, Sheng Xu, Shicheng Liu, Ashish Gaurav, Sriram Ganapathi Subramanian, Pascal Poupart
This survey encompasses discrete, virtual, and realistic environments for evaluating ICRL agents.
1 code implementation • 28 Aug 2024 • Guanren Qiao, Guorui Quan, Jiawei Yu, Shujun Jia, Guiliang Liu
In evaluating the empirical performance across various real-world datasets, TrafficGamer ensures both fidelity and exploitability of the simulated scenarios, guaranteeing that they not only statically align with real-world traffic distribution but also efficiently capture equilibriums for representing safety-critical scenarios involving multiple agents.
1 code implementation • 24 Jun 2024 • Sriram Ganapathi Subramanian, Guiliang Liu, Mohammed Elmahgiubi, Kasra Rezaee, Pascal Poupart
This work provides a principled ICRL method that can take a confidence level with a set of expert demonstrations and outputs a constraint that is at least as constraining as the true underlying constraint with the desired level of confidence.
no code implementations • 27 Mar 2023 • Rongyu Zhang, Xiaowei Chi, Guiliang Liu, Wenyi Zhang, Yuan Du, Fangxin Wang
Multimodal learning has seen great success mining data features from multiple modalities with remarkable model performance improvement.
2 code implementations • 20 Jun 2022 • Guiliang Liu, Yudong Luo, Ashish Gaurav, Kasra Rezaee, Pascal Poupart
When deploying Reinforcement Learning (RL) agents into a physical system, we must ensure that these agents are well aware of the underlying constraints.
no code implementations • 2 Jun 2022 • Ashish Gaurav, Kasra Rezaee, Guiliang Liu, Pascal Poupart
We consider the setting where the reward function is given, and the constraints are unknown, and propose a method that is able to recover these constraints satisfactorily from the expert data.
no code implementations • NeurIPS 2021 • Guiliang Liu, Xiangyu Sun, Oliver Schulte, Pascal Poupart
We propose a Represent And Mimic (RAMi) framework for training 1) an identifiable latent representation to capture the independent factors of variation for the objects and 2) a mimic tree that extracts the causal impact of the latent features on DRL action values.
no code implementations • ICLR 2022 • Yudong Luo, Guiliang Liu, Haonan Duan, Oliver Schulte, Pascal Poupart
Distributional Reinforcement Learning (RL) differs from traditional RL by estimating the distribution over returns to capture the intrinsic uncertainty of MDPs.
Distributional Reinforcement Learning reinforcement-learning +2
no code implementations • ICLR 2022 • Guiliang Liu, Ashutosh Adhikari, Amir-Massoud Farahmand, Pascal Poupart
The advancement of dynamics models enables model-based planning in complex environments.
1 code implementation • 9 Sep 2021 • Xiangyu Sun, Oliver Schulte, Guiliang Liu, Pascal Poupart
We describe NTS-NOTEARS, a score-based structure learning method for time-series data to learn dynamic Bayesian networks (DBNs) that captures nonlinear, lagged (inter-slice) and instantaneous (intra-slice) relations among variables.
no code implementations • NeurIPS 2020 • Guiliang Liu, Oliver Schulte, Pascal Poupart, Mike Rudd, Mehrsan Javan
This paper develops a new approach for agent representations, based on a Markov game model, that is tailored towards applications in professional ice hockey.
1 code implementation • 4 Jun 2020 • Xiangyu Sun, Jack Davis, Oliver Schulte, Guiliang Liu
This paper addresses the trade-off between Accuracy and Transparency for deep learning applied to sports analytics.
no code implementations • 16 Jul 2018 • Guiliang Liu, Oliver Schulte, Wang Zhu, Qingcan Li
An LMUT is learned using a novel on-line algorithm that is well-suited for an active play setting, where the mimic learner observes an ongoing interaction between the neural net and the environment.
no code implementations • 26 May 2018 • Guiliang Liu, Oliver Schulte
To assess a player's overall performance, we introduce a novel Game Impact Metric (GIM) that aggregates the values of the player's actions.