Search Results for author: Zhihong Deng

Found 7 papers, 3 papers with code

What Hides behind Unfairness? Exploring Dynamics Fairness in Reinforcement Learning

no code implementations16 Apr 2024 Zhihong Deng, Jing Jiang, Guodong Long, Chengqi Zhang

In sequential decision-making problems involving sensitive attributes like race and gender, reinforcement learning (RL) agents must carefully consider long-term fairness while maximizing returns.

Attribute counterfactual +4

Prefix-Tuning Based Unsupervised Text Style Transfer

no code implementations23 Oct 2023 Huiyu Mai, Wenhao Jiang, Zhihong Deng

Unsupervised text style transfer aims at training a generative model that can alter the style of the input sentence while preserving its content without using any parallel data.

Sentence Style Transfer +2

Causal Reinforcement Learning: A Survey

no code implementations4 Jul 2023 Zhihong Deng, Jing Jiang, Guodong Long, Chengqi Zhang

Causality, however, offers a notable advantage as it can formalize knowledge in a systematic manner and leverage invariance for effective knowledge transfer.

reinforcement-learning Transfer Learning

Retrieved Sequence Augmentation for Protein Representation Learning

1 code implementation24 Feb 2023 Chang Ma, Haiteng Zhao, Lin Zheng, Jiayi Xin, Qintong Li, Lijun Wu, Zhihong Deng, Yang Lu, Qi Liu, Lingpeng Kong

RSA links query protein sequences to a set of sequences with similar structures or properties in the database and combines these sequences for downstream prediction.

Property Prediction Representation Learning +1

Undersampling and Cumulative Class Re-decision Methods to Improve Detection of Agitation in People with Dementia

no code implementations7 Feb 2023 Zhidong Meng, Andrea Iaboni, Bing Ye, Kristine Newman, Alex Mihailidis, Zhihong Deng, Shehroz S. Khan

Agitation is one of the most prevalent symptoms in people with dementia (PwD) that can place themselves and the caregiver's safety at risk.

Decision Making

Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning

1 code implementation ICLR 2022 Chenjia Bai, Lingxiao Wang, Zhuoran Yang, Zhihong Deng, Animesh Garg, Peng Liu, Zhaoran Wang

We show that such OOD sampling and pessimistic bootstrapping yields provable uncertainty quantifier in linear MDPs, thus providing the theoretical underpinning for PBRL.

D4RL Offline RL +3

False Correlation Reduction for Offline Reinforcement Learning

1 code implementation24 Oct 2021 Zhihong Deng, Zuyue Fu, Lingxiao Wang, Zhuoran Yang, Chenjia Bai, Tianyi Zhou, Zhaoran Wang, Jing Jiang

Offline reinforcement learning (RL) harnesses the power of massive datasets for resolving sequential decision problems.

D4RL Decision Making +3

Cannot find the paper you are looking for? You can Submit a new open access paper.