no code implementations • ICML 2020 • Hengrui Cai, Wenbin Lu, Rui Song
Estimation of optimal decision rules (ODR) has been extensively investigated recently, however, at present, no testing procedure is proposed to verify whether these ODRs are significantly better than the naive decision rule that always assigning individuals to a fixed treatment option.
1 code implementation • 30 Dec 2023 • Hengrui Cai, ShengJie Liu, Rui Song
This paper explores the causal reasoning of large language models (LLMs) to enhance their interpretability and reliability in advancing artificial intelligence.
1 code implementation • 25 Jun 2023 • Wenbo Zhang, Tong Wu, Yunlong Wang, Yong Cai, Hengrui Cai
With recent advances in natural language processing, rationalization becomes an essential self-explaining diagram to disentangle the black box by selecting a subset of input texts to account for the major variation in prediction.
no code implementations • 24 Mar 2023 • Tao Ma, Hengrui Cai, Zhengling Qi, Chengchun Shi, Eric B. Laber
In real-world applications of reinforcement learning, it is often challenging to obtain a state representation that is parsimonious and satisfies the Markov property without prior knowledge.
1 code implementation • 29 Jan 2023 • Richard A Watson, Hengrui Cai, Xinming An, Samuel McLean, Rui Song
To characterize this heterogeneity, we first conceptualize heterogeneous causal graphs (HCGs) by generalizing the causal graphical model with confounder-based interactions and multiple mediators.
no code implementations • 30 Dec 2022 • Ye Shen, Runzhe Wan, Hengrui Cai, Rui Song
In the new era of personalization, learning the heterogeneous treatment effect (HTE) becomes an inevitable trend with numerous applications.
no code implementations • 17 Nov 2021 • Hengrui Cai, Chengchun Shi, Rui Song, Wenbin Lu
To derive an optimal I2DR, our jump interval-learning method estimates the conditional mean of the outcome given the treatment and the covariates via jump penalized regression, and derives the corresponding optimal I2DR based on the estimated outcome regression function.
no code implementations • 29 Oct 2021 • Ye Shen, Hengrui Cai, Rui Song
We use this probability to conduct valid inference on the online conditional mean estimator under each action and develop the doubly robust interval estimation (DREAM) method to infer the value under the estimated optimal policy in online learning.
1 code implementation • 11 Oct 2021 • Hengrui Cai, Wenbin Lu, Rachel Marceau West, Devan V. Mehrotra, Lingkang Huang
In this paper, we present an optimal subgroup selection rule (SSR) that maximizes the number of selected patients, and in the meantime, achieves the pre-specified clinically meaningful mean outcome, such as the average treatment effect.
no code implementations • 30 May 2021 • Hengrui Cai, Zhihao Cen, Ling Leng, Rui Song
We consider the sequential decision optimization on the periodic environment, that occurs in a wide variety of real-world applications when the data involves seasonality, such as the daily demand of drivers in ride-sharing and dynamic traffic patterns in transportation.
no code implementations • 21 Apr 2021 • Hengrui Cai, Rui Song, Wenbin Lu
We propose an auGmented inverse propensity weighted Experimental and Auxiliary sample-based decision Rule (GEAR) by maximizing the augmented inverse propensity weighted value estimator over a class of decision rules using the experimental sample, with the primary outcome being imputed based on the auxiliary sample.
1 code implementation • 21 Apr 2021 • Hengrui Cai, Wenbin Lu, Rui Song
We consider the optimal decision-making problem in a primary sample of interest with multiple auxiliary sources available.
no code implementations • ICLR 2021 • Hengrui Cai, Rui Song, Wenbin Lu
Under a general causal graph, the exposure may have a direct effect on the outcome and also an indirect effect regulated by a set of mediators.
1 code implementation • NeurIPS 2021 • Hengrui Cai, Chengchun Shi, Rui Song, Wenbin Lu
To handle continuous treatments, we develop a novel estimation method for OPE using deep jump learning.
no code implementations • 28 Sep 2020 • Hengrui Cai, Chengchun Shi, Rui Song, Wenbin Lu
To handle continuous action space, we develop a brand-new deep jump Q-evaluation method for OPE.