no code implementations • 12 Sep 2024 • Rui Duan, Xin Xiong, Jueyi Liu, Katherine P. Liao, Tianxi Cai
To overcome these limitations, we introduce the Federated One-shot Ensemble Clustering (FONT) algorithm, a novel solution tailored for multi-site analyses under such constraints.
no code implementations • 22 Mar 2024 • Tianxi Cai, Feiqing Huang, Ryumei Nakada, Linjun Zhang, Doudou Zhou
To accommodate the statistical analysis of multimodal EHR data, in this paper, we propose a novel multimodal feature embedding generative model and design a multimodal contrastive loss to obtain the multimodal EHR feature representation.
no code implementations • 25 Dec 2023 • Zhiwei Xu, Ziming Gan, Doudou Zhou, Shuting Shen, Junwei Lu, Tianxi Cai
The effective analysis of high-dimensional Electronic Health Record (EHR) data, with substantial potential for healthcare research, presents notable methodological challenges.
no code implementations • 12 Sep 2023 • Xin Xiong, Zijian Guo, Tianxi Cai
Many existing transfer learning methods rely on leveraging information from source data that closely resembles the target data.
no code implementations • 31 May 2023 • Junwei Lu, Jin Yin, Tianxi Cai
To overcome these challenges, we propose to infer the conditional dependency structure among EHR features via a latent graphical block model (LGBM).
1 code implementation • 19 May 2023 • Jun Wen, Jue Hou, Clara-Lea Bonzel, Yihan Zhao, Victor M. Castro, Vivian S. Gainer, Dana Weisenfeld, Tianrun Cai, Yuk-Lam Ho, Vidul A. Panickan, Lauren Costa, Chuan Hong, J. Michael Gaziano, Katherine P. Liao, Junwei Lu, Kelly Cho, Tianxi Cai
We propose a LAbel-efficienT incidenT phEnotyping (LATTE) algorithm to accurately annotate the timing of clinical events from longitudinal EHR data.
no code implementations • 28 Sep 2022 • Tianxi Cai, Dong Xia, Luwan Zhang, Doudou Zhou
Network analysis has been a powerful tool to unveil relationships and interactions among a large number of objects.
no code implementations • 11 Jun 2022 • Doudou Zhou, Yufeng Zhang, Aaron Sonabend-W, Zhaoran Wang, Junwei Lu, Tianxi Cai
Extensive simulations demonstrate the effectiveness of the proposed algorithm.
1 code implementation • 16 Mar 2022 • Yucong Lin, Keming Lu, Sheng Yu, Tianxi Cai, Marinka Zitnik
On a dataset annotated by human experts, REMAP improves text-based disease relation extraction by 10. 0% (accuracy) and 17. 2% (F1-score) by fusing disease knowledge graphs with text information.
no code implementations • 27 Aug 2021 • Sai Li, Tianxi Cai, Rui Duan
With only a small number of communications across participating sites, the proposed method can achieve performance comparable to the pooled analysis where individual-level data are directly pooled together.
no code implementations • 21 May 2021 • Doudou Zhou, Tianxi Cai, Junwei Lu
Besides, we prove the statistical rate for the eigenspace of the underlying matrix, which is comparable to the rate under the independently missing assumption.
no code implementations • 4 May 2021 • Jue Hou, Zijian Guo, Tianxi Cai
Risk modeling with EHR data is challenging due to a lack of direct observations on the disease outcome, and the high dimensionality of the candidate predictors.
no code implementations • 9 Dec 2020 • Aaron Sonabend-W, Nilanjana Laha, Ashwin N. Ananthakrishnan, Tianxi Cai, Rajarshi Mukherjee
2) The surrogate variables we leverage in the modified SSL framework are predictive of the outcome but not informative to the optimal policy or value function.
1 code implementation • 19 Oct 2020 • Jessica Gronsbell, Molei Liu, Lu Tian, Tianxi Cai
In step II, we augment the initial imputations to ensure the consistency of the resulting estimators regardless of the specification of the prediction model or the imputation model.
2 code implementations • NeurIPS 2020 • Aaron Sonabend-W, Junwei Lu, Leo A. Celi, Tianxi Cai, Peter Szolovits
However, the adoption of such policies in practice is often challenging, as they are hard to interpret within the application context, and lack measures of uncertainty for the learned policy value and its decisions.
1 code implementation • 6 Apr 2018 • Luwan Zhang, Katherine Liao, Issac Kohane, Tianxi Cai
To bridge this gap, in this paper we propose a novel spectral clustering method that optimally combines multiple data sources while leveraging the prior distance knowledge.
4 code implementations • 4 Apr 2018 • Andrew L. Beam, Benjamin Kompa, Allen Schmaltz, Inbar Fried, Griffin Weber, Nathan P. Palmer, Xu Shi, Tianxi Cai, Isaac S. Kohane
Word embeddings are a popular approach to unsupervised learning of word relationships that are widely used in natural language processing.
2 code implementations • 2 Apr 2018 • Yan Wang, Nathan Palmer, Qian Di, Joel Schwartz, Isaac Kohane, Tianxi Cai
We propose a computationally and statistically efficient divide-and-conquer (DAC) algorithm to fit sparse Cox regression to massive datasets where the sample size $n_0$ is exceedingly large and the covariate dimension $p$ is not small but $n_0\gg p$.
Computation Applications
no code implementations • 18 Jan 2017 • Abhishek Chakrabortty, Matey Neykov, Raymond Carroll, Tianxi Cai
We consider the recovery of regression coefficients, denoted by $\boldsymbol{\beta}_0$, for a single index model (SIM) relating a binary outcome $Y$ to a set of possibly high dimensional covariates $\boldsymbol{X}$, based on a large but 'unlabeled' dataset $\mathcal{U}$, with $Y$ never observed.
no code implementations • 17 Jan 2017 • Abhishek Chakrabortty, Tianxi Cai
It is often of interest to investigate if and when the unlabeled data can be exploited to improve estimation of the regression parameter in the adopted linear model.
no code implementations • 25 Nov 2015 • Matey Neykov, Jun S. Liu, Tianxi Cai
In the present paper we analyze algorithms based on covariance screening and least squares with $L_1$ penalization (i. e. LASSO) and demonstrate that they can also enjoy optimal (up to a scalar) rescaled sample size in terms of support recovery, albeit under slightly different assumptions on $f$ and $\varepsilon$ compared to the SIR based algorithms.
no code implementations • 8 Apr 2015 • Tianxi Cai, T. Tony Cai, Anru Zhang
Matrix completion has attracted significant recent attention in many fields including statistics, applied mathematics and electrical engineering.
no code implementations • 23 Nov 2013 • Sheng Yu, Tianrun Cai, Tianxi Cai
This paper introduces the design and performance of Narrative Information Linear Extraction (NILE), a natural language processing (NLP) package for EHR analysis that we share with the medical informatics community.