no code implementations • 9 Aug 2023 • Yangming Li, Zhaozhi Qian, Mihaela van der Schaar
While diffusion models have achieved promising performances in data synthesis, they might suffer error propagation because of their cascade structure, where the distributional mismatch spreads and magnifies through the chain of denoising modules.
no code implementations • 8 Jun 2023 • Aleksa Bisercic, Mladen Nikolic, Mihaela van der Schaar, Boris Delibasic, Pietro Lio, Andrija Petrovic
Drawing upon the reasoning capabilities of LLMs, TEMED-LLM goes beyond traditional extraction techniques, accurately inferring tabular features, even when their names are not explicitly mentioned in the text.
no code implementations • 7 Jun 2023 • Elisabeth R. M. Heremans, Nabeel Seedat, Bertien Buyse, Dries Testelmans, Mihaela van der Schaar, Maarten De Vos
As machine learning becomes increasingly prevalent in critical fields such as healthcare, ensuring the safety and reliability of machine learning systems becomes paramount.
1 code implementation • 7 Jun 2023 • Toon Vanderschueren, Alicia Curth, Wouter Verbeke, Mihaela van der Schaar
Machine learning (ML) holds great potential for accurately forecasting treatment outcomes over time, which could ultimately enable the adoption of more individualized treatment strategies in many practical applications.
2 code implementations • 31 May 2023 • Tennison Liu, Jeroen Berrevoets, Zhaozhi Qian, Mihaela van der Schaar
This paper addresses unsupervised representation learning on tabular data containing multiple views generated by distinct sources of measurement.
1 code implementation • 16 May 2023 • Boris van Breugel, Zhaozhi Qian, Mihaela van der Schaar
Generating synthetic data through generative models is gaining interest in the ML community and beyond, promising a future where datasets can be tailored to individual needs.
1 code implementation • 13 Apr 2023 • Jonathan Crabbé, Mihaela van der Schaar
Through this rigorous formalism, we derive (1) two metrics to measure the robustness of any interpretability method with respect to the model symmetry group; (2) theoretical robustness guarantees for some popular interpretability methods and (3) a systematic approach to increase the invariance of any interpretability method with respect to a symmetry group.
no code implementations • 7 Apr 2023 • Boris van Breugel, Mihaela van der Schaar
Generating synthetic data through generative models is gaining interest in the ML community and beyond.
no code implementations • 7 Apr 2023 • Eleonora Giunchiglia, Fergus Imrie, Mihaela van der Schaar, Thomas Lukasiewicz
In the recent years, machine learning has made great advancements that have been at the root of many breakthroughs in different application domains.
2 code implementations • 9 Mar 2023 • Alan Jeffares, Tennison Liu, Jonathan Crabbé, Fergus Imrie, Mihaela van der Schaar
In this work, we introduce Tabular Neural Gradient Orthogonalization and Specialization (TANGOS), a novel framework for regularization in the tabular setting built on latent unit attributions.
no code implementations • 3 Mar 2023 • Jeroen Berrevoets, Krzysztof Kacprzyk, Zhaozhi Qian, Mihaela van der Schaar
To address this challenge and make progress in solving real-world problems, we propose a new way of thinking about causality - we call this causal deep learning.
no code implementations • 2 Mar 2023 • Mahed Abroshan, Michael Burkhart, Oscar Giles, Sam Greenbury, Zoe Kourtzi, Jack Roberts, Mihaela van der Schaar, Jannetta S Steyn, Alan Wilson, May Yong
Machine learning techniques are effective for building predictive models because they identify patterns in large datasets.
2 code implementations • 24 Feb 2023 • Yuchao Qin, Mihaela van der Schaar, Changhee Lee
Clustering time-series data in healthcare is crucial for clinical phenotyping to understand patients' disease progression patterns and to design treatment guidelines tailored to homogeneous patient subgroups.
2 code implementations • 24 Feb 2023 • Samuel Holt, Alihan Hüyük, Zhaozhi Qian, Hao Sun, Mihaela van der Schaar
Many real-world offline reinforcement learning (RL) problems involve continuous-time environments with delays.
1 code implementation • 24 Feb 2023 • Boris van Breugel, Hao Sun, Zhaozhi Qian, Mihaela van der Schaar
In this work we argue for a realistic MIA setting that assumes the attacker has some knowledge of the underlying data distribution.
1 code implementation • 24 Feb 2023 • Alexander Norcliffe, Bogdan Cebere, Fergus Imrie, Pietro Lio, Mihaela van der Schaar
SurvivalGAN outperforms multiple baselines at generating survival data, and in particular addresses the failure modes as measured by the new metrics, in addition to improving downstream performance of survival models trained on the synthetic data.
2 code implementations • 23 Feb 2023 • Alicia Curth, Mihaela van der Schaar
We study the problem of inferring heterogeneous treatment effects (HTEs) from time-to-event data in the presence of competing events.
2 code implementations • 23 Feb 2023 • Nabeel Seedat, Alan Jeffares, Fergus Imrie, Mihaela van der Schaar
However, the use of self-supervision beyond model pretraining and representation learning has been largely unexplored.
1 code implementation • 6 Feb 2023 • Alicia Curth, Mihaela van der Schaar
Personalized treatment effect estimates are often of interest in high-stakes applications -- thus, before deploying a model estimating such effects in practice, one needs to be sure that the best candidate from the ever-growing machine learning toolbox for this task was chosen.
1 code implementation • 28 Jan 2023 • Evgeny S. Saveliev, Mihaela van der Schaar
TemporAI is an open source Python software library for machine learning (ML) tasks involving data with a time component, focused on medicine and healthcare use cases.
no code implementations • 26 Jan 2023 • Alan Jeffares, Tennison Liu, Jonathan Crabbé, Mihaela van der Schaar
In the case of deep ensembles of neural networks, we are provided with the opportunity to directly optimize the true objective: the joint performance of the ensemble as a whole.
1 code implementation • 18 Jan 2023 • Zhaozhi Qian, Bogdan-Constantin Cebere, Mihaela van der Schaar
Synthcity is an open-source software package for innovative use cases of synthetic data in ML fairness, privacy and augmentation across diverse tabular data modalities, including static data, regular and irregular time series, data with censoring, multi-source data, composite data, and more.
no code implementations • 1 Dec 2022 • Jeroen Berrevoets, Krzysztof Kacprzyk, Zhaozhi Qian, Mihaela van der Schaar
With CDL, researchers aim to structure and encode causal knowledge in the extremely flexible representation space of deep learning models.
no code implementations • 11 Nov 2022 • Tennison Liu, Alex J. Chan, Boris van Breugel, Mihaela van der Schaar
It is important to guarantee that machine learning algorithms deployed in the real world do not result in unfairness or unintended social consequences.
no code implementations • 9 Nov 2022 • Nabeel Seedat, Fergus Imrie, Mihaela van der Schaar
However, this remains a nascent area with no standardized framework to guide practitioners to the necessary data-centric considerations or to communicate the design of data-centric driven ML systems.
2 code implementations • 1 Nov 2022 • Fergus Imrie, Alexander Norcliffe, Pietro Lio, Mihaela van der Schaar
To do so, we define predictive groups in terms of linear and non-linear interactions between features.
2 code implementations • 24 Oct 2022 • Nabeel Seedat, Jonathan Crabbé, Ioana Bica, Mihaela van der Schaar
High model performance, on average, can hide that models may systematically underperform on subgroups of the data.
1 code implementation • 21 Oct 2022 • Fergus Imrie, Bogdan Cebere, Eoin F. McKinney, Mihaela van der Schaar
However, the use of machine learning introduces a number of technical and practical challenges that have thus far restricted widespread adoption of such techniques in clinical settings.
2 code implementations • 11 Oct 2022 • Alex J. Chan, Mihaela van der Schaar
Consider making a prediction over new test data without any opportunity to learn from a training set of labelled data - instead given access to a set of expert models and their predictions alongside some limited information about the dataset used to train them.
2 code implementations • 8 Oct 2022 • Ioana Bica, Mihaela van der Schaar
Consider the problem of improving the estimation of conditional average treatment effects (CATE) for a target domain of interest by leveraging related information from a source domain with a different feature space.
2 code implementations • 22 Sep 2022 • Jonathan Crabbé, Mihaela van der Schaar
We further demonstrate empirically that CARs offer (1) more accurate descriptions of how concepts are scattered in the DNN's latent space; (2) global explanations that are closer to human concept annotations and (3) concept-based feature importance that meaningfully relate concepts with each other.
no code implementations • 11 Aug 2022 • Alicia Curth, Alihan Hüyük, Mihaela van der Schaar
We study the problem of adaptively identifying patient subpopulations that benefit from a given treatment during a confirmatory clinical trial.
no code implementations • 2 Aug 2022 • Yanke Li, Hatt Tobias, Ioana Bica, Mihaela van der Schaar
The encoder is designed to serve as an inference device on $E$ while the decoder reconstructs each observed variable conditioned on its graphical parents in the DAG and the inferred $E$.
no code implementations • 11 Jul 2022 • Hao Sun, Boris van Breugel, Jonathan Crabbe, Nabeel Seedat, Mihaela van der Schaar
Uncertainty quantification (UQ) is essential for creating trustworthy machine learning models.
no code implementations • 21 Jun 2022 • Krzysztof Kacprzyk, Zhaozhi Qian, Mihaela van der Schaar
To this end, we propose D-CIPHER, which is robust to measurement artifacts and can uncover a new and very general class of differential equations.
2 code implementations • 16 Jun 2022 • Nabeel Seedat, Fergus Imrie, Alexis Bellot, Zhaozhi Qian, Mihaela van der Schaar
To assess solutions to this problem, we propose a controllable simulation environment based on a model of tumor growth for a range of scenarios with irregular sampling reflective of a variety of clinical scenarios.
no code implementations • 16 Jun 2022 • Jonathan Crabbé, Alicia Curth, Ioana Bica, Mihaela van der Schaar
This allows us to evaluate treatment effect estimators along a new and important dimension that has been overlooked in previous work: We construct a benchmarking environment to empirically investigate the ability of personalized treatment effect models to identify predictive covariates -- covariates that determine differential responses to treatment.
1 code implementation • 15 Jun 2022 • Daniel Jarrett, Bogdan Cebere, Tennison Liu, Alicia Curth, Mihaela van der Schaar
Consider the problem of imputing missing values in a dataset.
1 code implementation • 13 Jun 2022 • Jeroen Berrevoets, Nabeel Seedat, Fergus Imrie, Mihaela van der Schaar
Directed acyclic graphs (DAGs) encode a lot of information about a particular distribution in their structure.
2 code implementations • 10 Jun 2022 • Samuel Holt, Zhaozhi Qian, Mihaela van der Schaar
Neural Ordinary Differential Equations model dynamical systems with ODEs learned by neural networks.
no code implementations • ICLR 2022 • Alizée Pace, Alex J. Chan, Mihaela van der Schaar
Building models of human decision-making from observed behaviour is critical to better understand, diagnose and support real-world policies such as clinical care.
2 code implementations • ICLR 2022 • Alex J. Chan, Alicia Curth, Mihaela van der Schaar
Human decision making is well known to be imperfect and the ability to analyse such processes individually is crucial when attempting to aid or improve a decision-maker's ability to perform a task, e. g. to alert them to potential biases or oversights on their part.
3 code implementations • 3 Mar 2022 • Jonathan Crabbé, Mihaela van der Schaar
Unsupervised black-box models are challenging to interpret.
1 code implementation • 25 Feb 2022 • Tobias Hatt, Jeroen Berrevoets, Alicia Curth, Stefan Feuerriegel, Mihaela van der Schaar
While observational data is confounded, randomized data is unconfounded, but its sample size is usually too small to learn heterogeneous treatment effects.
no code implementations • 21 Feb 2022 • Alihan Hüyük, William R. Zame, Mihaela van der Schaar
Modeling the preferences of agents over a set of alternatives is a principal concern in many areas.
1 code implementation • 17 Feb 2022 • Nabeel Seedat, Jonathan Crabbé, Mihaela van der Schaar
These estimators can be used to evaluate the congruence of test instances with respect to the training set, to answer two practically useful questions: (1) which test instances will be reliably predicted by a model trained with the training instances?
1 code implementation • 4 Feb 2022 • Jeroen Berrevoets, Fergus Imrie, Trent Kyono, James Jordon, Mihaela van der Schaar
However, no imputation at all also leads to biased estimates, as missingness determined by treatment introduces bias in covariates.
no code implementations • 7 Dec 2021 • Jeroen Berrevoets, Alicia Curth, Ioana Bica, Eoin McKinney, Mihaela van der Schaar
Choosing the best treatment-plan for each individual patient requires accurate forecasts of their outcome trajectories as a function of the treatment, over time.
1 code implementation • NeurIPS 2021 • Zhaozhi Qian, Alicia Curth, Mihaela van der Schaar
Most existing methods for conditional average treatment effect estimation are designed to estimate the effect of a single cause - only one variable can be intervened on at one time.
no code implementations • NeurIPS 2021 • Daniel Jarrett, Ioana Bica, Mihaela van der Schaar
In this work, we study a generative framework that seeks to combine the strengths of both: Motivated by a moment-matching objective to mitigate compounding error, we optimize a local (but forward-looking) transition policy, where the reinforcement signal is provided by a global (but stepwise-decomposable) energy model trained by contrastive estimation.
no code implementations • NeurIPS 2021 • Yuchao Qin, Fergus Imrie, Alihan Hüyük, Daniel Jarrett, alexander gimson, Mihaela van der Schaar
Significant effort has been placed on developing decision support tools to improve patient care.
1 code implementation • NeurIPS 2021 • Zhaozhi Qian, Yao Zhang, Ioana Bica, Angela Wood, Mihaela van der Schaar
Most of the medical observational studies estimate the causal treatment effects using electronic health records (EHR), where a patient's covariates and outcomes are both observed longitudinally.
no code implementations • NeurIPS 2021 • Ioana Bica, Daniel Jarrett, Mihaela van der Schaar
By leveraging data from multiple environments, we propose Invariant Causal Imitation Learning (ICIL), a novel technique in which we learn a feature representation that is invariant across domains, on the basis of which we learn an imitation policy that matches expert behavior.
1 code implementation • NeurIPS 2021 • Kamile Stankeviciute, Ahmed M. Alaa, Mihaela van der Schaar
Current approaches for multi-horizon time series forecasting using recurrent neural networks (RNNs) focus on issuing point estimates, which is insufficient for decision-making in critical application domains where an uncertainty estimate is also required.
1 code implementation • NeurIPS 2021 • Trent Kyono, Yao Zhang, Alexis Bellot, Mihaela van der Schaar
Missing data is an important problem in machine learning practice.
1 code implementation • NeurIPS 2021 • Jonathan Crabbé, Zhaozhi Qian, Fergus Imrie, Mihaela van der Schaar
SimplEx uses the corpus to improve the user's understanding of the latent space with post-hoc explanations answering two questions: (1) Which corpus examples explain the prediction issued for a given test example?
1 code implementation • NeurIPS 2021 • Alicia Curth, Changhee Lee, Mihaela van der Schaar
We study the problem of inferring heterogeneous treatment effects from time-to-event data.
1 code implementation • NeurIPS 2021 • Boris van Breugel, Trent Kyono, Jeroen Berrevoets, Mihaela van der Schaar
In this paper, we introduce DECAF: a GAN-based fair synthetic data generator for tabular data.
no code implementations • ICLR 2022 • Changhee Lee, Fergus Imrie, Mihaela van der Schaar
Discovering relevant input features for predicting a target variable is a key scientific question.
no code implementations • ICLR 2022 • Zhaozhi Qian, Krzysztof Kacprzyk, Mihaela van der Schaar
In the experiments, D-CODE successfully discovered the governing equations of a diverse range of dynamical systems under challenging measurement settings with high noise and infrequent sampling.
no code implementations • 8 Sep 2021 • Mahed Abroshan, Kai Hou Yip, Cem Tekin, Mihaela van der Schaar
Secondly, such datasets are usually imperfect, additionally cursed with missing values in the attributes of features.
1 code implementation • 6 Aug 2021 • Yao Zhang, Jeroen Berrevoets, Mihaela van der Schaar
Conditional average treatment effects (CATEs) allow us to understand the effect heterogeneity across a large population of individuals.
no code implementations • 28 Jul 2021 • Alicia Curth, Mihaela van der Schaar
The machine learning toolbox for estimation of heterogeneous treatment effects from observational data is expanding rapidly, yet many of its algorithms have been evaluated only on a very limited set of semi-synthetic benchmark datasets.
2 code implementations • 13 Jul 2021 • Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar
Understanding a decision-maker's priorities by observing their behavior is critical for transparency and accountability in decision processes, such as in healthcare.
1 code implementation • 9 Jun 2021 • Jonathan Crabbé, Mihaela van der Schaar
How can we explain the predictions of a machine learning model?
1 code implementation • 8 Jun 2021 • Alex J. Chan, Ioana Bica, Alihan Huyuk, Daniel Jarrett, Mihaela van der Schaar
Understanding decision-making in clinical environments is of paramount importance if we are to bring the strengths of machine learning to ultimately improve patient outcomes.
1 code implementation • NeurIPS 2021 • Alicia Curth, Mihaela van der Schaar
We investigate how to exploit structural similarities of an individual's potential outcomes (POs) under different treatments to obtain better estimates of conditional average treatment effects in finite samples.
no code implementations • NeurIPS 2021 • Zhaozhi Qian, William R. Zame, Lucas M. Fleuren, Paul Elbers, Mihaela van der Schaar
To close this gap, we propose the latent hybridisation model (LHM) that integrates a system of expert-designed ODEs with machine-learned Neural ODEs to fully describe the dynamics of the system and to link the expert and latent variables to observable quantities.
2 code implementations • ICLR 2022 • Alexis Bellot, Kim Branson, Mihaela van der Schaar
In this paper, we consider score-based structure learning for the study of dynamical systems.
no code implementations • 28 Mar 2021 • Alexis Bellot, Mihaela van der Schaar
Unobserved confounding is one of the greatest challenges for causal discovery.
no code implementations • 23 Feb 2021 • Victor D. Bourgin, Ioana Bica, Mihaela van der Schaar
Medical time-series datasets have unique characteristics that make prediction tasks challenging.
2 code implementations • 17 Feb 2021 • Ahmed M. Alaa, Boris van Breugel, Evgeny Saveliev, Mihaela van der Schaar
In this paper, we introduce a 3-dimensional evaluation metric, ($\alpha$-Precision, $\beta$-Recall, Authenticity), that characterizes the fidelity, diversity and generalization performance of any generative model in a domain-agnostic fashion.
2 code implementations • 12 Feb 2021 • Alex J. Chan, Mihaela van der Schaar
Bayesian inference over the reward presents an ideal solution to the ill-posed nature of the inverse reinforcement learning problem.
no code implementations • 11 Feb 2021 • Trent Kyono, Ioana Bica, Zhaozhi Qian, Mihaela van der Schaar
We leverage the invariance of causal structures across domains to propose a novel model selection metric specifically designed for ITE methods under the UDA setting.
1 code implementation • 5 Feb 2021 • Changhee Lee, Mihaela van der Schaar
Due to non-uniformity and technical limitations in omics platforms, such integrative analyses on multiple omics, which we refer to as views, involve learning from incomplete observations with various view-missing patterns.
1 code implementation • 2 Feb 2021 • Alexis Bellot, Mihaela van der Schaar
Counterfactual estimation using synthetic controls is one of the most successful recent methodological developments in causal inference.
1 code implementation • 28 Jan 2021 • Can Xu, Ahmed M. Alaa, Ioana Bica, Brent D. Ershoff, Maxime Cannesson, Mihaela van der Schaar
Organ transplantation is often the last resort for treating end-stage illness, but the probability of a successful transplantation depends greatly on compatibility between donors and recipients.
2 code implementations • 26 Jan 2021 • Alicia Curth, Mihaela van der Schaar
The need to evaluate treatment effectiveness is ubiquitous in most of empirical science, and interest in flexibly investigating effect heterogeneity is growing rapidly.
no code implementations • 26 Jan 2021 • Hyun-Suk Lee, Cong Shen, William Zame, Jang-Won Lee, Mihaela van der Schaar
Phase I clinical trials are designed to test the safety (non-toxicity) of drugs and find the maximum tolerated dose (MTD).
no code implementations • 19 Jan 2021 • Setareh Maghsudi, Andrew Lan, Jie Xu, Mihaela van der Schaar
The objective of personalized learning is to design an effective knowledge acquisition track that matches the learner's strengths and bypasses her weaknesses to ultimately meet her desired goal.
1 code implementation • ICLR 2021 • Alihan Hüyük, Daniel Jarrett, Cem Tekin, Mihaela van der Schaar
Understanding human behavior from observed data is critical for transparency and accountability in decision-making.
no code implementations • ICLR 2021 • Daniel Jarrett, Jinsung Yoon, Ioana Bica, Zhaozhi Qian, Ari Ercole, Mihaela van der Schaar
Despite exponential growth in electronic patient data, there is a remarkable gap between the potential and realized utilization of ML for clinical research and decision support.
no code implementations • ICLR 2021 • Alex James Chan, Mihaela van der Schaar
Bayesian inference over the reward presents an ideal solution to the ill-posed nature of the inverse reinforcement learning problem.
no code implementations • 1 Jan 2021 • Zhaozhi Qian, Yao Zhang, Ioana Bica, Angela Wood, Mihaela van der Schaar
Estimating causal treatment effects using observational data is a problem with few solutions when the confounder has a temporal structure, e. g. the history of disease progression might impact both treatment decisions and clinical outcomes.
no code implementations • ICLR 2021 • Ahmed Alaa, Alex James Chan, Mihaela van der Schaar
Generating synthetic time-series data is crucial in various application domains, such as medical prognosis, wherein research is hamstrung by the lack of access to data due to concerns over privacy.
no code implementations • 8 Dec 2020 • James Jordon, Alan Wilson, Mihaela van der Schaar
Many ground-breaking advancements in machine learning can be attributed to the availability of a large volume of rich data.
no code implementations • NeurIPS 2020 • Jeroen Berrevoets, James Jordon, Ioana Bica, alexander gimson, Mihaela van der Schaar
Transplant-organs are a scarce medical resource.
1 code implementation • NeurIPS 2020 • Jinsung Yoon, Yao Zhang, James Jordon, Mihaela van der Schaar
We also introduce a novel tabular data augmentation method for self- and semi-supervised learning frameworks.
1 code implementation • NeurIPS 2020 • Yao Zhang, Mihaela van der Schaar
A dynamic treatment regime (DTR) is a sequence of treatment rules indicating how to individualize treatments for a patient based on the previously assigned treatments and the evolving covariate history.
1 code implementation • NeurIPS 2020 • Jonathan Crabbé, Yao Zhang, William Zame, Mihaela van der Schaar
Machine Learning has proved its ability to produce accurate models but the deployment of these models outside the machine learning community has been hindered by the difficulties of interpreting these models.
1 code implementation • NeurIPS 2020 • Trent Kyono, Yao Zhang, Mihaela van der Schaar
Regularization improves generalization of supervised models to out-of-sample data.
1 code implementation • 14 Aug 2020 • Alicia Curth, Ahmed M. Alaa, Mihaela van der Schaar
Within this framework, we propose two general learning algorithms that build on the idea of nonparametric plug-in bias removal via IFs: the 'IF-learner' which uses pseudo-outcomes motivated by uncentered IFs for regression in large samples and outputs entire target functions without confidence bands, and the 'Group-IF-learner', which outputs only approximations to a function but can give confidence estimates if sufficient information on coarsening mechanisms is available.
no code implementations • 27 Jul 2020 • Zhaozhi Qian, Ahmed M. Alaa, Mihaela van der Schaar
The coronavirus disease 2019 (COVID-19) global pandemic poses the threat of overwhelming healthcare systems with unprecedented demands for intensive care resources.
1 code implementation • 23 Jul 2020 • James Jordon, Daniel Jarrett, Jinsung Yoon, Tavian Barnes, Paul Elbers, Patrick Thoral, Ari Ercole, Cheng Zhang, Danielle Belgrave, Mihaela van der Schaar
The clinical time-series setting poses a unique combination of challenges to data modeling and sharing.
no code implementations • 21 Jul 2020 • Alexis Bellot, Mihaela van der Schaar
Part of the challenge of learning robust models lies in the influence of unobserved confounders that void many of the invariances and principles of minimum error presently used for this problem.
no code implementations • ICLR 2021 • Ioana Bica, Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar
Building interpretable parameterizations of real-world decision-making on the basis of demonstrated behavior -- i. e. trajectories of observations and actions made by an expert maximizing some unknown reward function -- is essential for introspecting and auditing policies in different institutions.
1 code implementation • ICML 2020 • Ahmed M. Alaa, Mihaela van der Schaar
Deep learning models achieve high predictive accuracy across a broad spectrum of tasks, but rigorously quantifying their predictive uncertainty remains challenging.
no code implementations • ICML 2020 • Alex J. Chan, Ahmed M. Alaa, Zhaozhi Qian, Mihaela van der Schaar
In this paper, we develop an approximate Bayesian inference scheme based on posterior regularisation, wherein unlabelled target data are used as "pseudo-labels" of model confidence that are used to regularise the model's loss on labelled source data.
no code implementations • ICML 2020 • Daniel Jarrett, Mihaela van der Schaar
Finally, we illustrate how this formulation enables understanding decision-making behavior by quantifying preferences implicit in observed decision strategies (the inverse problem).
1 code implementation • NeurIPS 2020 • Daniel Jarrett, Ioana Bica, Mihaela van der Schaar
Through experiments with application to control and healthcare settings, we illustrate consistent performance gains over existing algorithms for strictly batch imitation learning.
no code implementations • 24 Jun 2020 • Yao Zhang, William Zame, Mihaela van der Schaar
Successful application of machine learning models to real-world prediction problems, e. g. financial forecasting and personalized medicine, has proved to be challenging, because such settings require limiting and quantifying the uncertainty in the model predictions, i. e. providing valid and accurate prediction intervals.
1 code implementation • ICML 2020 • Ahmed M. Alaa, Mihaela van der Schaar
Recurrent neural networks (RNNs) are instrumental in modelling sequential and time-series data.
1 code implementation • ICML 2020 • Changhee Lee, Mihaela van der Schaar
In this paper, we develop a deep learning approach for clustering time-series data, where each cluster comprises patients who share similar future outcomes of interest (e. g., adverse events, the onset of comorbidities).
1 code implementation • NeurIPS 2020 • Hyun-Suk Lee, Yao Zhang, William Zame, Cong Shen, Jang-Won Lee, Mihaela van der Schaar
Most of the current methods of subgroup analysis begin with a particular algorithm for estimating individualized treatment effects (ITE) and identify subgroups by maximizing the difference across subgroups of the average treatment effect in each subgroup.
no code implementations • ICML 2020 • Cong Shen, Zhiyang Wang, Sofia S. Villar, Mihaela van der Schaar
Phase I dose-finding trials are increasingly challenging as the relationship between efficacy and toxicity of new compounds (or combination of them) becomes more complex.
1 code implementation • NeurIPS 2020 • Zhaozhi Qian, Ahmed M. Alaa, Mihaela van der Schaar
To this end, this paper develops a Bayesian model for predicting the effects of COVID-19 lockdown policies in a global context -- we treat each country as a distinct data point, and exploit variations of policies across countries to learn country-specific policy effects.
no code implementations • 13 Apr 2020 • Setareh Maghsudi, Mihaela van der Schaar
The former problem boils down to a stochastic knapsack problem, and we cast the latter as a multi-armed bandit problem.
1 code implementation • NeurIPS 2020 • Ioana Bica, James Jordon, Mihaela van der Schaar
While much attention has been given to the problem of estimating the effect of discrete interventions from observational data, relatively little work has been done in the setting of continuous-valued interventions, such as treatments associated with a dosage parameter.
3 code implementations • ICLR 2020 • Ioana Bica, Ahmed M. Alaa, James Jordon, Mihaela van der Schaar
Identifying when to give treatments to patients and how to select among multiple treatments over time are important medical problems with a few existing solutions.
no code implementations • ICLR 2020 • Daniel Jarrett, Mihaela van der Schaar
Autoencoder-based learning has emerged as a staple for disciplining representations in unsupervised and semi-supervised settings.
1 code implementation • 14 Jan 2020 • Yao Zhang, Alexis Bellot, Mihaela van der Schaar
The choice of making an intervention depends on its potential benefit or harm in comparison to alternatives.
1 code implementation • 12 Jan 2020 • Yao Zhang, Daniel Jarrett, Mihaela van der Schaar
In this paper, we propose a novel Bayesian optimization (BO) algorithm to tackle the challenge of model selection in this setting.
1 code implementation • 8 Jan 2020 • Hyun-Suk Lee, Cong Shen, James Jordon, Mihaela van der Schaar
In addition, patient recruitment can be difficult by the fact that clinical trials do not aim to provide a benefit to any given patient in the trial.
1 code implementation • 8 Jan 2020 • Zhaozhi Qian, Ahmed M. Alaa, Alexis Bellot, Jem Rashbass, Mihaela van der Schaar
Comorbid diseases co-occur and progress via complex temporal patterns that vary among individuals.
no code implementations • ICLR 2020 • Ioana Bica, James Jordon, Mihaela van der Schaar
Our model consists of 3 blocks: (1) a generator, (2) a discriminator, (3) an inference block.
no code implementations • 19 Dec 2019 • Alexis Bellot, Mihaela van der Schaar
Analyzing electronic health records (EHR) poses significant challenges because often few samples are available describing a patient's health and, when available, their information content is highly diverse.
1 code implementation • 1 Dec 2019 • Jinsung Yoon, Daniel Jarrett, Mihaela van der Schaar
A good generative model for time-series data should preservetemporal dynamics, in the sense that new sequences respect the original relationships between variablesacross time.
1 code implementation • NeurIPS 2019 • Ahmed M. Alaa, Mihaela van der Schaar
Models of disease progression are instrumental for predicting patient outcomes and understanding disease dynamics.
1 code implementation • NeurIPS 2019 • James Jordon, Jinsung Yoon, Mihaela van der Schaar
The second benefit is that, through analysis that we provide inthe paper, we can derive tighter differential privacy guarantees when several queriesare made to this mechanism.
1 code implementation • NeurIPS 2019 • Ahmed M. Alaa, Mihaela van der Schaar
A symbolic metamodel is a model of a model, i. e., a surrogate model of a trained (machine learning) model expressed through a succinct symbolic expression that comprises familiar mathematical functions and can be subjected to symbolic manipulation.
no code implementations • 27 Nov 2019 • Trent Kyono, Mihaela van der Schaar
We show in this paper how prior knowledge in the form of a causal graph can be utilized to guide model selection, i. e., to identify from a set of trained networks the models that are the most robust and invariant to unseen domains.
no code implementations • 25 Sep 2019 • Changhee Lee, Mihaela van der Schaar
In this paper, we develop a deep learning approach for clustering time-series data, where each cluster comprises patients who share similar future outcomes of interest (e. g., adverse events, the onset of comorbidities, etc.).
no code implementations • 25 Sep 2019 • Ahmed M. Alaa, Mihaela van der Schaar
To address this question, we develop the discriminative jackknife (DJ), a formal inference procedure that constructs predictive confidence intervals for a wide range of deep learning models, is easy to implement, and provides rigorous theoretical guarantees on (1) and (2).
1 code implementation • NeurIPS 2019 • Alexis Bellot, Mihaela van der Schaar
We consider the hypothesis testing problem of detecting conditional dependence, with a focus on high-dimensional feature spaces.
no code implementations • 9 Jul 2019 • Alexis Bellot, Mihaela van der Schaar
We present a general framework for hypothesis testing on distributions of sets of individual examples.
1 code implementation • 16 Jun 2019 • Jinsung Yoon, James Jordon, Mihaela van der Schaar
The predictor network uses the observations selected by the selector network to predict a label, providing feedback to the selector network (well-selected variables should be predictive of the label).
no code implementations • 29 May 2019 • Yao Zhang, James Jordon, Ahmed M. Alaa, Mihaela van der Schaar
In this paper, we present Lifelong Bayesian Optimization (LBO), an online, multitask Bayesian optimization (BO) algorithm designed to solve the problem of model selection for datasets arriving and evolving over time.
1 code implementation • ICLR 2019 • Jinsung Yoon, James Jordon, Mihaela van der Schaar
Machine learning has the potential to assist many communities in using the large datasets that are becoming more and more available.
Ranked #2 on
Synthetic Data Generation
on UCI Epileptic Seizure Recognition
(using extra training data)
1 code implementation • ICLR 2019 • James Jordon, Jinsung Yoon, Mihaela van der Schaar
We demonstrate the capability of our model to perform feature selection, showing that it performs as well as the originally proposed knockoff generation model in the Gaussian setting and that it outperforms the original model in non-Gaussian settings, including on a real-world dataset.
no code implementations • ICLR 2019 • Changhee Lee, Mihaela van der Schaar
Currently available survival analysis methods are limited in their ability to deal with complex, heterogeneous, and longitudinal data such as that available in primary care records, or in their ability to deal with multiple competing risks.
1 code implementation • ICLR 2019 • Jinsung Yoon, James Jordon, Mihaela van der Schaar
The advent of big data brings with it data with more and more dimensions and thus a growing need to be able to efficiently select which features to use for a variety of problems.
no code implementations • 28 Apr 2019 • Deniz Gunduz, Paul de Kerret, Nicholas D. Sidiropoulos, David Gesbert, Chandra Murthy, Mihaela van der Schaar
Thanks to the recent advances in processing speed and data acquisition and storage, machine learning (ML) is penetrating every facet of our lives, and transforming research in many areas in a fundamental manner.
2 code implementations • ICML 2020 • Ioana Bica, Ahmed M. Alaa, Mihaela van der Schaar
The estimation of treatment effects is a pervasive problem in medicine.
no code implementations • NeurIPS 2018 • Alexis Bellot, Mihaela van der Schaar
The co-occurrence of multiple diseases among the general population is an important problem as those patients have more risk of complications and represent a large share of health care expenditure.
no code implementations • 27 Nov 2018 • Owen Lahav, Nicholas Mastronarde, Mihaela van der Schaar
Our results demonstrate that ML experts cannot accurately predict which system outputs will maximize clinicians' confidence in the underlying neural network model, and suggest additional findings that have broad implications to the future of research into ML interpretability and the use of ML in medicine.
no code implementations • 26 Nov 2018 • Daniel Jarrett, Jinsung Yoon, Mihaela van der Schaar
Accurate prediction of disease trajectories is critical for early identification and timely treatment of patients at risk.
1 code implementation • 22 Nov 2018 • Carl Rietschel, Jinsung Yoon, Mihaela van der Schaar
Deep learning models for survival analysis have gained significant attention in the literature, but they suffer from severe performance deficits when the dataset contains many irrelevant features.
no code implementations • 21 Nov 2018 • Changhee Lee, Nicholas Mastronarde, Mihaela van der Schaar
Estimating the individual treatment effect (ITE) from observational data is essential in medicine.
no code implementations • 2 Nov 2018 • Kartik Ahuja, Mihaela van der Schaar
A clinician desires to use a risk-stratification method that achieves confident risk-stratification - the risk estimates of the different patients reflect the true risks with a high probability.
1 code implementation • 26 Oct 2018 • Kartik Ahuja, Mihaela van der Schaar
We use the new metric to develop a variable importance ranking approach.
Methodology
no code implementations • 24 Oct 2018 • Ahmed M. Alaa, Mihaela van der Schaar
In this paper, we develop the phased attentive state space (PASS) model of disease progression, a deep probabilistic model that captures complex representations for disease progression while maintaining clinical interpretability.
Ranked #1 on
Disease Trajectory Forecasting
on UK CF trust
no code implementations • 5 Oct 2018 • Onur Atan, William R. Zame, Mihaela van der Schaar
Randomized Controlled Trials (RCTs) are the gold standard for comparing the effectiveness of a new treatment to the current one (the control).
no code implementations • ICLR 2018 • Anton Nemchenko, Trent Kyono, Mihaela van der Schaar
Survival analysis in the presence of multiple possible adverse events, i. e., competing risks, is a pervasive problem in many industries (healthcare, finance, etc.).
no code implementations • 6 Jul 2018 • Bryan Lim, Mihaela van der Schaar
Joint models for longitudinal and time-to-event data are commonly used in longitudinal studies to forecast disease trajectories over time.
1 code implementation • 29 Jun 2018 • James Jordon, Jinsung Yoon, Mihaela van der Schaar
Machine learning has the potential to assist many communities in using the large datasets that are becoming more and more available.
1 code implementation • 27 Jun 2018 • Kartik Ahuja, William Zame, Mihaela van der Schaar
Piecewise local-linear models provide a natural way to extend local-linear models to explain the global behavior of the model.
10 code implementations • ICML 2018 • Jinsung Yoon, James Jordon, Mihaela van der Schaar
Accordingly, we call our method Generative Adversarial Imputation Nets (GAIN).
no code implementations • 27 Mar 2018 • Bryan Lim, Mihaela van der Schaar
Joint models for longitudinal and time-to-event data are commonly used in longitudinal studies to forecast disease trajectories over time.
1 code implementation • ICML 2018 • Ahmed M. Alaa, Mihaela van der Schaar
AUTOPROGNOSIS optimizes ensembles of pipeline configurations efficiently using a novel batched Bayesian optimization (BO) algorithm that learns a low-dimensional decomposition of the pipelines high-dimensional hyperparameter space in concurrence with the BO procedure.
2 code implementations • ICML 2018 • Jinsung Yoon, James Jordon, Mihaela van der Schaar
Training complex machine learning models for prediction often requires a large amount of data that is not always readily available.
1 code implementation • ICLR 2018 • Jinsung Yoon, James Jordon, Mihaela van der Schaar
Estimating individualized treatment effects (ITE) is a challenging task due to the need for an individual's potential outcomes to be learned from biased data and without having access to the counterfactuals.
Ranked #2 on
Causal Inference
on Jobs
1 code implementation • ICLR 2018 • Jinsung Yoon, William R. Zame, Mihaela van der Schaar
At runtime, the operator prescribes a performance level or a cost constraint, and Deep Sensing determines what measurements to take and what to infer from those measurements, and then issues predictions.
no code implementations • 24 Dec 2017 • Ahmed M. Alaa, Mihaela van der Schaar
We investigate the problem of estimating the causal effect of a treatment on individual subjects from observational data, this is a central problem in various application domains, including healthcare, social sciences, and online advertising.
no code implementations • NeurIPS 2017 • Kartik Ahuja, William Zame, Mihaela van der Schaar
However, there has been limited work to address the personalized screening for these different diseases.
2 code implementations • 23 Nov 2017 • Jinsung Yoon, William R. Zame, Mihaela van der Schaar
Existing methods address this estimation problem by interpolating within data streams or imputing across data streams (both of which ignore important information) or ignoring the temporal aspect of the data and imposing strong assumptions about the nature of the data-generating process and/or the pattern of missing data (both of which are especially problematic for medical data).
1 code implementation • 19 Jun 2017 • Ahmed M. Alaa, Michael Weisz, Mihaela van der Schaar
The network is trained in alternating phases, where in each phase we use the training examples of one of the two potential outcomes (treated and control populations) to update the weights of the shared layers and the respective outcome-specific layers.
no code implementations • 5 Jun 2017 • Jinsung Yoon, William R. Zame, Mihaela van der Schaar
Our approach constructs a tree of subsets of the feature space and associates a predictor (predictive model) - determined by training one of a given family of base learners on an endogenously determined training set - to each node of the tree; we call the resulting object a tree of predictors.
no code implementations • 22 May 2017 • Ahmed M. Alaa, Jinsung Yoon, Scott Hu, Mihaela van der Schaar
We report the development and validation of a data-driven real-time risk score that provides timely assessments for the clinical acuity of ward patients based on their temporal lab tests and vital signs, which allows for timely intensive care unit (ICU) admissions.
no code implementations • ICML 2017 • Ahmed M. Alaa, Scott Hu, Mihaela van der Schaar
Our model captures "informatively sampled" patient episodes: the clinicians' decisions on when to observe a hospitalized patient's vital signs and lab tests over time are represented by a marked Hawkes process, with intensity parameters that are modulated by the patient's latent clinical states, and with observable physiological data (mark process) modeled as a switching multi-task Gaussian process.
no code implementations • 10 May 2017 • Sabrina Klos, Cem Tekin, Mihaela van der Schaar, Anja Klein
In our algorithm, a local controller (LC) in the mobile device of a worker regularly observes the worker's context, her/his decisions to accept or decline tasks and the quality in completing tasks.
1 code implementation • NeurIPS 2017 • Ahmed M. Alaa, Mihaela van der Schaar
Stemming from the potential outcomes model, we propose a novel multi- task learning framework in which factual and counterfactual outcomes are mod- eled as the outputs of a function in a vector-valued reproducing kernel Hilbert space (vvRKHS).
no code implementations • 23 Dec 2016 • Onur Atan, William R. Zame, Qiaojun Feng, Mihaela van der Schaar
This paper proposes a novel approach for constructing effective personalized policies when the observed data lacks counter-factual information, is biased and possesses many features.
no code implementations • 18 Dec 2016 • Ahmed M. Alaa, Mihaela van der Schaar
Modeling continuous-time physiological processes that manifest a patient's evolving clinical states is a key step in approaching many problems in healthcare.
no code implementations • NeurIPS 2016 • William Hoiles, Mihaela van der Schaar
Estimating patient's clinical state from multiple concurrent physiological streams plays an important role in determining if a therapeutic intervention is necessary and for triaging patients in the hospital.
no code implementations • 16 Nov 2016 • Ahmed M. Alaa, Jinsung Yoon, Scott Hu, Mihaela van der Schaar
Critically ill patients in regular wards are vulnerable to unanticipated clinical dete- rioration which requires timely transfer to the intensive care unit (ICU).
no code implementations • 12 Nov 2016 • Jinsung Yoon, Ahmed M. Alaa, Martin Cadeiras, Mihaela van der Schaar
Organ transplants can improve the life expectancy and quality of life for the recipient but carries the risk of serious post-operative complications, such as septic shock and organ rejection.
no code implementations • 27 Oct 2016 • Ahmed M. Alaa, Jinsung Yoon, Scott Hu, Mihaela van der Schaar
Objective: In this paper, we develop a personalized real-time risk scoring algorithm that provides timely and granular assessments for the clinical acuity of ward patients based on their (temporal) lab tests and vital signs; the proposed risk scoring system ensures timely intensive care unit (ICU) admissions for clinically deteriorating patients.
no code implementations • NeurIPS 2016 • Ahmed M. Alaa, Mihaela van der Schaar
We develop a Bayesian model for decision-making under time pressure with endogenous information acquisition.
no code implementations • 14 Jun 2016 • Sabrina Müller, Onur Atan, Mihaela van der Schaar, Anja Klein
We derive a sublinear regret bound, which characterizes the learning speed and proves that our algorithm converges to the optimal cache content placement strategy in terms of maximizing the number of cache hits.
no code implementations • 3 May 2016 • Ahmed M. Alaa, Jinsung Yoon, Scott Hu, Mihaela van der Schaar
We develop a personalized real time risk scoring algorithm that provides timely and granular assessments for the clinical acuity of ward patients based on their (temporal) lab tests and vital signs.
no code implementations • 11 Feb 2016 • Onur Atan, Mihaela van der Schaar
In most real-world settings such as recommender systems, finance, and healthcare, collecting useful information is costly and requires an active choice on the part of the decision maker.
no code implementations • 1 Feb 2016 • Ahmed M. Alaa, Kyeong H. Moon, William Hsu, Mihaela van der Schaar
A cluster of patients is a set of patients with similar features (e. g. age, breast density, family history, etc.
no code implementations • 30 Dec 2015 • Jie Xu, Tianwei Xing, Mihaela van der Schaar
Given the variability in student learning it is becoming increasingly important to tailor courses as well as course sequences to student needs.
no code implementations • 23 Dec 2015 • Cem Tekin, Jinsung Yoon, Mihaela van der Schaar
Extracting actionable intelligence from distributed, heterogeneous, correlated and high-dimensional data sources requires run-time processing and learning both locally and globally.
no code implementations • 16 Aug 2015 • Yannick Meier, Jie Xu, Onur Atan, Mihaela van der Schaar
We derive a confidence estimate for the prediction accuracy and demonstrate the performance of our algorithm on a dataset obtained based on the performance of approximately 700 UCLA undergraduate students who have taken an introductory digital signal processing over the past 7 years.
no code implementations • 4 Aug 2015 • Cem Tekin, Mihaela van der Schaar
After the $stop$ action is taken, the learner collects a terminal reward, and observes the costs and terminal rewards associated with each step of the episode.
no code implementations • 29 Mar 2015 • Onur Atan, Cem Tekin, Mihaela van der Schaar
In the case in which rewards of all arms are deterministic functions of a single unknown parameter, we construct a greedy policy that achieves {\em bounded regret}, with a bound that depends on the single true parameter of the problem.
no code implementations • 7 Feb 2015 • Cem Tekin, Mihaela van der Schaar
A key challenge for such systems is to accurately predict what type of content each of its consumers prefers in a certain context, and adapt these predictions to the evolving consumers' preferences, contexts and content characteristics.
no code implementations • 5 Feb 2015 • Cem Tekin, Mihaela van der Schaar
We prove a general regret bound for our algorithm whose time order depends only on the maximum number of relevant dimensions among all the actions, which in the special case where the relevance relation is single-valued (a function), reduces to $\tilde{O}(T^{2(\sqrt{2}-1)})$; in the absence of a relevance relation, the best known contextual bandit algorithms achieve regret $\tilde{O}(T^{(D+1)/(D+2)})$, where $D$ is the full dimension of the context vector.
no code implementations • NeurIPS 2014 • Cem Tekin, Mihaela van der Schaar
When the relation is a function, i. e., the reward of an action only depends on the context of a single type, and the expected reward of an action is Lipschitz continuous in the context of its relevant type, we propose an algorithm that achieves $\tilde{O}(T^{\gamma})$ regret with a high probability, where $\gamma=2/(1+\sqrt{2})$.
no code implementations • 13 Nov 2014 • SaiDhiraj Amuru, Cem Tekin, Mihaela van der Schaar, R. Michael Buehrer
We first present novel online learning algorithms to maximize the jamming efficacy against static transmitter-receiver pairs and prove that our learning algorithm converges to the optimal (in terms of the error rate inflicted at the victim and the energy used) jamming strategy.
no code implementations • 29 Oct 2014 • Onur Atan, Cem Tekin, Mihaela van der Schaar
Specifically, we prove that the parameter-free (worst-case) regret is sublinear in time, and decreases with the informativeness of the arms.
no code implementations • 22 Mar 2014 • Jie Xu, Mihaela van der Schaar, Jiangchuan Liu, Haitao Li
This paper presents a systematic online prediction method (Social-Forecast) that is capable to accurately forecast the popularity of videos promoted by social media.
no code implementations • 26 Sep 2013 • Cem Tekin, Simpson Zhang, Mihaela van der Schaar
In contrast to centralized recommender systems, in which there is a single centralized seller who has access to the complete inventory of items as well as the complete record of sales and user information, in decentralized recommender systems each seller/learner only has access to the inventory of items and user information for its own products and not the products and user information of other sellers, but can get commission if it sells an item of another seller.
no code implementations • 24 Aug 2013 • Luca Canzian, Yu Zhang, Mihaela van der Schaar
We present an efficient distributed online learning scheme to classify data captured from distributed, heterogeneous, and dynamic data sources.
no code implementations • 21 Aug 2013 • Cem Tekin, Mihaela van der Schaar
We model the problem of joint classification by the distributed and heterogeneous learners from multiple data sources as a distributed contextual bandit problem where each data is characterized by a specific context.
no code implementations • 21 Aug 2013 • Cem Tekin, Mihaela van der Schaar
At each moment of time, an instance characterized by a certain context may arrive to each learner; based on the context, the learner can select one of its own actions (which gives a reward and provides information) or request assistance from another learner.