2 code implementations • 1 Mar 2024 • Alihan Hüyük, Qiyao Wei, Alicia Curth, Mihaela van der Schaar
Actions of an expert thus naturally encode part of their domain knowledge, and can help make inferences within the same domain: Knowing doctors try to prescribe the best treatment for their patients, we can tell treatments prescribed more frequently are likely to be more effective.
1 code implementation • 30 Jan 2024 • Alihan Hüyük, Zhaozhi Qian, Mihaela van der Schaar
Clinical trials are typically run in order to understand the effects of a new treatment on a given population of patients.
no code implementations • 23 Nov 2023 • Hao Sun, Alex J. Chan, Nabeel Seedat, Alihan Hüyük, Mihaela van der Schaar
On the one hand, it brings opportunities for safe policy improvement under high-stakes scenarios like clinical guidelines.
no code implementations • 28 Oct 2023 • Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar
Consider learning a decision support assistant to serve as an intermediary between (oracle) expert behavior and (imperfect) human behavior: At each time, the algorithm observes an action chosen by a fallible agent, and decides whether to *accept* that agent's decision, *intervene* with an alternative, or *request* the expert's opinion.
no code implementations • 28 Oct 2023 • Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar
Decision analysis deals with modeling and enhancing decision processes.
1 code implementation • ICLR 2021 • Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar
Understanding human behavior from observed data is critical for transparency and accountability in decision-making.
2 code implementations • 13 Sep 2023 • Hao Sun, Alihan Hüyük, Mihaela van der Schaar
We identify a previously overlooked objective of query dependency in such optimization and elucidate two ensuing challenges that impede the successful and economical design of prompt optimization techniques.
2 code implementations • 24 Feb 2023 • Samuel Holt, Alihan Hüyük, Zhaozhi Qian, Hao Sun, Mihaela van der Schaar
Many real-world offline reinforcement learning (RL) problems involve continuous-time environments with delays.
no code implementations • 11 Aug 2022 • Alicia Curth, Alihan Hüyük, Mihaela van der Schaar
We study the problem of adaptively identifying patient subpopulations that benefit from a given treatment during a confirmatory clinical trial.
no code implementations • 21 Feb 2022 • Alihan Hüyük, William R. Zame, Mihaela van der Schaar
Modeling the preferences of agents over a set of alternatives is a principal concern in many areas.
no code implementations • NeurIPS 2021 • Yuchao Qin, Fergus Imrie, Alihan Hüyük, Daniel Jarrett, alexander gimson, Mihaela van der Schaar
Significant effort has been placed on developing decision support tools to improve patient care.
2 code implementations • 13 Jul 2021 • Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar
Understanding a decision-maker's priorities by observing their behavior is critical for transparency and accountability in decision processes, such as in healthcare.
no code implementations • ICLR 2021 • Ioana Bica, Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar
Building interpretable parameterizations of real-world decision-making on the basis of demonstrated behavior -- i. e. trajectories of observations and actions made by an expert maximizing some unknown reward function -- is essential for introspecting and auditing policies in different institutions.
no code implementations • 26 Jul 2019 • Alihan Hüyük, Cem Tekin
The algorithm we propose for the second setting also attains bounded regret for the multiarmed bandit with satisficing objectives.
no code implementations • 7 Jul 2019 • Alihan Hüyük, Cem Tekin
Influence maximization, adaptive routing, and dynamic spectrum allocation all require choosing the right action from a large set of alternatives.
no code implementations • 7 Sep 2018 • Alihan Hüyük, Cem Tekin
We analyze the regret of combinatorial Thompson sampling (CTS) for the combinatorial multi-armed bandit with probabilistically triggered arms under the semi-bandit feedback setting.