Search Results for author: Alihan Hüyük

Found 16 papers, 6 papers with code

Defining Expertise: Applications to Treatment Effect Estimation

2 code implementations1 Mar 2024 Alihan Hüyük, Qiyao Wei, Alicia Curth, Mihaela van der Schaar

Actions of an expert thus naturally encode part of their domain knowledge, and can help make inferences within the same domain: Knowing doctors try to prescribe the best treatment for their patients, we can tell treatments prescribed more frequently are likely to be more effective.

Inductive Bias Model Selection

Adaptive Experiment Design with Synthetic Controls

1 code implementation30 Jan 2024 Alihan Hüyük, Zhaozhi Qian, Mihaela van der Schaar

Clinical trials are typically run in order to understand the effects of a new treatment on a given population of patients.

When is Off-Policy Evaluation Useful? A Data-Centric Perspective

no code implementations23 Nov 2023 Hao Sun, Alex J. Chan, Nabeel Seedat, Alihan Hüyük, Mihaela van der Schaar

On the one hand, it brings opportunities for safe policy improvement under high-stakes scenarios like clinical guidelines.

Off-policy evaluation

Online Decision Mediation

no code implementations28 Oct 2023 Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

Consider learning a decision support assistant to serve as an intermediary between (oracle) expert behavior and (imperfect) human behavior: At each time, the algorithm observes an action chosen by a fallible agent, and decides whether to *accept* that agent's decision, *intervene* with an alternative, or *request* the expert's opinion.

Decision Making Descriptive

Explaining by Imitating: Understanding Decisions by Interpretable Policy Learning

1 code implementation ICLR 2021 Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar

Understanding human behavior from observed data is critical for transparency and accountability in decision-making.

Decision Making

Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL

2 code implementations13 Sep 2023 Hao Sun, Alihan Hüyük, Mihaela van der Schaar

We identify a previously overlooked objective of query dependency in such optimization and elucidate two ensuing challenges that impede the successful and economical design of prompt optimization techniques.

Arithmetic Reasoning Navigate +2

Neural Laplace Control for Continuous-time Delayed Systems

2 code implementations24 Feb 2023 Samuel Holt, Alihan Hüyük, Zhaozhi Qian, Hao Sun, Mihaela van der Schaar

Many real-world offline reinforcement learning (RL) problems involve continuous-time environments with delays.

Model Predictive Control Offline RL +1

Adaptive Identification of Populations with Treatment Benefit in Clinical Trials: Machine Learning Challenges and Solutions

no code implementations11 Aug 2022 Alicia Curth, Alihan Hüyük, Mihaela van der Schaar

We study the problem of adaptively identifying patient subpopulations that benefit from a given treatment during a confirmatory clinical trial.

Inferring Lexicographically-Ordered Rewards from Preferences

no code implementations21 Feb 2022 Alihan Hüyük, William R. Zame, Mihaela van der Schaar

Modeling the preferences of agents over a set of alternatives is a principal concern in many areas.

Inverse Contextual Bandits: Learning How Behavior Evolves over Time

2 code implementations13 Jul 2021 Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar

Understanding a decision-maker's priorities by observing their behavior is critical for transparency and accountability in decision processes, such as in healthcare.

Benchmarking Decision Making +1

Learning "What-if" Explanations for Sequential Decision-Making

no code implementations ICLR 2021 Ioana Bica, Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

Building interpretable parameterizations of real-world decision-making on the basis of demonstrated behavior -- i. e. trajectories of observations and actions made by an expert maximizing some unknown reward function -- is essential for introspecting and auditing policies in different institutions.

counterfactual Counterfactual Reasoning +3

Lexicographic Multiarmed Bandit

no code implementations26 Jul 2019 Alihan Hüyük, Cem Tekin

The algorithm we propose for the second setting also attains bounded regret for the multiarmed bandit with satisficing objectives.

Thompson Sampling for Combinatorial Network Optimization in Unknown Environments

no code implementations7 Jul 2019 Alihan Hüyük, Cem Tekin

Influence maximization, adaptive routing, and dynamic spectrum allocation all require choosing the right action from a large set of alternatives.

Combinatorial Optimization Thompson Sampling

Analysis of Thompson Sampling for Combinatorial Multi-armed Bandit with Probabilistically Triggered Arms

no code implementations7 Sep 2018 Alihan Hüyük, Cem Tekin

We analyze the regret of combinatorial Thompson sampling (CTS) for the combinatorial multi-armed bandit with probabilistically triggered arms under the semi-bandit feedback setting.

Thompson Sampling

Cannot find the paper you are looking for? You can Submit a new open access paper.