Search Results for author: Alan Malek

Found 14 papers, 3 papers with code

Additive Causal Bandits with Unknown Graph

1 code implementation • 13 Jun 2023 • Alan Malek, Virginia Aglietti, Silvia Chiappa

We explore algorithms to select actions in the causal bandit setting where the learner can choose to intervene on a set of random variables related by a causal graph, and the learner sequentially chooses interventions and observes a sample from the interventional distribution.

Paper
Code

Constrained Causal Bayesian Optimization

1 code implementation • 31 May 2023 • Virginia Aglietti, Alan Malek, Ira Ktena, Silvia Chiappa

We propose constrained causal Bayesian optimization (cCBO), an approach for finding interventions in a known causal graph that optimize a target variable under some constraints.

Bayesian Optimization Gaussian Processes

Paper
Code

Pragmatic Fairness: Developing Policies with Outcome Disparity Control

1 code implementation • 28 Jan 2023 • Limor Gultchin, Siyuan Guo, Alan Malek, Silvia Chiappa, Ricardo Silva

We introduce a causal framework for designing optimal policies that satisfy fairness constraints.

Attribute Blocking +1

Paper
Code

Asymptotically Best Causal Effect Identification with Multi-Armed Bandits

no code implementations • NeurIPS 2021 • Alan Malek, Silvia Chiappa

This paper considers the problem of selecting a formula for identifying a causal quantity of interest among a set of available formulas.

Multi-Armed Bandits

Paper
Add Code

Prequential MDL for Causal Structure Learning with Neural Networks

no code implementations • 2 Jul 2021 • Jorg Bornschein, Silvia Chiappa, Alan Malek, Rosemary Nan Ke

Learning the structure of Bayesian networks and causal relationships from observations is a common goal in several areas of science and technology.

Paper
Add Code

Large-Scale Markov Decision Problems via the Linear Programming Dual

no code implementations • 6 Jan 2019 • Yasin Abbasi-Yadkori, Peter L. Bartlett, Xi Chen, Alan Malek

Moreover, we propose an efficient algorithm, scaling with the size of the subspace but not the state space, that is able to find a policy with low excess loss relative to the best policy in this class.

Paper
Add Code

Horizon-Independent Minimax Linear Regression

no code implementations • NeurIPS 2018 • Alan Malek, Peter L. Bartlett

We consider online linear regression: at each round, an adversary reveals a covariate vector, the learner predicts a real value, the adversary reveals a label, and the learner suffers the squared prediction error.

regression

Paper
Add Code

Best Arm Identification for Contaminated Bandits

no code implementations • 26 Feb 2018 • Jason Altschuler, Victor-Emmanuel Brunel, Alan Malek

Specifically, we propose a variant of the Best Arm Identification problem for \emph{contaminated bandits}, where each arm pull has probability $\varepsilon$ of generating a sample from an arbitrary contamination distribution instead of the true underlying distribution.

Active Learning

Paper
Add Code

Random Permutation Online Isotonic Regression

no code implementations • NeurIPS 2017 • Wojciech Kotlowski, Wouter M. Koolen, Alan Malek

We revisit isotonic regression on linear orders, the problem of fitting monotonic functions to best explain the data, in an online setting.

regression

Paper
Add Code

Hit-and-Run for Sampling and Planning in Non-Convex Spaces

no code implementations • 19 Oct 2016 • Yasin Abbasi-Yadkori, Peter L. Bartlett, Victor Gabillon, Alan Malek

We propose the Hit-and-Run algorithm for planning and sampling problems in non-convex spaces.

Paper
Add Code

Online Isotonic Regression

no code implementations • 14 Mar 2016 • Wojciech Kotłowski, Wouter M. Koolen, Alan Malek

We then prove that the Exponential Weights algorithm played over a covering net of isotonic functions has a regret bounded by $O\big(T^{1/3} \log^{2/3}(T)\big)$ and present a matching $\Omega(T^{1/3})$ lower bound on regret.

regression

Paper
Add Code

Minimax Time Series Prediction

no code implementations • NeurIPS 2015 • Wouter M. Koolen, Alan Malek, Peter L. Bartlett, Yasin Abbasi

We consider an adversarial formulation of the problem ofpredicting a time series with square loss.

Time Series Time Series Prediction

Paper
Add Code

Efficient Minimax Strategies for Square Loss Games

no code implementations • NeurIPS 2014 • Wouter M. Koolen, Alan Malek, Peter L. Bartlett

We consider online prediction problems where the loss between the prediction and the outcome is measured by the squared Euclidean distance and its generalization, the squared Mahalanobis distance.

Density Estimation

Paper
Add Code

Linear Programming for Large-Scale Markov Decision Problems

no code implementations • 27 Feb 2014 • Yasin Abbasi-Yadkori, Peter L. Bartlett, Alan Malek

We consider the problem of controlling a Markov decision process (MDP) with a large state space, so as to minimize average cost.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.