Search Results for author: Miroslav Dudík

Found 20 papers, 5 papers with code

Fairlearn: Assessing and Improving Fairness of AI Systems

no code implementations • 29 Mar 2023 • Hilde Weerts, Miroslav Dudík, Richard Edgar, Adrin Jalali, Roman Lutz, Michael Madaio

Fairlearn is an open source project to help practitioners assess and improve fairness of artificial intelligence (AI) systems.

Fairness

Paper
Add Code

Convex Analysis at Infinity: An Introduction to Astral Space

no code implementations • 6 May 2022 • Miroslav Dudík, Robert E. Schapire, Matus Telgarsky

Not all convex functions on $\mathbb{R}^n$ have finite minimizers; some can only be minimized by a sequence as it heads to infinity.

Paper
Add Code

Personalization Improves Privacy-Accuracy Tradeoffs in Federated Learning

1 code implementation • 10 Feb 2022 • Alberto Bietti, Chen-Yu Wei, Miroslav Dudík, John Langford, Zhiwei Steven Wu

Large-scale machine learning systems often involve data distributed across a collection of users.

Personalized Federated Learning Stochastic Optimization

Paper
Code

Bayesian decision-making under misspecified priors with applications to meta-learning

no code implementations • NeurIPS 2021 • Max Simchowitz, Christopher Tosh, Akshay Krishnamurthy, Daniel Hsu, Thodoris Lykouris, Miroslav Dudík, Robert E. Schapire

We prove that the expected reward accrued by Thompson sampling (TS) with a misspecified prior differs by at most $\tilde{\mathcal{O}}(H^2 \epsilon)$ from TS with a well specified prior, where $\epsilon$ is the total-variation distance between priors and $H$ is the learning horizon.

Decision Making Meta-Learning +2

Paper
Add Code

Gradient descent follows the regularization path for general losses

no code implementations • 19 Jun 2020 • Ziwei Ji, Miroslav Dudík, Robert E. Schapire, Matus Telgarsky

Recent work across many machine learning disciplines has highlighted that standard descent methods, even without explicit regularization, do not merely minimize the training error, but also exhibit an implicit bias.

Paper
Add Code

Doubly robust off-policy evaluation with shrinkage

no code implementations • ICML 2020 • Yi Su, Maria Dimakopoulou, Akshay Krishnamurthy, Miroslav Dudík

We propose a new framework for designing estimators for off-policy evaluation in contextual bandits.

Model Selection Multi-Armed Bandits +1

Paper
Add Code

Fair Regression: Quantitative Definitions and Reduction-based Algorithms

4 code implementations • 30 May 2019 • Alekh Agarwal, Miroslav Dudík, Zhiwei Steven Wu

Our schemes only require access to standard risk minimization algorithms (such as standard classification or least-squares regression) while providing theoretical guarantees on the optimality and fairness of the obtained solutions.

Attribute Fairness +1

1,821

Paper
Code

Provably efficient RL with Rich Observations via Latent State Decoding

1 code implementation • 25 Jan 2019 • Simon S. Du, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal, Miroslav Dudík, John Langford

We study the exploration problem in episodic MDPs with rich observations generated from a small number of latent states.

Clustering Q-Learning +1

Paper
Code

A Reductions Approach to Fair Classification

3 code implementations • ICML 2018 • Alekh Agarwal, Alina Beygelzimer, Miroslav Dudík, John Langford, Hanna Wallach

We present a systematic approach for achieving fairness in a binary classification setting.

Binary Classification Classification +2

1,821

Paper
Code

Practical Contextual Bandits with Regression Oracles

no code implementations • ICML 2018 • Dylan J. Foster, Alekh Agarwal, Miroslav Dudík, Haipeng Luo, Robert E. Schapire

A major challenge in contextual bandits is to design general-purpose algorithms that are both practically useful and theoretically well-founded.

General Classification Multi-Armed Bandits +1

Paper
Add Code

Hierarchical Imitation and Reinforcement Learning

no code implementations • ICML 2018 • Hoang M. Le, Nan Jiang, Alekh Agarwal, Miroslav Dudík, Yisong Yue, Hal Daumé III

We study how to effectively leverage expert feedback to learn sequential decision-making policies.

Imitation Learning Montezuma's Revenge +2

Paper
Add Code

Oracle-Efficient Online Learning and Auction Design

no code implementations • 5 Nov 2016 • Miroslav Dudík, Nika Haghtalab, Haipeng Luo, Robert E. Schapire, Vasilis Syrgkanis, Jennifer Wortman Vaughan

We consider the design of computationally efficient online learning algorithms in an adversarial setting in which the learner has access to an offline optimization oracle.

Paper
Add Code

Arbitrage-Free Combinatorial Market Making via Integer Programming

no code implementations • 9 Jun 2016 • Christian Kroer, Miroslav Dudík, Sébastien Lahaie, Sivaraman Balakrishnan

We present a new combinatorial market maker that operates arbitrage-free combinatorial prediction markets specified by integer programs.

Paper
Add Code

Off-policy evaluation for slate recommendation

1 code implementation • NeurIPS 2017 • Adith Swaminathan, Akshay Krishnamurthy, Alekh Agarwal, Miroslav Dudík, John Langford, Damien Jose, Imed Zitouni

This paper studies the evaluation of policies that recommend an ordered set of items (e. g., a ranking) based on some context---a common scenario in web search, ads, and recommendation.

Learning-To-Rank Off-policy evaluation

Paper
Code

Budget Constraints in Prediction Markets

no code implementations • 7 Oct 2015 • Nikhil Devanur, Miroslav Dudík, Zhiyi Huang, David M. Pennock

We give a detailed characterization of optimal trades under budget constraints in a prediction market with a cost-function-based automated market maker.

Paper
Add Code

Convex Risk Minimization and Conditional Probability Estimation

no code implementations • 15 Jun 2015 • Matus Telgarsky, Miroslav Dudík, Robert Schapire

This paper proves, in very general settings, that convex risk minimization is a procedure to select a unique conditional probability model determined by the classification problem.

General Classification

Paper
Add Code

Doubly Robust Policy Evaluation and Optimization

no code implementations • 10 Mar 2015 • Miroslav Dudík, Dumitru Erhan, John Langford, Lihong Li

As such, we expect the doubly robust approach to become common practice in policy evaluation and optimization.

Decision Making Multi-Armed Bandits

Paper
Add Code

Contextual Dueling Bandits

no code implementations • 23 Feb 2015 • Miroslav Dudík, Katja Hofmann, Robert E. Schapire, Aleksandrs Slivkins, Masrour Zoghi

The first of these algorithms achieves particularly low regret, even when data is adversarial, although its time and space requirements are linear in the size of the policy space.

Paper
Add Code

Market Making with Decreasing Utility for Information

no code implementations • 30 Jul 2014 • Miroslav Dudík, Rafael Frongillo, Jennifer Wortman Vaughan

We study information elicitation in cost-function-based combinatorial prediction markets when the market maker's utility for information decreases over time.

Paper
Add Code

Generative and Discriminative Learning with Unknown Labeling Bias

no code implementations • NeurIPS 2008 • Steven J. Phillips, Miroslav Dudík

For the generative case, we derive an entropy-based weighting that maximizes expected log likelihood under the worst-case true class proportions.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.