Search Results for author: Nicolas Gast

Found 8 papers, 2 papers with code

Trading-off price for data quality to achieve fair online allocation

no code implementations • NeurIPS 2023 • Mathieu Molina, Nicolas Gast, Patrick Loiseau, Vianney Perchet

We consider the problem of online allocation subject to a long-term fairness penalty.

Paper
Add Code

Decentralized model-free reinforcement learning in stochastic games with average-reward objective

no code implementations • 13 Jan 2023 • Romain Cravic, Nicolas Gast, Bruno Gaujal

We propose the first model-free algorithm that achieves low regret performance for decentralized learning in two-player zero-sum tabular stochastic games with infinite-horizon average-reward objective.

Q-Learning reinforcement-learning +1

Paper
Add Code

On Fair Selection in the Presence of Implicit and Differential Variance

no code implementations • 10 Dec 2021 • Vitalii Emelianov, Nicolas Gast, Krishna P. Gummadi, Patrick Loiseau

In the second setting (with known variances), imposing the $\gamma$-rule decreases the utility but we prove a bound on the utility loss due to the fairness mechanism.

Fairness

Paper
Add Code

Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more Scalable than Optimism?

no code implementations • 16 Jun 2021 • Nicolas Gast, Bruno Gaujal, Kimang Khun

While the regret bound and runtime of vanilla implementations of PSRL and UCRL2 are exponential in the number of bandits, we show that the episodic regret of MB-PSRL and MB-UCRL2 is $\tilde{O}(S\sqrt{nK})$ where $K$ is the number of episodes, $n$ is the number of bandits and $S$ is the number of states of each bandit (the exact bound in S, n and K is given in the paper).

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Exponential Convergence Rate for the Asymptotic Optimality of Whittle Index Policy

no code implementations • 16 Dec 2020 • Nicolas Gast, Bruno Gaujal, Chen Yan

In this paper we show that, under the same conditions, the convergence rate is exponential in the number of bandits, unless the fixed point is singular (to be defined later).

Performance Optimization and Control Probability

Paper
Add Code

On Fair Selection in the Presence of Implicit Variance

no code implementations • 24 Jun 2020 • Vitalii Emelianov, Nicolas Gast, Krishna P. Gummadi, Patrick Loiseau

We then compare the utility obtained by imposing a fairness mechanism that we term $\gamma$-rule (it includes demographic parity and the four-fifths rule as special cases), to that of a group-oblivious selection algorithm that picks the candidates with the highest estimated quality independently of their group.

Fairness

Paper
Add Code

The Price of Local Fairness in Multistage Selection

1 code implementation • 15 Jun 2019 • Vitalii Emelianov, George Arvanitakis, Nicolas Gast, Krishna Gummadi, Patrick Loiseau

In particular, our experiments show that the price of local fairness is generally smaller when the sensitive attribute is observed at the first stage; but globally fair selections are more locally fair when the sensitive attribute is observed at the second stage---hence in both cases it is often possible to have a selection that has a small price of local fairness and is close to locally fair.

Attribute Decision Making +1

Paper
Code

Linear Regression from Strategic Data Sources

1 code implementation • 30 Sep 2013 • Nicolas Gast, Stratis Ioannidis, Patrick Loiseau, Benjamin Roussillon

In this paper, we study a setting in which features are public but individuals choose the precision of the outputs they reveal to an analyst.

regression

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.