Search Results for author: Yusuke Kaneko

Found 2 papers, 0 papers with code

Off-Policy Evaluation of Bandit Algorithm from Dependent Samples under Batch Update Policy

no code implementations • 23 Oct 2020 • Masahiro Kato, Yusuke Kaneko

The goal of off-policy evaluation (OPE) is to evaluate a new policy using historical data obtained via a behavior policy.

Off-policy evaluation

Off-Policy Exploitability-Evaluation in Two-Player Zero-Sum Markov Games

no code implementations • 4 Jul 2020 • Kenshi Abe, Yusuke Kaneko

The proposed estimators project exploitability, which is often used as a metric for determining how close a policy profile (i.e., a tuple of policies) is to a Nash equilibrium in two-player zero-sum games.

Off-policy evaluation • Vocal Bursts Valence Prediction
