Search Results for author: Yusuke Kaneko

Found 2 papers, 0 papers with code

Off-Policy Evaluation of Bandit Algorithm from Dependent Samples under Batch Update Policy

no code implementations23 Oct 2020 Masahiro Kato, Yusuke Kaneko

The goal of off-policy evaluation (OPE) is to evaluate a new policy using historical data obtained via a behavior policy.

Off-policy evaluation

Off-Policy Exploitability-Evaluation in Two-Player Zero-Sum Markov Games

no code implementations4 Jul 2020 Kenshi Abe, Yusuke Kaneko

The proposed estimators project exploitability that is often used as a metric for determining how close a policy profile (i. e., a tuple of policies) is to a Nash equilibrium in two-player zero-sum games.

Off-policy evaluation Vocal Bursts Valence Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.