no code implementations • 4 Dec 2022 • Yusuke Narita, Kyohei Okumura, Akihiro Shimizu, Kohei Yata
Off-policy evaluation (OPE) attempts to predict the performance of counterfactual policies using log data from a different policy.
no code implementations • 9 Nov 2021 • Kohei Yata
I consider a class of statistical decision problems in which the policy maker must decide between two alternative policies to maximize social welfare based on a finite sample.
2 code implementations • 26 Apr 2021 • Yusuke Narita, Kohei Yata
Algorithms make a growing portion of policy and business decisions.
no code implementations • 20 Feb 2020 • Yusuke Narita, Shota Yasui, Kohei Yata
Efficient methods to evaluate new algorithms are critical for improving interactive bandit and reinforcement learning systems such as recommendation systems.
no code implementations • 10 Sep 2018 • Yusuke Narita, Shota Yasui, Kohei Yata
What is the most statistically efficient way to do off-policy evaluation and optimization with batch data from bandit feedback?
Ranked #1 on Visual Object Tracking on VOT2014