Search Results for author: Koichi Tanaka

Found 1 papers, 0 papers with code

Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits

no code implementations20 Aug 2024 Tatsuhiro Shimizu, Koichi Tanaka, Ren Kishimoto, Haruka Kiyohara, Masahiro Nomura, Yuta Saito

We explore off-policy evaluation and learning (OPE/L) in contextual combinatorial bandits (CCB), where a policy selects a subset in the action space.

Off-policy evaluation Recommendation Systems +1

Cannot find the paper you are looking for? You can Submit a new open access paper.