1 code implementation • 26 Jan 2023 • Mikhail Konobeev, Jalal Etesami, Negar Kiyavash
We study the causal bandit problem when the causal graph is unknown and develop an efficient algorithm for finding the parent node of the reward node using atomic interventions.
1 code implementation • 14 Feb 2022 • Etienne Boursier, Mikhail Konobeev, Nicolas Flammarion
Multi-task learning leverages structural similarities between multiple tasks to learn despite very few samples.
no code implementations • 11 Feb 2021 • Branislav Kveton, Mikhail Konobeev, Manzil Zaheer, Chih-Wei Hsu, Martin Mladenov, Craig Boutilier, Csaba Szepesvari
Efficient exploration in bandits is a fundamental online learning problem.
no code implementations • 31 Oct 2020 • Mikhail Konobeev, Ilja Kuzborskij, Csaba Szepesvári
A key problem in the theory of meta-learning is to understand how the task distributions influence transfer risk, the expected error of a meta-learner on a new task drawn from the unknown task distribution.