Search Results for author: Shinto Eguchi

Found 2 papers, 0 papers with code

Active Learning by Query by Committee with Robust Divergences

no code implementations18 Nov 2022 Hideitsu Hino, Shinto Eguchi

In this paper, the measure of disagreement is defined by the Bregman divergence, which includes the Kullback--Leibler divergence as an instance, and the dual $\gamma$-power divergence.

Active Learning

Minimum information divergence of Q-functions for dynamic treatment resumes

no code implementations16 Nov 2022 Shinto Eguchi

In a standard framework of reinforcement learning, a Q-function is defined as the conditional expectation of a reward given a state and an action for a single-stage situation.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.