no code implementations • 31 May 2023 • TaeHyun Hwang, Kyuwook Chai, Min-hwan Oh
Approximating this unknown score function with deep neural networks, we propose algorithms: Combinatorial Neural UCB ($\texttt{CN-UCB}$) and Combinatorial Neural Thompson Sampling ($\texttt{CN-TS}$).
no code implementations • 27 Dec 2022 • TaeHyun Hwang, Min-hwan Oh
In this paper, we establish a provably efficient RL algorithm for the MDP whose state transition is given by a multinomial logistic model.
Model-based Reinforcement Learning reinforcement-learning +1
no code implementations • 16 Jan 2014 • Sunho Park, TaeHyun Hwang, Seungjin Choi
Multiclass problems are often decomposed into multiple binary problems that are solved by individual binary classifiers whose results are integrated into a final answer.