Odds-Ratio Thompson Sampling to Control for Time-Varying Effect

4 Mar 2020 | Sulgi Kim, Kyungmin Kim

Multi-armed bandit methods have been used for dynamic experiments, particularly in online services. Among these methods, Thompson sampling is widely used because it is simple yet shows desirable performance...
