1 code implementation • 10 Jan 2017 • Han Cai, Kan Ren, Wei-Nan Zhang, Kleanthis Malialis, Jun Wang, Yong Yu, Defeng Guo
In this paper, we formulate the bid decision process as a reinforcement learning problem, where the state space is represented by the auction information and the campaign's real-time parameters, while an action is the bid price to set.