no code implementations • 27 Sep 2018 • Dong Xu, Eleanor Quint, Zeynep Hakguder, Haluk Dogan, Stephen Scott, Matthew Dwyer
We study the problem of deep reinforcement learning where the agent's action sequences are constrained, e. g., prohibition of dithering or overactuating action sequences that might damage a robot, drone, or other physical device.