In this paper, we propose a new algorithm PPG (Proximal Policy Gradient), which is close to both VPG (vanilla policy gradient) and PPO (proximal policy optimization).
We present a novel approach to train a natural media painting using reinforcement learning.
We present a novel reinforcement learning-based natural media painting algorithm.
Action selection is guided by a given reference image, which the agent attempts to replicate subject to the limitations of the action space and the agent's learned policy.
Doodling is a useful and common intelligent skill that people can learn and master.