no code implementations • 9 Sep 2021 • Charles Evans, Atoosa Kasirzadeh
In this paper, we introduce new formal methods and provide empirical evidence to highlight a unique safety concern prevalent in reinforcement learning (RL)-based recommendation algorithms -- 'user tampering.'