no code implementations • 13 Mar 2021 • Manash Pratim Das, Anirudh Vemula, Mayank Pathak, Sandip Aine, Maxim Likhachev
In this work, we investigate how would the robot with the help of a simulator, learn to maximize the number of boxes unloaded by each action.