no code implementations • 18 Apr 2020 • Saeed Moazami, Peggy Doerschuk
To that end, a substitute model for the reward function approximator is introduced that learns to avoid terminal states rather than to maximize accumulated rewards from safe states.
no code implementations • 12 Dec 2019 • Mehdi Dadvar, Saeed Moazami, Harley R. Myler, Hassan Zargarzadeh
Consider a dynamic task allocation problem, where tasks are unknowingly distributed over an environment.
no code implementations • 12 Dec 2019 • Mehdi Dadvar, Saeed Moazami, Harley R. Myler, Hassan Zargarzadeh
Besides, we present a coordination solution between hunters and gatherers by integrating the novel notion of profit margins into the concept of expected information gain.
no code implementations • 6 Aug 2019 • Saeed Moazami, M-Mahdi Naddaf-Sh, Srinivas Palanki, Hassan Zargarzadeh
The propelling mechanism of this robot consists of two actuators: a slider, and a rotational pendulum located on the SR's diagonal shaft.
Robotics