no code implementations • 6 Nov 2023 • Farinaz Alamiyan-Harandi, Mersad Hassanjani, Pouria Ramazi
Experimental results in the Cleanup and Harvest environments show that training based on the KindMARL method enabled the agents to earn 89\% (resp.
counterfactual Counterfactual Reasoning +3