1 code implementation • 12 Oct 2023 • Valentina Zangirolami, Matteo Borrotti
One of the major dilemmas in Reinforcement Learning (RL) where an autonomous agent has to balance two contrasting needs in making its decisions is: exploiting the current knowledge of the environment to maximize the cumulative reward as well as exploring actions that allow improving the knowledge of the environment, hopefully leading to higher reward values (exploration-exploitation trade-off).
no code implementations • 22 Oct 2021 • Alessandro Riboni, Nicolò Ghioldi, Antonio Candelieri, Matteo Borrotti
Automated driving systems (ADS) have undergone a significant improvement in the last years.