no code implementations • 2 Feb 2021 • Tom Everitt, Ryan Carey, Eric Langlois, Pedro A Ortega, Shane Legg
We propose a new graphical criterion for value of control, establishing its soundness and completeness.
no code implementations • NeurIPS 2021 • Grégoire Delétang, Jordi Grau-Moya, Markus Kunesch, Tim Genewein, Rob Brekelmans, Shane Legg, Pedro A Ortega
Since the Gaussian free energy is known to be a certainty-equivalent sensitive to the mean and the variance, the learning rule has applications in risk-sensitive decision-making.