no code implementations • 27 Apr 2022 • Jonathan G. Richens, Rory Beard, Daniel H. Thompson
To act safely and ethically in the real world, agents must be able to reason about harm and avoid harmful actions.
1 code implementation • 15 Oct 2021 • Ludovic Denoyer, Alfredo De la Fuente, Song Duong, Jean-Baptiste Gaya, Pierre-Alexandre Kamienny, Daniel H. Thompson
SaLinA is a simple library that makes implementing complex sequential learning models easy, including reinforcement learning algorithms.
no code implementations • AABI Symposium 2019 • Divya Gautam, Maria Lomeli, Kostis Gourgoulias, Daniel H. Thompson, Saurabh Johri
We consider the effect of structure-agnostic and structure-dependent masking schemes when training a universal marginaliser (arXiv:1711.00695) in order to learn conditional distributions of the form $P(x_i \mid \mathbf{x}_{\mathbf{b}})$, where $x_i$ is a given random variable and $\mathbf{x}_{\mathbf{b}}$ is some arbitrary subset of all random variables of the generative model of interest.
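As a rough illustration of what a structure-agnostic masking scheme could look like, the sketch below hides each variable independently at random: the entries left visible play the role of $\mathbf{x}_{\mathbf{b}}$, and the hidden ones are candidate targets $x_i$. The function name `random_mask`, the sentinel `mask_value`, and the masking probability are assumptions made for illustration; none of them is taken from the paper, and this is not the authors' implementation.

```python
import numpy as np

def random_mask(x, mask_prob, rng, mask_value=-1):
    """Hide each entry of x independently with probability mask_prob.

    Hypothetical helper: ignores any graph structure among the variables,
    which is what makes the scheme structure-agnostic.
    """
    x = np.asarray(x)
    hidden = rng.random(x.shape) < mask_prob    # True where a variable is hidden
    masked = np.where(hidden, mask_value, x)    # replace hidden entries with a sentinel
    return masked, hidden

rng = np.random.default_rng(0)
samples = rng.integers(0, 2, size=(4, 5))       # 4 samples of 5 binary variables
masked, hidden = random_mask(samples, mask_prob=0.5, rng=rng)
```

A structure-dependent scheme would instead choose `hidden` using the generative model's graph, for example by masking a node together with its descendants; that choice is likewise an assumption here, not a description of the paper's method.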