1 code implementation • 20 Jun 2017 • Sanket Kamthe, Marc Peter Deisenroth
Trial-and-error based reinforcement learning (RL) has seen rapid advancements in recent times, especially with the advent of deep neural networks.
no code implementations • 3 Jan 2021 • Sanket Kamthe, Samuel Assefa, Marc Deisenroth
Learning the probabilistic model for the data is equivalent to estimating the density of the data.
no code implementations • 26 Apr 2022 • Shaltiel Eloul, Fran Silavong, Sanket Kamthe, Antonios Georgiadis, Sean J. Moran
We show that otherwise it is possible to directly recover all vectors in a mini-batch without any numerical optimisation due to the de-mixing nature of the cross entropy loss.