no code implementations • 11 Jan 2024 • Jesse Geneson, Linus Tang
In particular, we sharpen an upper bound for delayed ambiguous reinforcement learning by a factor of 2 and an upper bound for learning compositions of families of functions by a factor of 2. 41.