1 code implementation • 15 Jul 2021 • Joseph Palermo, Johnny Ye, Alok Singh
We convert the DeepMind Mathematics Dataset into a reinforcement learning environment by interpreting it as a program synthesis problem.
Mathematical Reasoning Program Synthesis +2