Search Results for author: Johnny Ye

Found 1 papers, 1 papers with code

A Reinforcement Learning Environment for Mathematical Reasoning via Program Synthesis

1 code implementation15 Jul 2021 Joseph Palermo, Johnny Ye, Alok Singh

We convert the DeepMind Mathematics Dataset into a reinforcement learning environment by interpreting it as a program synthesis problem.

Mathematical Reasoning Program Synthesis +2

Cannot find the paper you are looking for? You can Submit a new open access paper.