Formal Mathematics Statement Curriculum Learning

We explore the use of expert iteration in the context of language modeling applied to formal mathematics. We show that at same compute budget, expert iteration, by which we mean proof search interleaved with learning, dramatically outperforms proof search only. We also observe that when applied to a collection of formal statements of sufficiently varied difficulty, expert iteration is capable of finding and solving a curriculum of increasingly difficult problems, without the need for associated ground-truth proofs. Finally, by applying this expert iteration to a manually curated set of problem statements, we achieve state-of-the-art on the miniF2F benchmark, automatically solving multiple challenging problems drawn from high school olympiads.

PDF Abstract

Datasets


Results from the Paper


Ranked #3 on Automated Theorem Proving on miniF2F-test (using extra training data)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Automated Theorem Proving miniF2F-test Lean Expert Iteration Pass@1 29.6 # 3
Pass@8 34.5 # 1
Pass@64 36.6 # 4

Methods


No methods listed for this paper. Add relevant methods here