Automated Theorem Proving
70 papers with code • 10 benchmarks • 8 datasets
The goal of Automated Theorem Proving is to automatically generate a proof, given a conjecture (the target theorem) and a knowledge base of known facts, all expressed in a formal language. Automated Theorem Proving is useful in a wide range of applications, including the verification and synthesis of software and hardware systems.
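The setup above can be made concrete with a minimal Lean 4 sketch (assumed example, not from any listed paper): the conjecture is the stated theorem, and a decision procedure such as `omega` plays the role of the automated prover searching over known facts about natural-number arithmetic.

```lean
-- Conjecture: addition over Nat is commutative for these variables.
-- The `omega` tactic automatically discharges linear-arithmetic goals,
-- acting here as a small-scale automated theorem prover.
theorem add_comm_example (a b : Nat) : a + b = b + a := by
  omega
```

In full ATP systems the same pattern scales up: the prover receives a formal conjecture plus a library of lemmas and must construct a machine-checkable proof without human guidance.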
Source: Learning to Prove Theorems by Learning to Generate Theorems
Libraries
Use these libraries to find Automated Theorem Proving models and implementations
Latest papers
Towards Large Language Models as Copilots for Theorem Proving in Lean
In this paper, we explore LLMs as copilots that assist humans in proving theorems.
A Survey on Deep Learning for Theorem Proving
Theorem proving is a fundamental aspect of mathematics, spanning from informal reasoning in mathematical language to rigorous derivations in formal systems.
Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization
Large language models (LLMs), such as Google's Minerva and OpenAI's GPT families, are becoming increasingly capable of solving mathematical quantitative reasoning problems.
LeanReasoner: Boosting Complex Logical Reasoning with Lean
Large language models (LLMs) often struggle with complex logical reasoning due to logical inconsistencies and the inherent difficulty of such reasoning.
REFACTOR: Learning to Extract Theorems from Proofs
With newly extracted theorems, we show that the existing proofs in the MetaMath database can be refactored.
On the (In)feasibility of ML Backdoor Detection as an Hypothesis Testing Problem
We introduce a formal statistical definition for the problem of backdoor detection in machine learning systems and use it to analyze the feasibility of such problems, providing evidence for the utility and applicability of our definition.
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
Recent large language models (LLMs) have witnessed significant advancement in various tasks, including mathematical reasoning and theorem proving.
Llemma: An Open Language Model For Mathematics
We present Llemma, a large language model for mathematics.
TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models
Automated theorem proving (ATP) has become an appealing domain for exploring the reasoning ability of the recent successful generative language models.
An In-Context Learning Agent for Formal Theorem-Proving
We evaluate our implementation of COPRA on the miniF2F benchmark for Lean and a set of Coq tasks from the CompCert project.