Automated Theorem Proving

70 papers with code • 10 benchmarks • 8 datasets

The goal of Automated Theorem Proving is to automatically generate a proof, given a conjecture (the target theorem) and a knowledge base of known facts, all expressed in a formal language. Automated Theorem Proving is useful in a wide range of applications, including the verification and synthesis of software and hardware systems.

Source: Learning to Prove Theorems by Learning to Generate Theorems

Libraries

Use these libraries to find Automated Theorem Proving models and implementations
2 papers
6,574

Towards Large Language Models as Copilots for Theorem Proving in Lean

lean-dojo/leancopilot 18 Apr 2024

In this paper, we explore LLMs as copilots that assist humans in proving theorems.

775
18 Apr 2024

A Survey on Deep Learning for Theorem Proving

zhaoyu-li/dl4tp 15 Apr 2024

Theorem proving is a fundamental aspect of mathematics, spanning from informal reasoning in mathematical language to rigorous derivations in formal systems.

54
15 Apr 2024

Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization

jinpz/dtv 26 Mar 2024

Large language models (LLM), such as Google's Minerva and OpenAI's GPT families, are becoming increasingly capable of solving mathematical quantitative reasoning problems.

6
26 Mar 2024

LeanReasoner: Boosting Complex Logical Reasoning with Lean

some-random/theorem-proving-reasoning 20 Mar 2024

Large language models (LLMs) often struggle with complex logical reasoning due to logical inconsistencies and the inherent difficulty of such reasoning.

10
20 Mar 2024

REFACTOR: Learning to Extract Theorems from Proofs

jinpz/refactor 26 Feb 2024

With newly extracted theorems, we show that the existing proofs in the MetaMath database can be refactored.

2
26 Feb 2024

On the (In)feasibility of ML Backdoor Detection as an Hypothesis Testing Problem

g-pichler/in_feasibility_of_ml_backdoor_detection 26 Feb 2024

We introduce a formal statistical definition for the problem of backdoor detection in machine learning systems and use it to analyze the feasibility of such problems, providing evidence for the utility and applicability of our definition.

0
26 Feb 2024

MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data

eleanor-h/mustard 14 Feb 2024

Recent large language models (LLMs) have witnessed significant advancement in various tasks, including mathematical reasoning and theorem proving.

14
14 Feb 2024

Llemma: An Open Language Model For Mathematics

eleutherai/gpt-neox 16 Oct 2023

We present Llemma, a large language model for mathematics.

6,574
16 Oct 2023

TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models

menik1126/TRIGO 16 Oct 2023

Automated theorem proving (ATP) has become an appealing domain for exploring the reasoning ability of the recent successful generative language models.

7
16 Oct 2023

An In-Context Learning Agent for Formal Theorem-Proving

trishullab/copra 6 Oct 2023

We evaluate our implementation of COPRA on the miniF2F benchmark for Lean and a set of Coq tasks from the CompCert project.

35
06 Oct 2023