Automated Theorem Proving

83 papers with code • 9 benchmarks • 8 datasets

The goal of Automated Theorem Proving is to automatically generate a proof, given a conjecture (the target theorem) and a knowledge base of known facts, all expressed in a formal language. Automated Theorem Proving is useful in a wide range of applications, including the verification and synthesis of software and hardware systems.

Source: Learning to Prove Theorems by Learning to Generate Theorems

Libraries

Use these libraries to find Automated Theorem Proving models and implementations
2 papers
6,882
2 papers
49

SubgoalXL: Subgoal-based Expert Learning for Theorem Proving

zhaoxlpku/subgoalxl 20 Aug 2024

This paper introduces SubgoalXL, a novel approach that synergizes subgoal-based proofs with expert learning to enhance LLMs' capabilities in formal theorem proving within the Isabelle environment.

18
20 Aug 2024

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

deepseek-ai/deepseek-prover-v1.5 15 Aug 2024

We introduce DeepSeek-Prover-V1. 5, an open-source language model designed for theorem proving in Lean 4, which enhances DeepSeek-Prover-V1 by optimizing both training and inference processes.

207
15 Aug 2024

miniCTX: Neural Theorem Proving with (Long-)Contexts

cmu-l3/ntp-toolkit 5 Aug 2024

We introduce miniCTX, which tests a model's ability to prove formal mathematical theorems that depend on new context that is not seen during training.

11
05 Aug 2024

LEAN-GitHub: Compiling GitHub LEAN repositories for a versatile LEAN prover

internlm/internlm-math 24 Jul 2024

To address this issue, we propose LEAN-GitHub, a dataset consisting of large-scale formal data extracted from almost all Lean 4 repositories on GitHub.

417
24 Jul 2024

PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition

trishullab/putnambench 15 Jul 2024

We present PutnamBench, a new multilingual benchmark for evaluating the ability of neural theorem-provers to solve competition mathematics problems.

49
15 Jul 2024

TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts

RickySkywalker/TheoremLlama 3 Jul 2024

However, due to the scarcity of aligned NL and Formal Language (FL) theorem-proving data most modern LLMs exhibit suboptimal performance. This scarcity results in a paucity of methodologies for training LLMs and techniques to fully utilize their capabilities in composing formal proofs.

27
03 Jul 2024

Learning Formal Mathematics From Intrinsic Motivation

gpoesia/peano 30 Jun 2024

We propose novel methods for hindsight relabeling on proof search trees to significantly improve the agent's sample efficiency in both tasks.

49
30 Jun 2024

FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving

fveler/fvel 20 Jun 2024

In this paper, we propose FVEL, an interactive Formal Verification Environment with LLMs.

0
20 Jun 2024

Lean Workbook: A large-scale Lean problem set formalized from natural language math problems

internlm/internlm-math 6 Jun 2024

Our results indicate that the synthetic data pipeline can provide useful training data and improve the performance of LLMs in translating and understanding complex mathematical problems and proofs.

417
06 Jun 2024

Proving Theorems Recursively

wiio12/poetry 23 May 2024

This approach allows the theorem to be tackled incrementally by outlining the overall theorem at the first level and then solving the intermediate conjectures at deeper levels.

11
23 May 2024