Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning

Geometry problem solving has attracted much attention in the NLP community recently. The task is challenging as it requires abstract problem understanding and symbolic reasoning with axiomatic knowledge. However, current datasets are either small in scale or not publicly available. Thus, we construct a new large-scale benchmark, Geometry3K, consisting of 3,002 geometry problems with dense annotation in formal language. We further propose a novel geometry solving approach with formal language and symbolic reasoning, called Interpretable Geometry Problem Solver (Inter-GPS). Inter-GPS first parses the problem text and diagram into formal language automatically via rule-based text parsing and neural object detecting, respectively. Unlike implicit learning in existing methods, Inter-GPS incorporates theorem knowledge as conditional rules and performs symbolic reasoning step by step. Also, a theorem predictor is designed to infer the theorem application sequence fed to the symbolic solver for the more efficient and reasonable searching path. Extensive experiments on the Geometry3K and GEOS datasets demonstrate that Inter-GPS achieves significant improvements over existing methods. The project with code and data is available at https://lupantech.github.io/inter-gps.

PDF Abstract ACL 2021 PDF ACL 2021 Abstract

Datasets


Introduced in the Paper:

Geometry3K

Used in the Paper:

GeoS PGPS9K PGDP5K

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Mathematical Question Answering Geometry3K Inter-GPS (GT) Accuracy (%) 78.3 # 2
Mathematical Question Answering Geometry3K Random Accuracy (%) 25.0 # 6
Mathematical Question Answering Geometry3K Human Accuracy (%) 56.9 # 5
Mathematical Question Answering Geometry3K Inter-GPS Accuracy (%) 57.5 # 4
Mathematical Question Answering Geometry3K Human Expert Accuracy (%) 90.9 # 1
Mathematical Question Answering GeoS Inter-GPS Accuracy (%) 67 # 1
Scene Parsing PGDP5K Inter-GPS Total Accuracy 27.3 # 2
Mathematical Reasoning PGPS9K Inter-GPS Completion accuracy 59.8 # 2

Methods


No methods listed for this paper. Add relevant methods here