Towards Automatically Extracting UML Class Diagrams from Natural Language Specifications

26 Oct 2022 · Song Yang, Houari Sahraoui

In model-driven engineering (MDE), UML class diagrams serve as a way for developers to plan and communicate. However, creating these diagrams is complex and resource-consuming. We propose an automated approach for the extraction of UML class diagrams from natural language software specifications. To develop our approach, we create a dataset of UML class diagrams and their English specifications with the help of volunteers. Our approach is a pipeline of four steps: the segmentation of the input into sentences, the classification of the sentences, the generation of UML class diagram fragments from sentences, and the composition of these fragments into one UML class diagram. We develop a quantitative testing framework specific to UML class diagram extraction. Our approach yields low precision and recall but serves as a benchmark for future research.
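
The four pipeline steps named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the keyword patterns, the `Fragment` type, and every function name below are hypothetical stand-ins chosen only to show how segmentation, classification, fragment generation, and composition fit together.

```python
import re
from dataclasses import dataclass, field

# Hypothetical toy patterns; the paper's actual classifier and generator are not shown here.
PATTERNS = {
    "inheritance": re.compile(r"^(?:an?|the|every|each)\s+(\w+)\s+is\s+an?\s+(\w+)", re.I),
    "association": re.compile(r"^(?:an?|the|every|each)\s+(\w+)\s+has\s+(?:an?|many|several)\s+(\w+)", re.I),
}


@dataclass
class Fragment:
    """A partial class diagram: class names plus (source, kind, target) relations."""
    classes: set = field(default_factory=set)
    relations: set = field(default_factory=set)


def segment(spec):
    """Step 1: split the specification into sentences on end punctuation."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", spec) if s.strip()]


def classify(sentence):
    """Step 2: label a sentence with the first matching pattern, else 'other'."""
    for label, pattern in PATTERNS.items():
        if pattern.search(sentence):
            return label
    return "other"


def generate_fragment(sentence, label):
    """Step 3: turn one labelled sentence into a diagram fragment."""
    source, target = (g.capitalize() for g in PATTERNS[label].search(sentence).groups())
    return Fragment({source, target}, {(source, label, target)})


def compose(fragments):
    """Step 4: merge fragments by taking the union of classes and relations."""
    diagram = Fragment()
    for fragment in fragments:
        diagram.classes |= fragment.classes
        diagram.relations |= fragment.relations
    return diagram


def extract(spec):
    """Run the four steps in order, skipping sentences labelled 'other'."""
    labelled = [(s, classify(s)) for s in segment(spec)]
    return compose(generate_fragment(s, label) for s, label in labelled if label != "other")
```

Calling `extract("A Student is a Person. A Student has many Courses.")` on this toy input yields one diagram with three classes and two relations. Composing by set union means a class mentioned in several sentences appears only once in the result.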


Datasets


Introduced in the Paper:

UML Classes With Specs

Task              Dataset                 Model                Metric Name  Metric Value  Global Rank
Model extraction  UML Classes With Specs  three-step-original  Exact Match  0.171         # 1
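
This page does not say how the Exact Match value or the precision and recall mentioned in the abstract are computed. As a rough illustration only, the sketch below assumes that a predicted diagram and a reference diagram are each reduced to a set of element names and that an element counts as correct only when it matches exactly. The function name and this set-based reading are assumptions, not the paper's testing framework.

```python
def precision_recall(predicted, expected):
    """Set-based precision and recall over exactly matching diagram elements."""
    predicted, expected = set(predicted), set(expected)
    matched = predicted & expected
    # Guard against empty sets to avoid division by zero.
    precision = len(matched) / len(predicted) if predicted else 0.0
    recall = len(matched) / len(expected) if expected else 0.0
    return precision, recall


# Hypothetical example: two of three predicted classes appear in the reference.
p, r = precision_recall({"Student", "Person", "Course"},
                        {"Student", "Person", "Teacher", "School"})
```

Here `p` is 2/3 and `r` is 2/4. Exact string matching is strict: under this reading, a predicted class named `Course` would not match a reference class named `Courses`.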
