Named Entity Recognition in the Romanian Legal Domain

Recognition of named entities present in text is an important step towards information extraction and natural language understanding. This work presents a named entity recognition system for the Romanian legal domain. The system makes use of the gold annotated LegalNERo corpus. Furthermore, the system combines multiple distributional representations of words, including word embeddings trained on a large legal domain corpus. All the resources, including the corpus, model and word embeddings are open sourced. Finally, the best system is available for direct usage in the RELATE platform.

PDF Abstract

Datasets


Introduced in the Paper:

LegalNERo
Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Named Entity Recognition (NER) LegalNERo Marcell Avg F1 85.34 # 1

Methods


No methods listed for this paper. Add relevant methods here