HEProto: A Hierarchical Enhancing ProtoNet based on Multi-Task Learning for Few-shot Named Entity Recognition

Few-shot Named Entity Recognition (NER) task, which aims to identify and classify entities from different domains with limited training samples, has long been treated as a basic step for knowledge graph (KG) construction. Great efforts have been made on this task with competitive performance, however, they usually treat the two subtasks, namely span detection and type classification, as mutually independent, and the integrity and correlation between subtasks have been largely ignored. Moreover, prior arts may fail to absorb the coarse-grained features of entities, resulting in a semantic-insufficient representation of entity types. To that end, in this paper, we propose a Hierarchical Enhancing ProtoNet (HEProto) based on multi-task learning, which is utilized to jointly learn these two subtasks and model their correlation. Specifically, we adopt contrastive learning to enhance the span boundary information and the type semantic representations in these two subtasks. Then, the hierarchical prototypical network is designed to leverage the coarse-grained information of entities in the type classification stage, which could help the model to better learn the fine-grained semantic representations. Along this line, we construct a similarity margin loss to reduce the similarity between fine-grained entities and other irrelevant coarse-grained prototypes. Finally, extensive experiments on the Few-NERD dataset prove that our solution outperforms competitive baseline methods. The source code of HEProto is available at https://github.com/fanshu6hao/HEProto.

PDF

Datasets


Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Few-shot NER Few-NERD (INTER) HEProto 5 way 1~2 shot 66.40±0.18 # 3
5 way 5~10 shot 72.53±0.11 # 3
10 way 1~2 shot 60.91±0.20 # 4
10 way 5~10 shot 68.92±0.20 # 4
Few-shot NER Few-NERD (INTRA) HEProto 5 way 1~2 shot 53.03±0.30 # 4
5 way 5~10 shot 65.70±0.21 # 4
10 way 1~2 shot 46.45±0.21 # 4
10 way 5~10 shot 58.98±0.22 # 4

Methods