Specializing Pre-trained Language Models for Better Relational Reasoning via Network Pruning

Findings (NAACL) 2022 · Siyu Ren, Kenny Zhu

Pretrained masked language models (PLMs) have been shown to inherit a considerable amount of relational knowledge from their source corpora. In this paper, we present an in-depth and comprehensive study of specializing PLMs into relational models from the perspective of network pruning. We show that it is possible to find subnetworks capable of representing grounded commonsense relations at non-trivial sparsity while being more generalizable than the original PLMs in scenarios requiring knowledge of single or multiple commonsense relations.
