CCL 2021 • Shen Tielin, Wang Daling, Feng Shi, Zhang Yifei
For general sentences whose target entity is a multi-token word, we further present the differences in the last hidden states of the [MASK]-entity (MASK-lhs for short) in BERT between noise and non-noise sentences. We regard the dependency and MASK-lhs in BERT as two semantic features of sentences.
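As a rough illustration of what MASK-lhs refers to, the sketch below replaces a multi-token entity span with a [MASK] token and reads off the hidden-state row at that position. It is a minimal sketch under assumptions, not the authors' implementation: the helper names `mask_entity` and `mask_lhs` are hypothetical, collapsing the whole span into a single [MASK] is an assumption (the text does not say whether one mask per token is used), and the hidden-state matrix here is a hand-written placeholder standing in for the final-layer output of a BERT encoder.

```python
from typing import List, Sequence, Tuple

MASK_TOKEN = "[MASK]"


def mask_entity(tokens: Sequence[str], start: int, end: int) -> Tuple[List[str], int]:
    """Replace tokens[start:end] with one [MASK] token.

    Assumption: the multi-token entity is collapsed into a single mask.
    Returns the masked token list and the index of the mask.
    """
    if not (0 <= start < end <= len(tokens)):
        raise ValueError("entity span must be a non-empty range inside tokens")
    masked = list(tokens[:start]) + [MASK_TOKEN] + list(tokens[end:])
    return masked, start


def mask_lhs(last_hidden: Sequence[Sequence[float]], mask_index: int) -> List[float]:
    """Return the last-layer hidden vector at the mask position.

    last_hidden is expected to hold one vector per token of the masked sentence.
    """
    return list(last_hidden[mask_index])


# Illustrative sentence; the entity "barack obama" spans tokens 1..2.
tokens = ["[CLS]", "barack", "obama", "was", "born", "in", "hawaii", "[SEP]"]
masked_tokens, mask_index = mask_entity(tokens, 1, 3)

# Placeholder hidden states (one 2-dimensional vector per masked token).
# In practice these would come from an encoder run on masked_tokens.
placeholder_hidden = [[float(i), float(i) * 0.5] for i in range(len(masked_tokens))]
entity_vector = mask_lhs(placeholder_hidden, mask_index)
```

Comparing such vectors between sentences labelled as noise and non-noise would then be one way to inspect the difference the text describes.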