Unified Interpretation of Softmax Cross-Entropy and Negative Sampling: With Case Study for Knowledge Graph Embedding

ACL 2021  ·  Hidetaka Kamigaito, Katsuhiko Hayashi ·

In knowledge graph embedding, the theoretical relationship between the softmax cross-entropy and negative sampling loss functions has not been investigated. This makes it difficult to fairly compare the results of the two different loss functions. We attempted to solve this problem by using the Bregman divergence to provide a unified interpretation of the softmax cross-entropy and negative sampling loss functions. Under this interpretation, we can derive theoretical findings for fair comparison. Experimental results on the FB15k-237 and WN18RR datasets show that the theoretical findings are valid in practical settings.

PDF Abstract ACL 2021 PDF ACL 2021 Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Link Prediction FB15k-237 RESCAL (SCE w/ LS pretrained) MRR 0.364 # 19
Hits@10 0.55 # 11
Hits@3 0.402 # 11
Hits@1 0.269 # 14
Link Prediction FB15k-237 RESCAL (SCE w/ LS) MRR 0.363 # 20
Hits@10 0.548 # 17
Hits@3 0.4 # 13
Hits@1 0.269 # 14
Link Prediction WN18RR ComplEx (SCE w/ LS) MRR 0.477 # 41
Hits@10 0.546 # 51
Hits@3 0.491 # 33
Hits@1 0.441 # 31
Link Prediction WN18RR ComplEx (SCE w/ LS pretrained) MRR 0.481 # 34
Hits@10 0.553 # 46
Hits@3 0.496 # 28
Hits@1 0.444 # 23

Methods