Embedding Entities and Relations for Learning and Inference in Knowledge Bases

20 Dec 2014  ·  Bishan Yang, Wen-tau Yih, Xiaodong He, Jianfeng Gao, Li Deng ·

We consider learning representations of entities and relations in KBs using the neural-embedding approach. We show that most existing models, including NTN (Socher et al., 2013) and TransE (Bordes et al., 2013b), can be generalized under a unified learning framework, where entities are low-dimensional vectors learned from a neural network and relations are bilinear and/or linear mapping functions. Under this framework, we compare a variety of embedding models on the link prediction task. We show that a simple bilinear formulation achieves new state-of-the-art results for the task (achieving a top-10 accuracy of 73.2% vs. 54.7% by TransE on Freebase). Furthermore, we introduce a novel approach that utilizes the learned relation embeddings to mine logical rules such as "BornInCity(a,b) and CityInCountry(b,c) => Nationality(a,c)". We find that embeddings learned from the bilinear objective are particularly good at capturing relational semantics and that the composition of relations is characterized by matrix multiplication. More interestingly, we demonstrate that our embedding-based rule extraction approach successfully outperforms a state-of-the-art confidence-based rule mining approach in mining Horn rules that involve compositional reasoning.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Link Prediction FB15k-237 DistMult MRR 0.241 # 61
Hits@10 0.419 # 64
Link Property Prediction ogbl-biokg DistMult Test MRR 0.8043 ± 0.0003 # 14
Validation MRR 0.8055 ± 0.0003 # 14
Number of params 187648000 # 6
Ext. data No # 1
Link Property Prediction ogbl-wikikg2 DistMult (500dim) Validation MRR 0.3506 ± 0.0042 # 27
Test MRR 0.3729 ± 0.0045 # 27
Number of params 1250569500 # 26
Ext. data No # 1
Link Property Prediction ogbl-wikikg2 DistMult (100dim) Validation MRR 0.3150 ± 0.0088 # 28
Test MRR 0.3447 ± 0.0082 # 28
Number of params 250113900 # 12
Ext. data No # 1
Link Prediction UMLS DistMult Hits@10 0.846 # 10
MR 5.52 # 10
Link Prediction WN18 DistMult MRR 0.822 # 28
Hits@10 0.936 # 31
Hits@3 0.914 # 21
Hits@1 0.728 # 22
MR 902 # 20
Link Prediction WN18RR DisMult MRR 0.43 # 61
Hits@1 0.39 # 52

Methods