O-GNN: Incorporating Ring Priors into Molecular Modeling

Cyclic compounds that contain at least one ring play an important role in drug design. Despite the recent success of molecular modeling with graph neural networks (GNNs), few models explicitly take rings in compounds into consideration, consequently limiting the expressiveness of the models. In this work, we design a new variant of GNN, ring-enhanced GNN (O-GNN), that explicitly models rings in addition to atoms and bonds in compounds. In O-GNN, each ring is represented by a latent vector, which contributes to and is iteratively updated by atom and bond representations. Theoretical analysis shows that O-GNN is able to distinguish two isomorphic subgraphs lying on different rings using only one layer while conventional graph convolutional neural networks require multiple layers to distinguish, demonstrating that O-GNN is more expressive. Through experiments, O-GNN shows good performance on public datasets. In particular, it achieves state-of-the-art validation result on the PCQM4Mv1 benchmark (outperforming the previous KDDCup champion solution) and the drug-drug interaction prediction task on DrugBank. Furthermore, O-GNN outperforms strong baselines (without modeling rings) on the molecular property prediction and retrosynthesis prediction tasks.

PDF Abstract

Results from the Paper


 Ranked #1 on Graph Regression on PCQM4M-LSC (Validation MAE metric)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Graph Regression PCQM4M-LSC O-GNN Validation MAE 0.1148 # 1
Single-step retrosynthesis USPTO-50k O-GNN (reaction class unknown) Top-1 accuracy 54.1 # 17
Top-3 accuracy 77.7 # 10
Top-5 accuracy 86.0 # 8
Top-10 accuracy 92.5 # 6
Top-50 accuracy 98.2 # 2
Single-step retrosynthesis USPTO-50k O-GNN (reaction class as prior) Top-1 accuracy 65.7 # 5
Top-3 accuracy 87.7 # 1
Top-5 accuracy 93.4 # 1
Top-10 accuracy 96.9 # 1
Top-50 accuracy 98.3 # 1

Methods


No methods listed for this paper. Add relevant methods here