TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Molecular Property Prediction	BACE	GROVER (base)	ROC-AUC	82.6	# 6
Molecular Property Prediction	BACE	GROVER (large)	ROC-AUC	81.0	# 7
Molecular Property Prediction	BBBP	GROVER (large)	ROC-AUC	69.5	# 10
Molecular Property Prediction	BBBP	GROVER (base)	ROC-AUC	70.0	# 8
Molecular Property Prediction	ClinTox	GROVER (large)	ROC-AUC	76.2	# 13
Molecular Property Prediction	ClinTox	GROVER (large)	Molecules (M)	11	# 4
Molecular Property Prediction	ClinTox	GROVER (base)	ROC-AUC	81.2	# 10
Molecular Property Prediction	ClinTox	GROVER (base)	Molecules (M)	11	# 4
Molecular Property Prediction	FreeSolv	GROVER (large)	RMSE	2.272	# 6
Molecular Property Prediction	FreeSolv	GROVER (base)	RMSE	2.176	# 5
Molecular Property Prediction	Lipophilicity	GROVER (base)	RMSE	0.817	# 8
Molecular Property Prediction	Lipophilicity	GROVER (large)	RMSE	0.823	# 9
Molecular Property Prediction	QM7	GROVER (large)	MAE	92.0	# 4
Molecular Property Prediction	QM7	GROVER (base)	MAE	94.5	# 6
Molecular Property Prediction	QM8	GROVER (large)	MAE	0.0224	# 7
Molecular Property Prediction	QM8	GROVER (base)	MAE	0.0218	# 6
Molecular Property Prediction	QM9	GROVER (large)	MAE	0.00986	# 7
Molecular Property Prediction	QM9	GROVER (base)	MAE	0.00984	# 6
Molecular Property Prediction	SIDER	GROVER (base)	ROC-AUC	64.8	# 9
Molecular Property Prediction	SIDER	GROVER (large)	ROC-AUC	65.4	# 8
Molecular Property Prediction	Tox21	GROVER (large)	ROC-AUC	73.5	# 12
Molecular Property Prediction	Tox21	GROVER (base)	ROC-AUC	74.3	# 10
Molecular Property Prediction	ToxCast	GROVER (base)	ROC-AUC	65.4	# 5
Molecular Property Prediction	ToxCast	GROVER (large)	ROC-AUC	65.3	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/grover-self-supervised-message-passing/molecular-property-prediction-on-qm7)](https://paperswithcode.com/sota/molecular-property-prediction-on-qm7?p=grover-self-supervised-message-passing)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/grover-self-supervised-message-passing/molecular-property-prediction-on-freesolv)](https://paperswithcode.com/sota/molecular-property-prediction-on-freesolv?p=grover-self-supervised-message-passing)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/grover-self-supervised-message-passing/molecular-property-prediction-on-toxcast-1)](https://paperswithcode.com/sota/molecular-property-prediction-on-toxcast-1?p=grover-self-supervised-message-passing)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/grover-self-supervised-message-passing/molecular-property-prediction-on-bace-1)](https://paperswithcode.com/sota/molecular-property-prediction-on-bace-1?p=grover-self-supervised-message-passing)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/grover-self-supervised-message-passing/molecular-property-prediction-on-qm8)](https://paperswithcode.com/sota/molecular-property-prediction-on-qm8?p=grover-self-supervised-message-passing)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/grover-self-supervised-message-passing/molecular-property-prediction-on-qm9)](https://paperswithcode.com/sota/molecular-property-prediction-on-qm9?p=grover-self-supervised-message-passing)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/grover-self-supervised-message-passing/molecular-property-prediction-on-bbbp-1)](https://paperswithcode.com/sota/molecular-property-prediction-on-bbbp-1?p=grover-self-supervised-message-passing)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/grover-self-supervised-message-passing/molecular-property-prediction-on)](https://paperswithcode.com/sota/molecular-property-prediction-on?p=grover-self-supervised-message-passing)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/grover-self-supervised-message-passing/molecular-property-prediction-on-sider-1)](https://paperswithcode.com/sota/molecular-property-prediction-on-sider-1?p=grover-self-supervised-message-passing)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/grover-self-supervised-message-passing/molecular-property-prediction-on-clintox-1)](https://paperswithcode.com/sota/molecular-property-prediction-on-clintox-1?p=grover-self-supervised-message-passing)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/grover-self-supervised-message-passing/molecular-property-prediction-on-tox21-1)](https://paperswithcode.com/sota/molecular-property-prediction-on-tox21-1?p=grover-self-supervised-message-passing)`

Self-Supervised Graph Transformer on Large-Scale Molecular Data

NeurIPS 2020 · Yu Rong, Yatao Bian, Tingyang Xu, Weiyang Xie, Ying WEI, Wenbing Huang, Junzhou Huang ·

How to obtain informative representations of molecules is a crucial prerequisite in AI-driven drug design and discovery. Recent researches abstract molecules as graphs and employ Graph Neural Networks (GNNs) for molecular representation learning. Nevertheless, two issues impede the usage of GNNs in real scenarios: (1) insufficient labeled molecules for supervised training; (2) poor generalization capability to new-synthesized molecules. To address them both, we propose a novel framework, GROVER, which stands for Graph Representation frOm self-superVised mEssage passing tRansformer. With carefully designed self-supervised tasks in node-, edge- and graph-level, GROVER can learn rich structural and semantic information of molecules from enormous unlabelled molecular data. Rather, to encode such complex information, GROVER integrates Message Passing Networks into the Transformer-style architecture to deliver a class of more expressive encoders of molecules. The flexibility of GROVER allows it to be trained efficiently on large-scale molecular dataset without requiring any supervision, thus being immunized to the two issues mentioned above. We pre-train GROVER with 100 million parameters on 10 million unlabelled molecules -- the biggest GNN and the largest training dataset in molecular representation learning. We then leverage the pre-trained GROVER for molecular property prediction followed by task-specific fine-tuning, where we observe a huge improvement (more than 6% on average) from current state-of-the-art methods on 11 challenging benchmarks. The insights we gained are that well-designed self-supervision losses and largely-expressive pre-trained models enjoy the significant potential on performance boosting.

PDF Abstract NeurIPS 2020 PDF NeurIPS 2020 Abstract

Code

Add Remove Mark official

tencent-ailab/grover official

310

deepchem/deepchem

5,106

dengjianyuan/respite_mpp

Tasks

Add Remove

Molecular Property Prediction

molecular representation

Property Prediction

Representation Learning

Datasets

MoleculeNet

QM9

Tox21 QM7 BBBP (Blood-Brain Barrier Penetration) BACE (β-secretase enzyme)

SIDER ClinTox ToxCast (Toxicity Forecaster) FreeSolv (Free Solvation) QM8

Results from the Paper

Add Remove

Ranked #4 on Molecular Property Prediction on QM7

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Uses Extra Training Data	Benchmark
Molecular Property Prediction	Lipophilicity	GROVER (large)	RMSE	0.823	# 9		Compare

Results from Other Papers

Task	Dataset	Model	Metric Name	Metric Value	Rank	Compare
Molecular Property Prediction	BACE	GROVER (base)	ROC-AUC	82.6	# 6	See all
Molecular Property Prediction	BACE	GROVER (large)	ROC-AUC	81.0	# 7	See all
Molecular Property Prediction	BBBP	GROVER (large)	ROC-AUC	69.5	# 10	See all
Molecular Property Prediction	BBBP	GROVER (base)	ROC-AUC	70.0	# 8	See all
Molecular Property Prediction	ClinTox	GROVER (large)	ROC-AUC	76.2	# 13	See all
Molecular Property Prediction	ClinTox	GROVER (large)	Molecules (M)	11	# 4	See all
Molecular Property Prediction	ClinTox	GROVER (base)	ROC-AUC	81.2	# 10	See all
Molecular Property Prediction	ClinTox	GROVER (base)	Molecules (M)	11	# 4	See all
Molecular Property Prediction	FreeSolv	GROVER (large)	RMSE	2.272	# 6	See all
Molecular Property Prediction	FreeSolv	GROVER (base)	RMSE	2.176	# 5	See all
Molecular Property Prediction	Lipophilicity	GROVER (base)	RMSE	0.817	# 8	See all
Molecular Property Prediction	QM7	GROVER (large)	MAE	92.0	# 4	See all
Molecular Property Prediction	QM7	GROVER (base)	MAE	94.5	# 6	See all
Molecular Property Prediction	QM8	GROVER (large)	MAE	0.0224	# 7	See all
Molecular Property Prediction	QM8	GROVER (base)	MAE	0.0218	# 6	See all
Molecular Property Prediction	QM9	GROVER (large)	MAE	0.00986	# 7	See all
Molecular Property Prediction	QM9	GROVER (base)	MAE	0.00984	# 6	See all
Molecular Property Prediction	SIDER	GROVER (base)	ROC-AUC	64.8	# 9	See all
Molecular Property Prediction	SIDER	GROVER (large)	ROC-AUC	65.4	# 8	See all
Molecular Property Prediction	Tox21	GROVER (large)	ROC-AUC	73.5	# 12	See all
Molecular Property Prediction	Tox21	GROVER (base)	ROC-AUC	74.3	# 10	See all
Molecular Property Prediction	ToxCast	GROVER (base)	ROC-AUC	65.4	# 5	See all
Molecular Property Prediction	ToxCast	GROVER (large)	ROC-AUC	65.3	# 6	See all

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Self-Supervised Graph Transformer on Large-Scale Molecular Data

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove