TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Graph Property Prediction	ogbg-molhiv	Graphormer (pre-trained on PCQM4M)	Test ROC-AUC	0.8051 ± 0.0053	# 15
Graph Property Prediction	ogbg-molhiv	Graphormer (pre-trained on PCQM4M)	Validation ROC-AUC	0.8310 ± 0.0089	# 18
Graph Property Prediction	ogbg-molhiv	Graphormer (pre-trained on PCQM4M)	Number of params	47183040	# 1
Graph Property Prediction	ogbg-molhiv	Graphormer (pre-trained on PCQM4M)	Ext. data	Yes	# 1
Graph Property Prediction	ogbg-molhiv	Graphormer + FPs	Test ROC-AUC	0.8225 ± 0.0001	# 7
Graph Property Prediction	ogbg-molhiv	Graphormer + FPs	Validation ROC-AUC	0.8396 ± 0.0001	# 10
Graph Property Prediction	ogbg-molhiv	Graphormer + FPs	Number of params	47085378	# 3
Graph Property Prediction	ogbg-molhiv	Graphormer + FPs	Ext. data	No	# 1
Graph Property Prediction	ogbg-molhiv	Graphormer	Test ROC-AUC	0.8051 ± 0.0053	# 15
Graph Property Prediction	ogbg-molhiv	Graphormer	Validation ROC-AUC	0.8310 ± 0.0089	# 18
Graph Property Prediction	ogbg-molhiv	Graphormer	Number of params	47183040	# 1
Graph Property Prediction	ogbg-molhiv	Graphormer	Ext. data	Yes	# 1
Graph Property Prediction	ogbg-molpcba	Graphormer	Test AP	0.3140 ± 0.0032	# 5
Graph Property Prediction	ogbg-molpcba	Graphormer	Validation AP	0.3227 ± 0.0024	# 4
Graph Property Prediction	ogbg-molpcba	Graphormer	Number of params	119529664	# 2
Graph Property Prediction	ogbg-molpcba	Graphormer (pre-trained on PCQM4M)	Test AP	0.3140 ± 0.0032	# 5
Graph Property Prediction	ogbg-molpcba	Graphormer (pre-trained on PCQM4M)	Validation AP	0.3227 ± 0.0024	# 4
Graph Property Prediction	ogbg-molpcba	Graphormer (pre-trained on PCQM4M)	Number of params	119529664	# 2
Graph Property Prediction	ogbg-molpcba	Graphormer (pre-trained on PCQM4M)	Ext. data	Yes	# 1
Graph Regression	PCQM4M-LSC	Graphormer	Validation MAE	0.1234	# 4
Graph Regression	PCQM4M-LSC	Graphormer	Test MAE	13.28	# 1
Graph Regression	PCQM4Mv2-LSC	Graphormer	Validation MAE	0.0864	# 11
Graph Regression	PCQM4Mv2-LSC	Graphormer	Test MAE	-	# 14
Graph Regression	ZINC-500k	Graphormer-SLIM	MAE	0.122	# 19

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/do-transformers-really-perform-bad-for-graph/graph-regression-on-pcqm4m-lsc)](https://paperswithcode.com/sota/graph-regression-on-pcqm4m-lsc?p=do-transformers-really-perform-bad-for-graph)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/do-transformers-really-perform-bad-for-graph/graph-property-prediction-on-ogbg-molpcba)](https://paperswithcode.com/sota/graph-property-prediction-on-ogbg-molpcba?p=do-transformers-really-perform-bad-for-graph)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/do-transformers-really-perform-bad-for-graph/graph-property-prediction-on-ogbg-molhiv)](https://paperswithcode.com/sota/graph-property-prediction-on-ogbg-molhiv?p=do-transformers-really-perform-bad-for-graph)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/do-transformers-really-perform-bad-for-graph/graph-regression-on-pcqm4mv2-lsc)](https://paperswithcode.com/sota/graph-regression-on-pcqm4mv2-lsc?p=do-transformers-really-perform-bad-for-graph)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/do-transformers-really-perform-bad-for-graph/graph-regression-on-zinc-500k)](https://paperswithcode.com/sota/graph-regression-on-zinc-500k?p=do-transformers-really-perform-bad-for-graph)`

Do Transformers Really Perform Bad for Graph Representation?

9 Jun 2021 · Chengxuan Ying, Tianle Cai, Shengjie Luo, Shuxin Zheng, Guolin Ke, Di He, Yanming Shen, Tie-Yan Liu

The Transformer architecture has become a dominant choice in many domains, such as natural language processing and computer vision. Yet, it has not achieved competitive performance on popular leaderboards of graph-level prediction compared to mainstream GNN variants. Therefore, it remains a mystery how Transformers could perform well for graph representation learning. In this paper, we solve this mystery by presenting Graphormer, which is built upon the standard Transformer architecture, and could attain excellent results on a broad range of graph representation learning tasks, especially on the recent OGB Large-Scale Challenge. Our key insight to utilizing Transformer in the graph is the necessity of effectively encoding the structural information of a graph into the model. To this end, we propose several simple yet effective structural encoding methods to help Graphormer better model graph-structured data. Besides, we mathematically characterize the expressive power of Graphormer and exhibit that with our ways of encoding the structural information of graphs, many popular GNN variants could be covered as the special cases of Graphormer.
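The abstract does not spell out what a structural encoding looks like. As a minimal sketch of one such idea, assume a scalar bias indexed by the shortest-path distance between two nodes is added to their attention score before the softmax, in the spirit of the paper's spatial encoding. The function names, the toy graph, the bias values, and the handling of unreachable pairs are all hypothetical illustrations, not the authors' implementation.

```python
import math
from collections import deque


def shortest_path_distances(num_nodes, edges):
    """All-pairs shortest-path distances on an undirected, unweighted graph via BFS.

    Unreachable pairs are marked with -1.
    """
    adjacency = [[] for _ in range(num_nodes)]
    for u, v in edges:
        adjacency[u].append(v)
        adjacency[v].append(u)

    dist = [[-1] * num_nodes for _ in range(num_nodes)]
    for source in range(num_nodes):
        dist[source][source] = 0
        queue = deque([source])
        while queue:
            node = queue.popleft()
            for neighbor in adjacency[node]:
                if dist[source][neighbor] == -1:
                    dist[source][neighbor] = dist[source][node] + 1
                    queue.append(neighbor)
    return dist


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def biased_attention(scores, dist, bias_by_distance, unreachable_bias=-1e9):
    """Add a distance-indexed scalar bias to raw attention scores, then softmax each row.

    Distances beyond the bias table reuse its last entry. Masking unreachable
    pairs with a large negative bias is an assumption made for this sketch.
    """
    n = len(scores)
    weights = []
    for i in range(n):
        row = []
        for j in range(n):
            d = dist[i][j]
            if d == -1:
                bias = unreachable_bias
            else:
                bias = bias_by_distance[min(d, len(bias_by_distance) - 1)]
            row.append(scores[i][j] + bias)
        weights.append(softmax(row))
    return weights


# Toy path graph 0 - 1 - 2 - 3 with uniform content scores,
# so any difference in attention comes from the structural bias alone.
edges = [(0, 1), (1, 2), (2, 3)]
dist = shortest_path_distances(4, edges)
scores = [[0.0] * 4 for _ in range(4)]
bias_by_distance = [0.0, -0.5, -1.0, -1.5]  # hypothetical; learned in a real model
attention = biased_attention(scores, dist, bias_by_distance)
```

With uniform content scores and a bias that decreases with distance, each node attends most to itself and progressively less to nodes farther away along the path. In a trained model the bias table would be learned, so it need not be monotone in distance.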

Code

microsoft/Graphormer (official), 1,925 stars

dpstart/graphormer_new

ytchx1999/Graphormer

Tasks

Graph Classification

Graph Property Prediction

Graph Regression

Graph Representation Learning

Representation Learning

Datasets

OGB

ZINC Open Graph Benchmark

OGB-LSC

PCQM4Mv2-LSC

Results from the Paper

Ranked #1 on Graph Regression on PCQM4M-LSC

Methods

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer
