Global Self-Attention as a Replacement for Graph Convolution

7 Aug 2021  ·  Md Shamim Hussain, Mohammed J. Zaki, Dharmashankar Subramanian ·

We propose an extension to the transformer neural network architecture for general-purpose graph learning by adding a dedicated pathway for pairwise structural information, called edge channels. The resulting framework, which we call the Edge-augmented Graph Transformer (EGT), can directly accept, process, and output structural information of arbitrary form, which is important for effective learning on graph-structured data. Our model uses global self-attention exclusively as its aggregation mechanism, rather than static, localized convolutional aggregation. This allows for unconstrained, long-range dynamic interactions between nodes. Moreover, the edge channels allow the structural information to evolve from layer to layer, and prediction tasks on edges/links can be performed directly from the output embeddings of these channels. We verify the performance of EGT in a wide range of graph-learning experiments on benchmark datasets, in which it outperforms convolutional/message-passing graph neural networks. EGT sets a new state of the art for the quantum-chemical regression task on the OGB-LSC PCQM4Mv2 dataset, which contains 3.8 million molecular graphs. Our findings indicate that aggregation based on global self-attention can serve as a flexible, adaptive, and effective replacement for graph convolution in general-purpose graph learning. Therefore, convolutional local neighborhood aggregation is not an essential inductive bias.
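The core idea above, that edge channels bias global self-attention and are themselves updated each layer so pairwise structural information evolves, can be illustrated with a minimal single-head sketch. The dimensions, parameter shapes, and update rules below are illustrative assumptions, not the paper's exact equations:

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z, axis=-1):
    """Numerically stable softmax."""
    z = z - z.max(axis=axis, keepdims=True)
    ez = np.exp(z)
    return ez / ez.sum(axis=axis, keepdims=True)

def egt_style_layer(x, e, params):
    """One edge-augmented global self-attention layer (single head, sketch).

    x: (N, d) node features; e: (N, N, de) edge-channel features.
    The edge channels add a learned bias to the attention logits, and the
    logits in turn update the edge channels, so the pairwise structural
    information evolves from layer to layer. All shapes and the specific
    update rules are assumptions for illustration.
    """
    Wq, Wk, Wv, wb, wo = params
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    d = q.shape[-1]
    # Attention logits over ALL node pairs (global, not neighborhood-local),
    # biased by a projection of the edge channels.
    logits = q @ k.T / np.sqrt(d) + e @ wb        # (N, N)
    attn = softmax(logits, axis=-1)
    x_new = x + attn @ v                          # node update (residual)
    # Edge channels are updated from the logits, enabling edge/link
    # predictions directly from these channels at the output.
    e_new = e + logits[..., None] * wo            # (N, N, de)
    return x_new, e_new

# Toy usage: a 5-node graph with 8-dim node and 3-dim edge features.
N, d, de = 5, 8, 3
x = rng.standard_normal((N, d))
e = rng.standard_normal((N, N, de))
params = (rng.standard_normal((d, d)), rng.standard_normal((d, d)),
          rng.standard_normal((d, d)), rng.standard_normal(de),
          rng.standard_normal(de))
x_out, e_out = egt_style_layer(x, e, params)
print(x_out.shape, e_out.shape)  # (5, 8) (5, 5, 3)
```

Note that, unlike a convolutional/message-passing layer, no adjacency mask restricts the attention: connectivity enters only as a soft bias through the edge channels, so long-range interactions remain unconstrained.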


Results from the Paper


| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Graph Classification | CIFAR10 100k | EGT | Accuracy (%) | 68.702 | #9 |
| Node Classification | CLUSTER | EGT | Accuracy | 79.232 | #2 |
| Graph Classification | MNIST | EGT | Accuracy | 98.173 | #3 |
| Graph Property Prediction | ogbg-molhiv | EGT | Test ROC-AUC | 0.806 ± 0.0065 | #12 |
| Graph Property Prediction | ogbg-molpcba | EGT | Test AP | 0.2961 ± 0.0024 | #14 |
| Node Classification | PATTERN | EGT | Accuracy | 86.821 | #2 |
| Node Classification | PATTERN 100k | EGT | Accuracy (%) | 86.816 | #1 |
| Graph Regression | PCQM4M-LSC | EGT | Validation MAE | 0.1224 | #3 |
| Graph Regression | PCQM4Mv2-LSC | EGT | Validation MAE | 0.0857 | #9 |
| Graph Regression | PCQM4Mv2-LSC | EGT | Test MAE | 0.0862 | #7 |
| Graph Regression | PCQM4Mv2-LSC | EGT + Triangular Attention | Validation MAE | 0.0671 | #1 |
| Graph Regression | PCQM4Mv2-LSC | EGT + Triangular Attention | Test MAE | 0.0683 | #1 |
| Link Prediction | TSP/HCP Benchmark set | EGT | F1 | 0.853 | #3 |
| Graph Regression | ZINC 100k | EGT | MAE | 0.143 | #4 |
| Graph Regression | ZINC-500k | EGT | MAE | 0.108 | #18 |

Methods