TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Node Property Prediction	ogbn-arxiv	GIANT+XRT+GATv2	Test Accuracy	0.7415 ± 0.0005	# 25
Node Property Prediction	ogbn-arxiv	GIANT+XRT+GATv2	Validation Accuracy	0.7527 ± 0.0008	# 21
Node Property Prediction	ogbn-arxiv	GIANT+XRT+GATv2	Number of params	207520	# 56
Node Property Prediction	ogbn-arxiv	GIANT+XRT+GATv2	Ext. data	Yes	# 1

How Attentive are Graph Attention Networks?

ICLR 2022 · Shaked Brody, Uri Alon, Eran Yahav

Graph Attention Networks (GATs) are one of the most popular GNN architectures and are considered the state-of-the-art architecture for representation learning with graphs. In GAT, every node attends to its neighbors given its own representation as the query. However, in this paper we show that GAT computes a very limited kind of attention: the ranking of the attention scores is unconditioned on the query node. We formally define this restricted kind of attention as static attention and distinguish it from a strictly more expressive dynamic attention. Because GATs use a static attention mechanism, there are simple graph problems that GAT cannot express: in a controlled problem, we show that static attention hinders GAT from even fitting the training data. To remove this limitation, we introduce a simple fix by modifying the order of operations and propose GATv2: a dynamic graph attention variant that is strictly more expressive than GAT. We perform an extensive evaluation and show that GATv2 outperforms GAT across 11 OGB and other benchmarks while matching its parametric costs. Our code is available at https://github.com/tech-srl/how_attentive_are_gats. GATv2 is available as part of the PyTorch Geometric library, the Deep Graph Library, and the TensorFlow GNN library.
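
The difference between static and dynamic attention can be illustrated with a small NumPy sketch. It is not the authors' implementation: the scoring functions follow the formulations as we understand them from the paper (GAT applies LeakyReLU after the attention vector, GATv2 applies it before), while the helper names, the one-hot toy features, the hand-picked GATv2 weights, and the all-pairs scoring (every node treated as a neighbor of every other) are assumptions made for illustration.

```python
import numpy as np

def leaky_relu(x, slope=0.2):
    return np.where(x > 0, x, slope * x)

def gat_scores(H, W, a):
    # GAT: e(h_i, h_j) = LeakyReLU(a^T [W h_i || W h_j])
    d_out = W.shape[0]
    query_part = H @ W.T @ a[:d_out]  # one scalar per query node i
    key_part = H @ W.T @ a[d_out:]    # one scalar per key node j
    # The score is a monotone function of query_part[i] + key_part[j],
    # so the ranking over keys j does not depend on the query i.
    return leaky_relu(query_part[:, None] + key_part[None, :])

def gatv2_scores(H, W_query, W_key, a):
    # GATv2: e(h_i, h_j) = a^T LeakyReLU(W [h_i || h_j]), W = [W_query | W_key]
    z = (H @ W_query.T)[:, None, :] + (H @ W_key.T)[None, :, :]
    return leaky_relu(z) @ a

n = 4
H = np.eye(n)  # hypothetical toy input: one-hot node features

# GAT with arbitrary (random) weights: every query prefers the same key.
rng = np.random.default_rng(0)
W = rng.normal(size=(8, n))
a = rng.normal(size=16)
static_top = gat_scores(H, W, a).argmax(axis=1)

# GATv2 with hand-picked weights: query i prefers key i.
# z = e_i - e_j, so the score is 0 when j == i and -0.8 otherwise.
dynamic_top = gatv2_scores(H, np.eye(n), -np.eye(n), -np.ones(n)).argmax(axis=1)

print(static_top)   # all entries identical
print(dynamic_top)  # [0 1 2 3]
```

The GATv2 weights above solve a lookup-style task in which each query must select the key with its own index. No choice of `W` and `a` lets `gat_scores` do the same, because its top-ranked key is shared by all queries.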

Code

Repository	Stars
tech-srl/how_attentive_are_gats (official)	282
labmlai/annotated_deep_learning_pap… (annotated code at labml.ai)	48,096
rusty1s/pytorch_geometric	20,120
dmlc/dgl	13,001
tensorflow/gnn	633

The page lists 8 implementations in total; five are shown here.

Tasks

Graph Attention

Graph Property Prediction

Link Prediction

Node Property Prediction

Representation Learning

Datasets

OGB

QM9

Results from the Paper

Ranked #25 on Node Property Prediction on ogbn-arxiv

Methods

GATv2
