Understanding over-squashing and bottlenecks on graphs via curvature

Most graph neural networks (GNNs) use the message passing paradigm, in which node features are propagated on the input graph. Recent works pointed to the distortion of information flowing from distant nodes as a factor limiting the efficiency of message passing for tasks relying on long-distance interactions. This phenomenon, referred to as 'over-squashing', has been heuristically attributed to graph bottlenecks where the number of $k$-hop neighbors grows rapidly with $k$. We provide a precise description of the over-squashing phenomenon in GNNs and analyze how it arises from bottlenecks in the graph. For this purpose, we introduce a new edge-based combinatorial curvature and prove that negatively curved edges are responsible for the over-squashing issue. We also propose and experimentally test a curvature-based graph rewiring method to alleviate the over-squashing.

PDF Abstract ICLR 2022 PDF ICLR 2022 Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Node Classification Actor SDRF Accuracy 28.42 ± 0.75 # 50
Node Classification Chameleon SDRF Accuracy 42.73±0.15 # 54
Node Classification Citeseer SDRF Accuracy 72.58±0.20 # 43
Node Classification Cora SDRF Accuracy 82.76±0.23% # 49
Node Classification Cornell SDRF Accuracy 54.60±0.39 # 52
Node Classification Pubmed SDRF Accuracy 79.10±0.11 # 51
Node Classification Squirrel SDRF Accuracy 37.05±0.17 # 45
Node Classification Texas SDRF Accuracy 64.46±0.38 # 50
Node Classification Wisconsin SDRF Accuracy 55.51±0.27 # 54

Methods


No methods listed for this paper. Add relevant methods here