Extract the Knowledge of Graph Neural Networks and Go Beyond it: An Effective Knowledge Distillation Framework

4 Mar 2021  ·  Cheng Yang, Jiawei Liu, Chuan Shi

Semi-supervised learning on graphs is an important problem in machine learning. In recent years, state-of-the-art classification methods based on graph neural networks (GNNs) have shown their superiority over traditional ones such as label propagation. However, the sophisticated architectures of these neural models lead to a complex prediction mechanism that cannot make full use of valuable prior knowledge in the data, e.g., that structurally correlated nodes tend to have the same class. In this paper, we propose a framework based on knowledge distillation to address these issues. Our framework extracts the knowledge of an arbitrary learned GNN model (the teacher model) and injects it into a well-designed student model. The student model is built with two simple prediction mechanisms, i.e., label propagation and feature transformation, which naturally preserve structure-based and feature-based prior knowledge, respectively. Specifically, we design the student model as a trainable combination of parameterized label propagation and feature transformation modules. As a result, the learned student can benefit from both the prior knowledge and the knowledge in GNN teachers for more effective predictions. Moreover, the learned student model has a more interpretable prediction process than GNNs. We conduct experiments on five public benchmark datasets and employ seven GNN models, including GCN, GAT, APPNP, SAGE, SGC, GCNII and GLP, as the teacher models. Experimental results show that the learned student model consistently outperforms its corresponding teacher model by 1.4%-4.7% on average. Code and data are available at https://github.com/BUPT-GAMMA/CPF
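To make the student's two prediction mechanisms concrete, here is a minimal PyTorch sketch under simplifying assumptions: the names (`CPFStudent`, `distill_loss`) and hyperparameters are illustrative rather than the released code's API, `adj_norm` is assumed to be a row-normalized adjacency matrix, and the paper's parameterized label propagation additionally learns per-edge confidence weights that this sketch omits for brevity.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class CPFStudent(nn.Module):
    """Sketch of a CPF-style student: a per-node trainable combination of
    label propagation (structure prior) and an MLP over raw features
    (feature prior)."""

    def __init__(self, num_nodes, feat_dim, num_classes, num_hops=5, hidden=64):
        super().__init__()
        self.num_hops = num_hops
        # Feature-transformation module: a simple two-layer MLP.
        self.mlp = nn.Sequential(
            nn.Linear(feat_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, num_classes),
        )
        # One trainable balance parameter per node.
        self.balance = nn.Parameter(torch.zeros(num_nodes, 1))

    def forward(self, feats, soft_labels, adj_norm):
        # Label propagation: smooth the teacher's soft labels over the
        # row-normalized adjacency for a few hops. (The paper's module
        # also learns edge weights, omitted in this sketch.)
        h = soft_labels
        for _ in range(self.num_hops):
            h = adj_norm @ h
        # Feature transformation applied to raw node features.
        z = F.softmax(self.mlp(feats), dim=-1)
        # Trainable convex combination of the two priors.
        a = torch.sigmoid(self.balance)
        return a * h + (1.0 - a) * z


def distill_loss(student_probs, teacher_probs):
    # Match the student's predictions to the teacher's soft labels;
    # labeled nodes can additionally be supervised with hard labels.
    return F.kl_div(student_probs.clamp_min(1e-10).log(),
                    teacher_probs, reduction="batchmean")
```

Because the balance parameter is per-node, inspecting `torch.sigmoid(balance)` after training reveals how much each node's prediction relies on structure versus features, which is the source of the interpretability claimed above.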


Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|------|---------|-------|-------------|--------------|-------------|
| Node Classification | AMZ Computers | CPF-ind-GAT | Accuracy | 85.5% | # 1 |
| Node Classification | AMZ Photo | CPF-ind-GAT | Accuracy | 94.10% | # 6 |
| Node Classification | CiteSeer with Public Split: fixed 20 nodes per class | CPF-tra-APPNP | Accuracy | 74.6% | # 5 |
| Node Classification | Cora (0.5%) | CPF-ind-APPNP | Accuracy | 77.3% | # 1 |
| Node Classification | Cora (1%) | CPF-ind-APPNP | Accuracy | 80.24% | # 1 |
| Node Classification | Cora (3%) | CPF-tra-GCNII | Accuracy | 84.18% | # 1 |
| Node Classification | Cora: fixed 10 nodes per class | CPF-tra-GCNII | Accuracy | 84.1% | # 1 |
| Node Classification | Cora: fixed 5 nodes per class | CPF-tra-APPNP | Accuracy | 80.26% | # 1 |
| Node Classification | Cora with Public Split: fixed 20 nodes per class | CPF-ind-APPNP | Accuracy | 85.3% | # 4 |
| Node Classification | PubMed with Public Split: fixed 20 nodes per class | CPF-tra-GCNII | Accuracy | 83.20% | # 2 |
