MGTAB: A Multi-Relational Graph-Based Twitter Account Detection Benchmark

3 Jan 2023  ·  Shuhao Shi, Kai Qiao, Jian Chen, Shuai Yang, Jie Yang, Baojie Song, Linyuan Wang, Bin Yan ·

The development of social media user stance detection and bot detection methods rely heavily on large-scale and high-quality benchmarks. However, in addition to low annotation quality, existing benchmarks generally have incomplete user relationships, suppressing graph-based account detection research. To address these issues, we propose a Multi-Relational Graph-Based Twitter Account Detection Benchmark (MGTAB), the first standardized graph-based benchmark for account detection. To our knowledge, MGTAB was built based on the largest original data in the field, with over 1.55 million users and 130 million tweets. MGTAB contains 10,199 expert-annotated users and 7 types of relationships, ensuring high-quality annotation and diversified relations. In MGTAB, we extracted the 20 user property features with the greatest information gain and user tweet features as the user features. In addition, we performed a thorough evaluation of MGTAB and other public datasets. Our experiments found that graph-based approaches are generally more effective than feature-based approaches and perform better when introducing multiple relations. By analyzing experiment results, we identify effective approaches for account detection and provide potential future research directions in this field. Our benchmark and standardized evaluation procedures are freely available at: https://github.com/GraphDetec/MGTAB.

PDF Abstract

Datasets


Introduced in the Paper:

MGTAB

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Stance Detection MGTAB RGT Acc 87.8 # 1
F1 86.9 # 1
Stance Detection MGTAB Simple-HGN Acc 85.3 # 2
F1 84.4 # 2
Twitter Bot Detection MGTAB GAT Acc 87 # 3
F1 82.3 # 3
Stance Detection MGTAB GAT Acc 82.2 # 4
F1 81 # 4
Stance Detection MGTAB GCN Acc 82.4 # 3
F1 81.5 # 3
Twitter Bot Detection MGTAB BotRGCN Acc 89.6 # 2
F1 87.2 # 2
Twitter Bot Detection MGTAB GCN Acc 85.8 # 4
F1 78.3 # 4
Twitter Bot Detection MGTAB RGT Acc 92.1 # 1
F1 90.4 # 1

Methods


GAT GCN RGCN