Weibo-COV: A Large-Scale COVID-19 Social Media Dataset from Weibo

EMNLP (NLP-COVID19) 2020  ·  Yong Hu, Heyan Huang, Anfan Chen, Xian-Ling Mao

With the rapid spread of COVID-19, people are asked to maintain "social distance" and "stay at home". In this scenario, more and more social interactions move online, especially to social media such as Twitter and Weibo. People post tweets to share information, express opinions, and seek help during the pandemic, and these tweets are valuable for studies of COVID-19, such as disease surveillance. Therefore, in this paper, we release Weibo-COV, the first large-scale COVID-19 social media dataset from Weibo, covering more than 30 million tweets from 1 November 2019 to 30 April 2020. Moreover, the dataset's fields are rich, including basic tweet information, interaction information, location information, and the retweet network. We hope this dataset can promote studies of COVID-19 from multiple perspectives and enable better and faster research to combat the spread of the disease.


Datasets


Introduced in the Paper:

Weibo-COV