Real-Time Detection of COVID-19 Events From Twitter: A Spatial-Temporally Bursty-Aware Method

被引:5
作者
Fei, Gaolei [1 ]
Cheng, Yong [1 ]
Ma, Wanlun [2 ]
Chen, Chao [3 ]
Wen, Sheng [2 ]
Hu, Guangmin [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
[2] Swinburne Univ Technol, Fac Sci Engn & Technol, Hawthorn, Vic 3122, Australia
[3] James Cook Univ, Coll Sci & Engn, Townsville, Qld 4811, Australia
基金
中国国家自然科学基金;
关键词
COVID-19; Social networking (online); Event detection; Blogs; Real-time systems; Feature extraction; Spatiotemporal phenomena; event detection; Twitter;
D O I
10.1109/TCSS.2022.3169742
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the last two years, the outbreak of COVID-19 has significantly affected human life, society, and the economy worldwide. To prevent people from contracting COVID-19 and mitigate its spread, it is crucial to timely distribute complete, accurate, and up-to-date information about the pandemic to the public. In this article, we propose a spatial-temporally bursty-aware method called STBA for real-time detection of COVID-19 events from Twitter. STBA has three consecutive stages. In the first stage, STBA identifies a set of keywords that represent COVID-19 events according to the spatiotemporally bursty characteristics of words using Ripley's K function. STBA will also filter out tweets that do not contain the keywords to reduce the interference of noise tweets on event detection. In the second stage, STBA uses online density-based spatial clustering of applications with noise clustering to aggregate tweets that describe the same event as much as possible, which provides more information for event identification. In the third stage, STBA further utilizes the temporal bursty characteristic of event location information in the clusters to identify real-world COVID-19 events. Each stage of STBA can be regarded as a noise filter. It gradually filters out COVID-19-related events from noisy tweet streams. To evaluate the performance of STBA, we collected over 116 million Twitter posts from 36 consecutive days (from March 22, 2020 to April 26, 2020) and labeled 501 real events in this dataset. We compared STBA with three state-of-the-art methods, EvenTweet, event detection via microblog cliques (EDMC), and GeoBurst+ in the evaluation. The experimental results suggest that STBA outperforms GeoBurst+ by 13.8%, 12.7%, and 13.3% in terms of precision, recall, and F ₁ score. STBA achieved even more improvements compared with EvenTweet and EDMC.
引用
收藏
页码:656 / 672
页数:17
相关论文
共 52 条
  • [1] EvenTweet: Online Localized Event Detection from Twitter
    Abdelhaq, Flamed
    Sengstock, Christian
    Gertz, Michael
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (12): : 1326 - 1329
  • [2] Arabic Event Detection in Social Media
    Alsaedi, Nasser
    Burnap, Pete
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT I, 2015, 9041 : 384 - 401
  • [3] A SURVEY OF TECHNIQUES FOR EVENT DETECTION IN TWITTER
    Atefeh, Farzindar
    Khreich, Wael
    [J]. COMPUTATIONAL INTELLIGENCE, 2015, 31 (01) : 132 - 164
  • [4] BeckerH, 2011, P 5 INT AAAI C WEBL, P438
  • [5] Bird S., 2006, P ACL 02 WORKSH EFF, P69, DOI [DOI 10.3115/1118108.1118117, 10.3115/1118108.1118117]
  • [6] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [7] Borgelt C., 2005, P 1 INT WORKSH OP SO, P1, DOI [DOI 10.1145/1133905.1133907, 10.1145/1133905.1133907]
  • [8] Cao Y., PROC WEB C, P3383
  • [9] Chen JD, 2019, AAAI CONF ARTIF INTE, P6252
  • [10] Chen L., 2009, CIKM, P523