WhatsUp: An event resolution approach for co-occurring events in social media

被引:0
作者
Hettiarachchi, Hansi [1 ]
Adedoyin-Olowe, Mariam [1 ]
Bhogal, Jagdev [1 ]
Gaber, Mohamed Medhat [1 ]
机构
[1] Birmingham City Univ, Sch Comp & Digital Technol, Birmingham, England
关键词
Word embedding; Dendrograms; Clustering; Social media; TOPIC DETECTION; TWITTER; MODEL;
D O I
10.1016/j.ins.2023.01.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid growth of social media networks has resulted in the generation of a vast data amount, making it impractical to conduct manual analyses to extract newsworthy events. Thus, automated event detection mechanisms are invaluable to the community. However, a clear majority of the available approaches rely only on data statistics without considering linguistics. A few approaches involved linguistics, only to extract textual event details without the corresponding temporal details. Since linguistics define words' structure and meaning, a severe information loss can happen without considering them. Targeting this limitation, we propose a novel method named WhatsUp to detect temporal and fine-grained textual event details, using linguistics captured by self-learned word embeddings and their hierarchical relationships and statistics captured by frequency-based measures. We evaluate our approach on recent social media data from two diverse domains and com-pare the performance with several state-of-the-art methods. Evaluations cover temporal and textual event aspects, and results show that WhatsUp notably outperforms state-of-the-art methods. We also analyse the efficiency, revealing that WhatsUp is sufficiently fast for (near) real-time detection. Further, the usage of unsupervised learning techniques, including self-learned embedding, makes our approach expandable to any language, plat-form and domain and provides capabilities to understand data-specific linguistics. (c) 2023 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:553 / 577
页数:25
相关论文
共 50 条
  • [1] A rule dynamics approach to event detection in Twitter with its application to sports and politics
    Adedoyin-Olowe, Mariam
    Gaber, Mohamed Medhat
    Dancausa, Carlos M.
    Stahl, Frederic
    Gomes, Joao Bartolo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 55 : 351 - 360
  • [2] [Anonymous], 2012, P COLING 2012
  • [3] Antoniak M., 2018, Trans. Assoc. Comput. Linguistics, V6, P107, DOI [DOI 10.1162/TACLA00008, 10.1162/tacla00008]
  • [4] Castillo C, 2011, P 20 INT C WORLD WID, P675
  • [5] Chaffey D., 2021, Global social media research summary 2021
  • [6] A nonparametric model for online topic discovery with word embeddings
    Chen, Junyang
    Gong, Zhiguo
    Liu, Weiwen
    [J]. INFORMATION SCIENCES, 2019, 504 : 32 - 47
  • [7] Social event detection with retweeting behavior correlation
    Chen, Xi
    Zhou, Xiangmin
    Sellis, Timos
    Li, Xue
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 114 : 516 - 523
  • [8] Emerging topic detection in twitter stream based on high utility pattern mining
    Choi, Hyeok-Jun
    Park, Cheong Hee
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 115 : 27 - 36
  • [9] Word Embedding based Clustering to Detect Topics in Social Media
    Comito, Carmela
    Forestiero, Agostino
    Pizzuti, Clara
    [J]. 2019 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2019), 2019, : 192 - 199
  • [10] Bursty Event Detection in Twitter Streams
    Comito, Carmela
    Forestiero, Agostino
    Pizzuti, Clara
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2019, 13 (04)