Real-Time Novel Event Detection from Social Media

被引:37
作者
Li, Quanzhi [1 ]
Nourbakhsh, Armineh [1 ]
Shah, Sameena [1 ]
Liu, Xiaomo [1 ]
机构
[1] Thomson Reuters, Res & Dev, 3 Times Sq, New York, NY 10036 USA
来源
2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017) | 2017年
关键词
event detection; event novelty; novel event; temporal identification; temporal information; semantic class; social media;
D O I
10.1109/ICDE.2017.157
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a new approach for detecting novel events from social media, specially Twitter, at real-time. An event is usually defined by who, what, where and when, and an event tweet usually contains terms corresponding to these aspects. To exploit this information, we propose a method that incorporates simple semantics by splitting the tweet term space into groups of terms that have the meaning of the same type. These groups are called semantic categories (classes) and each reflects one or more event aspects. The semantic classes include named entity, mention, location, hashtag, verb, noun and embedded link. To group tweets talking about the same event into the same cluster, similarity measuring is conducted by calculating class-wise similarity and then aggregating them together. Users of a real-time event detection system are usually only interested in novel (new) events, which are happening now or just happened a short time ago. To fulfill this requirement, a temporal identification module is used to filter out event clusters that are about old stories. The clustering module also computes a novelty score for each event cluster, which reflects how novel the event is, compared to previous events. We evaluated our event detection method using multiple quality metrics and a large-scale event corpus having millions of tweets. The experiment results show that the proposed online event detection method achieves the state-of-the-art performance. Our experiment also shows that the temporal identification module can effectively detect old events.
引用
收藏
页码:1129 / 1139
页数:11
相关论文
共 50 条
  • [31] Real-Time Focused Extraction of Social Media Users
    Martinez-Castano, Rodrigo
    Losada, David E.
    Pichel, Juan C.
    IEEE ACCESS, 2022, 10 : 42607 - 42622
  • [32] A platform for real-time opinion mining from social media and news streams
    Tsirakis, Nikos
    Poulopoulos, Vasilis
    Tsantilas, Panagiotis
    Varlamis, Iraklis
    2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 2, 2015, : 223 - 228
  • [33] A New Mashup Based Method for Event Detection from Social Media
    Abir Troudi
    Corinne Amel Zayani
    Salma Jamoussi
    Ikram Amous Ben Amor
    Information Systems Frontiers, 2018, 20 : 981 - 992
  • [34] A New Mashup Based Method for Event Detection from Social Media
    Troudi, Abir
    Zayani, Corinne Amel
    Jamoussi, Salma
    Ben Amor, Ikram Amous
    INFORMATION SYSTEMS FRONTIERS, 2018, 20 (05) : 981 - 992
  • [35] Generalized durative event detection on social media
    Zhang, Yihong
    Shirakawa, Masumi
    Hara, Takahiro
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2023, 60 (01) : 73 - 95
  • [36] A bibliometric analysis of event detection in social media
    Chen, Xieling
    Wang, Shan
    Tang, Yong
    Hao, Tianyong
    ONLINE INFORMATION REVIEW, 2019, 43 (01) : 29 - 52
  • [37] Event Detection Over Twitter Social Media
    Kanwar, Sartaj
    Mangal, Nimita
    Niyogi, Rajdeep
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND COMMUNICATION, 2017, 458 : 177 - 185
  • [38] Generalized durative event detection on social media
    Yihong Zhang
    Masumi Shirakawa
    Takahiro Hara
    Journal of Intelligent Information Systems, 2023, 60 : 73 - 95
  • [39] GeoBurst: Real-Time Local Event Detection in Geo-Tagged Tweet Streams
    Zhang, Chao
    Zhou, Guangyu
    Yuan, Quan
    Zhuang, Honglei
    Zheng, Yu
    Kaplan, Lance
    Wang, Shaowen
    Han, Jiawei
    SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 513 - 522
  • [40] Real-time Detection of Data Completeness Degree for Traffic Simulation Using Text Similarity and Time Relevance of Data from Social Media
    Putri, Eviana Tjatur
    Buliali, Joko Lianto
    Ermawati, Myrna
    2018 2ND INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS), 2018, : 109 - 114