Extracting news events from microblogs

被引:7
|
作者
Repp, Oystein [1 ]
Ramampiaro, Heri [1 ]
机构
[1] Norwegian Univ Sci & Technol, Dept Comp Sci, Trondheim, Norway
关键词
Text mining; Deep Learning; Word Embedding; Information Extraction; Event Detection; Social Media;
D O I
10.1080/09720510.2018.1486273
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Twitter stream has become a large source of information, but the magnitude of tweets posted and the noisy nature of its content makes harvesting of knowledge from Twitter has challenged researchers for long time. Aiming at overcoming some of the main challenges of extracting hidden information from tweet streams, this work proposes a new approach for real-time detection of news events from the Twitter stream. We divide our approach into three steps. The first step is to use a neural network or deep learning to detect news-relevant tweets from the stream. The second step is to apply a novel streaming data clustering algorithm to the detected news tweets to form news events. The third and final step is to rank the detected events based on the size of the event clusters and growth speed of the tweet frequencies. We evaluate the proposed system on a large, publicly available corpus of annotated news events from Twitter. As part of the evaluation, we compare our approach with a related state-of-theart solution. Overall, our experiments and user-based evaluation show that our approach on detecting current (real) news events delivers a state-of-the-art performance.
引用
收藏
页码:695 / 723
页数:29
相关论文
共 50 条
  • [21] Exploring the Interactions of Storylines from Informative News Events
    Po Hu
    Min-Lie Huang
    Xiao-Yan Zhu
    Journal of Computer Science and Technology, 2014, 29 : 502 - 518
  • [22] Exploring the Interactions of Storylines from Informative News Events
    Hu, Po
    Huang, Min-Lie
    Zhu, Xiao-Yan
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2014, 29 (03) : 502 - 518
  • [23] TKES: A Novel System for Extracting Trendy Keywords from Online News Sites
    Tham Vo
    Phuc Do
    Journal of the Operations Research Society of China, 2022, 10 : 801 - 816
  • [24] Using Machine Learning for Extracting Information from Natural Disaster News Reports
    Tellez Valero, Alberto
    Montes y Gomez, Manuel
    Villasenor Pineda, Luis
    COMPUTACION Y SISTEMAS, 2009, 13 (01): : 33 - 44
  • [25] TKES: A Novel System for Extracting Trendy Keywords from Online News Sites
    Vo, Tham
    Do, Phuc
    JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF CHINA, 2022, 10 (04) : 801 - 816
  • [26] Predicting Stock Trends Based on News Events
    Zhang M.
    Du W.
    Zheng N.
    Data Analysis and Knowledge Discovery, 2019, 3 (05) : 11 - 18
  • [27] An ensemble method for extracting adverse drug events from social media
    Liu, Jing
    Zhao, Songzheng
    Zhang, Xiaodi
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2016, 70 : 62 - 76
  • [28] Extracting Physical Events from Digital Chatter for Covid-19
    Nagapudi, Vikram
    Agrawal, Ameeta
    Bulusu, Nirupama
    2021 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2021), 2021, : 398 - 400
  • [29] Extracting and Displaying Temporal and Geospatial Entities from Articles on Historical Events
    Chasin, Rachel
    Woodward, Daryl
    Witmer, Jeremy
    Kalita, Jugal
    COMPUTER JOURNAL, 2014, 57 (03) : 403 - 426
  • [30] An Efficient Method for Extracting Web News Content
    Sun, Jian
    Tang, Luyang
    Liao, Dan
    Chang, Victor
    2017 INTERNATIONAL CONFERENCE ON ENGINEERING AND TECHNOLOGY (ICET), 2017,