Extracting news events from microblogs

被引:7
|
作者
Repp, Oystein [1 ]
Ramampiaro, Heri [1 ]
机构
[1] Norwegian Univ Sci & Technol, Dept Comp Sci, Trondheim, Norway
关键词
Text mining; Deep Learning; Word Embedding; Information Extraction; Event Detection; Social Media;
D O I
10.1080/09720510.2018.1486273
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Twitter stream has become a large source of information, but the magnitude of tweets posted and the noisy nature of its content makes harvesting of knowledge from Twitter has challenged researchers for long time. Aiming at overcoming some of the main challenges of extracting hidden information from tweet streams, this work proposes a new approach for real-time detection of news events from the Twitter stream. We divide our approach into three steps. The first step is to use a neural network or deep learning to detect news-relevant tweets from the stream. The second step is to apply a novel streaming data clustering algorithm to the detected news tweets to form news events. The third and final step is to rank the detected events based on the size of the event clusters and growth speed of the tweet frequencies. We evaluate the proposed system on a large, publicly available corpus of annotated news events from Twitter. As part of the evaluation, we compare our approach with a related state-of-theart solution. Overall, our experiments and user-based evaluation show that our approach on detecting current (real) news events delivers a state-of-the-art performance.
引用
收藏
页码:695 / 723
页数:29
相关论文
共 50 条
  • [41] Extracting multiple news attributes based on visual features
    Liu, Wei
    Yan, Hualiang
    Xiao, Jianguo
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2012, 38 (02) : 465 - 486
  • [42] Bank distress in the news: Describing events through deep learning
    Ronnqvist, Samuel
    Sarlin, Peter
    NEUROCOMPUTING, 2017, 264 : 57 - 70
  • [43] Extracting Events from Web Documents for Social Media Monitoring Using Structured SVM
    Choi, Yoonjae
    Ryu, Pum-Mo
    Kim, Hyunki
    Lee, Changki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (06) : 1410 - 1414
  • [44] EXTRACTING BIO-MOLECULAR EVENTS FROM LITERATUREuTHE BIONLP'09 SHARED TASK
    Kim, Jin-Dong
    Ohta, Tomoko
    Pyysalo, Sampo
    Kano, Yoshinobu
    Tsujii, Jun'ichi
    COMPUTATIONAL INTELLIGENCE, 2011, 27 (04) : 513 - 540
  • [45] Mining Financial Risk Events from News and Assessing their Impact on Stocks
    Bhadani, Saumya
    Verma, Ishan
    Dey, Lipika
    MINING DATA FOR FINANCIAL APPLICATIONS, 2020, 11985 : 85 - 100
  • [46] BIGRAM-BASED FEATURES FOR REAL-WORLD EVENT IDENTIFICATION FROM MICROBLOGS
    Samant, Surender Singh
    Murthy, N. L. Bhanu
    Malapati, Aruna
    2017 8TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2017,
  • [47] Overview of CLEF 2019 Lab ProtestNews: Extracting Protests from News in a Cross-Context Setting
    Hurriyetoglu, Ali
    Yoruk, Erdem
    Yuret, Deniz
    Yoltar, Cagri
    Gurel, Burak
    Durusan, Firat
    Mutlu, Osman
    Akdemir, Arda
    EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION (CLEF 2019), 2019, 11696 : 425 - 432
  • [48] Discovering News Events that Move Markets
    Gurin, Yuriy
    Szymanski, Terrence
    Keane, Mark T.
    PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2017, : 452 - 461
  • [49] Lifecycle-Based Event Detection from Microblogs
    Mu, Lin
    Jin, Peiquan
    Zheng, Lizhou
    Chen, En-Hong
    Yue, Lihua
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 283 - 290
  • [50] Classifying and Summarizing Information from Microblogs During Epidemics
    Rudra, Koustav
    Sharma, Ashish
    Ganguly, Niloy
    Imran, Muhammad
    INFORMATION SYSTEMS FRONTIERS, 2018, 20 (05) : 933 - 948