Analysis of Online News Coverage on Earthquakes Through Text Mining

Cited by: 4
Authors
Camilleri, Stephen [1]
Agius, Matthew R. [2, 3]
Azzopardi, Joel [1]
Affiliations
[1] Univ Malta, Dipartiment Intelligenza Artificjali, Fak Teknol Informat & Komunikazzjoni, Msida, Malta
[2] Univ Malta, Fak Xjenza, Dipartiment Geoxjenza, Msida, Malta
[3] Univ Roma Tre, Dipartimento Sci, Rome, Italy
Funding
European Union Horizon 2020
Keywords
big data and analytics; information extraction; earthquakes; news agencies; online news analysis; macroseismic data; social media; information
DOI
10.3389/feart.2020.00141
Chinese Library Classification number
P [Astronomy, Earth Sciences]
Discipline classification code
07
Abstract
News agencies work around the clock to report critical news such as earthquakes. We investigate the relationship between online news articles and seismic events that happen around the world in real time. We use text mining tools to automatically harvest, identify, cluster and extract information from earthquake-related reports, and carry out cross-validation on the mined information. Earthquake parameters retrieved from the United States Geological Survey (USGS) Application Programming Interface (API) are organized into earthquake events, with each event consisting of the daily earthquake readings from a particular geographical location. The results are then visualized on a user-friendly dashboard. During a 1-year study between 2018 and 2019, we processed 268,182 news reports published by 23 news agencies from different parts of the world and 14,717 earthquakes of magnitude 4 to 8.2 listed in the bulletin. Of the analyzed articles, 1.25% contained the word "quake" and 0.4% were clustered and then mapped to an earthquake event. Using multilingual news sources from 16 countries (6 languages) reduces the potential news bias that comes from relying on English-language reports only. Mapping articles to an earthquake catalog helps verify earthquake reports and determine relationships. We find that the earthquakes reported in the news are those that occur on or very close to land. We propose a general relationship between the number of news agencies, the earthquake magnitude and the anticipated number of published articles. News reports tend to mention higher earthquake magnitudes than those in the USGS earthquake catalog, and reporting on an earthquake can last from a few days to a couple of weeks after the event.
Pages: 12
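The abstract describes two steps that are easy to illustrate: organizing USGS readings into events made up of the daily readings from one geographical location, and flagging articles that contain the word "quake". The Python sketch below is one possible reading of those steps, not the authors' implementation. It assumes readings arrive in the USGS GeoJSON feature layout (`properties.mag`, `properties.time` in milliseconds since the epoch, `geometry.coordinates` as longitude, latitude, depth). The 1-degree grid cell used as the "location", the function names, and the sample records are all hypothetical choices made for illustration.

```python
from collections import defaultdict
from datetime import datetime, timezone


def group_into_events(features, cell_deg=1.0):
    """Group readings by (UTC date, lat/lon grid cell).

    The grid cell is an assumed stand-in for "a particular
    geographical location"; the abstract does not define it.
    """
    events = defaultdict(list)
    for feature in features:
        lon, lat = feature["geometry"]["coordinates"][:2]
        millis = feature["properties"]["time"]
        day = datetime.fromtimestamp(millis / 1000, tz=timezone.utc).date()
        cell = (int(lat // cell_deg), int(lon // cell_deg))
        events[(day.isoformat(), cell)].append(feature["properties"]["mag"])
    return dict(events)


def mentions_quake(text):
    """Case-insensitive check for the substring "quake"."""
    return "quake" in text.lower()


def _reading(mag, lon, lat, when):
    """Build one made-up reading in GeoJSON feature form."""
    return {
        "properties": {"mag": mag, "time": int(when.timestamp() * 1000)},
        "geometry": {"coordinates": [lon, lat, 10.0]},
    }


# Made-up sample readings, for illustration only.
sample = [
    _reading(7.5, 119.8, -0.2, datetime(2018, 9, 28, 10, 2, tzinfo=timezone.utc)),
    _reading(6.1, 119.9, -0.3, datetime(2018, 9, 28, 7, 0, tzinfo=timezone.utc)),
    _reading(5.0, 142.0, 38.0, datetime(2018, 9, 29, 3, 30, tzinfo=timezone.utc)),
]
events = group_into_events(sample)

# Made-up headlines, for illustration only.
headlines = ["Earthquake strikes coastal town", "Election results announced"]
flagged = [h for h in headlines if mentions_quake(h)]
```

With this grouping, the two readings from the same day and the same grid cell fall into a single event, while the third reading forms its own event. A fixed grid splits nearby readings that straddle a cell boundary, so a distance-based grouping may be a better fit in practice.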