Quantifying Online News Media Coverage of the COVID-19 Pandemic: Text Mining Study and Resource

Cited by: 2
Authors
Krawczyk, Konrad [1]
Chelkowski, Tadeusz [2]
Laydon, Daniel J. [3]
Mishra, Swapnil [3]
Xifara, Denise [4]
Flaxman, Seth [4, 5]
Mellan, Thomas [3]
Schwammle, Veit [6]
Rottger, Richard [1]
Hadsund, Johannes T. [1]
Bhatt, Samir [3, 7]
Affiliations
[1] Univ Southern Denmark, Dept Math & Comp Sci, Campusvej 55, DK-5230 Odense, Denmark
[2] Kozminski Univ, Dept Management Network Soc, Warsaw, Poland
[3] Imperial Coll London, MRC Ctr Global Infect Dis Anal, Dept Infect Dis Epidemiol, London, England
[4] Nupinion, London, England
[5] Imperial Coll London, Dept Math, London, England
[6] Univ Southern Denmark, Dept Biochem & Mol Biol, Odense, Denmark
[7] Univ Copenhagen, Dept Publ Hlth, Sect Epidemiol, Copenhagen, Denmark
Funding
UK Medical Research Council
Keywords
text mining; COVID-19; infoveillance; sentiment analysis; public health; information
DOI
10.2196/28253
Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)]
Abstract
Background: Before the advent of an effective vaccine, nonpharmaceutical interventions, such as mask-wearing, social distancing, and lockdowns, have been the primary measures to combat the COVID-19 pandemic. Such measures are highly effective when there is high population-wide adherence, which requires information on current risks posed by the pandemic alongside a clear exposition of the rules and guidelines in place.
Objective: Here we analyzed online news media coverage of COVID-19. We quantified the total volume of COVID-19 articles, their sentiment polarization, and leading subtopics to act as a reference to inform future communication strategies.
Methods: We collected 26 million news articles from the front pages of 172 major online news sources in 11 countries (available online at SciRide). Using topic detection, we identified COVID-19-related content to quantify the proportion of total coverage the pandemic received in 2020. The sentiment analysis tool VADER was employed to stratify the emotional polarity of COVID-19 reporting. Further topic detection and sentiment analysis were performed on COVID-19 coverage to reveal the leading themes in pandemic reporting and their respective emotional polarizations.
Results: We found that COVID-19 coverage accounted for approximately 25.3% of all front-page online news articles between January and October 2020. Sentiment analysis of English-language sources revealed that overall COVID-19 coverage was not exclusively negatively polarized, suggesting wide heterogeneous reporting of the pandemic. Within this heterogeneous coverage, 16% of COVID-19 news articles (or 4% of all English-language articles) can be classified as highly negatively polarized, citing issues such as death, fear, or crisis.
Conclusions: The goal of COVID-19 public health communication is to increase understanding of distancing rules and to maximize the impact of governmental policy. The extent to which the quantity and quality of information from different communication channels (eg, social media, government pages, and news) influence public understanding of public health measures remains to be established. Here we conclude that a quarter of all reporting in 2020 covered COVID-19, which is indicative of information overload. In this capacity, our data and analysis form a quantitative basis for informing health communication strategies along traditional news media channels to minimize the risks of COVID-19 while vaccination is rolled out.
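The two headline proportions in the Results (16% of COVID-19 articles, equal to 4% of all English-language articles) are linked by a simple identity: the highly negative share of all articles equals the COVID-19 share multiplied by the highly negative share within COVID-19 coverage, which implies a COVID-19 share of roughly 0.04 / 0.16 = 25% among English-language sources. The following is a minimal sketch of that bookkeeping, not the study's pipeline. It assumes each article already carries a COVID-19 flag and a VADER-style compound score in [-1, 1]; the function name `coverage_shares`, the cutoff value, and the toy data are all hypothetical, since the abstract does not state the threshold used for "highly negatively polarized".

```python
from typing import Dict, Iterable, Tuple

# Hypothetical cutoff on a VADER-style compound score in [-1, 1].
# The abstract does not report the threshold the authors used.
HIGHLY_NEGATIVE_CUTOFF = -0.5


def coverage_shares(
    articles: Iterable[Tuple[bool, float]],
    cutoff: float = HIGHLY_NEGATIVE_CUTOFF,
) -> Dict[str, float]:
    """Compute coverage proportions from (is_covid, compound_score) pairs."""
    total = covid = covid_negative = 0
    for is_covid, compound in articles:
        total += 1
        if is_covid:
            covid += 1
            # Count only COVID-19 articles at or below the cutoff.
            if compound <= cutoff:
                covid_negative += 1
    if total == 0:
        raise ValueError("no articles supplied")
    return {
        "covid_share": covid / total,
        "negative_share_of_covid": covid_negative / covid if covid else 0.0,
        "negative_share_of_all": covid_negative / total,
    }


# Toy data invented for illustration; not drawn from the study's corpus.
sample = [
    (True, -0.8), (True, 0.3), (False, 0.1), (False, -0.9),
    (True, -0.2), (False, 0.6), (True, 0.0), (False, 0.4),
]
shares = coverage_shares(sample)
```

On the toy data, half of the articles are flagged as COVID-19 and one of those four falls below the cutoff, so the three shares satisfy the same product identity as the reported 16% and 4% figures.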
Pages: 15