Data Science, Machine learning and big data in Digital Journalism: A survey of state-of-the-art, challenges and opportunities

被引:13
作者
Fernandes, Elizabeth [1 ]
Moro, Sergio [2 ]
Cortez, Paulo [3 ]
机构
[1] ISTAR, ISCTE Inst Univ Lisboa ISCTE IUL, Ave Forcas Armadas,Edificio II,D615, P-1649026 Lisbon, Portugal
[2] ISTAR, Inst Univ Lisboa ISCTE IUL, Lisbon, Portugal
[3] Univ Minho, ALGORITMI Res Ctr, Guimaraes, Portugal
关键词
Data science; Digital journalism; Text mining; Systematic literature review; Media analytics; Machine Learning; NEWS ARTICLE RECOMMENDATION; AUTOMATED JOURNALISM; SENTIMENT ANALYSIS; ONLINE; SYSTEM; INTELLIGENCE; POPULARITY; PERCEPTION; 3RD-PERSON; ALGORITHM;
D O I
10.1016/j.eswa.2023.119795
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Digital journalism has faced a dramatic change and media companies are challenged to use data science algo-rithms to be more competitive in a Big Data era. While this is a relatively new area of study in the media landscape, the use of machine learning and artificial intelligence has increased substantially over the last few years. In particular, the adoption of data science models for personalization and recommendation has attracted the attention of several media publishers. Following this trend, this paper presents a research literature analysis on the role of Data Science (DS) in Digital Journalism (DJ). Specifically, the aim is to present a critical literature review, synthetizing the main application areas of DS in DJ, highlighting research gaps, challenges, and op-portunities for future studies. Through a systematic literature review integrating bibliometric search, text min-ing, and qualitative discussion, the relevant literature was identified and extensively analyzed. The review reveals an increasing use of DS methods in DJ, with almost 47% of the research being published in the last three years. An hierarchical clustering highlighted six main research domains focused on text mining, event extraction, online comment analysis, recommendation systems, automated journalism, and exploratory data analysis along with some machine learning approaches. Future research directions comprise developing models to improve personalization and engagement features, exploring recommendation algorithms, testing new automated jour-nalism solutions, and improving paywall mechanisms.
引用
收藏
页数:20
相关论文
共 157 条
[1]   A study of literature in modular integrated construction - Critical review and future directions [J].
Abdelmageed, Sherif ;
Zayed, Tarek .
JOURNAL OF CLEANER PRODUCTION, 2020, 277
[2]   Snowball: Extracting Causal Chains from Climate Change Text Corpora [J].
Alashri, Saud ;
Tsai, Jiun-Yi ;
Koppela, Anvesh Reddy ;
Davulcu, Hasan .
2018 1ST INTERNATIONAL CONFERENCE ON DATA INTELLIGENCE AND SECURITY (ICDIS 2018), 2018, :234-241
[3]   Online Newsrooms as Communities of Practice: Exploring Digital Journalists' Applied Ethics [J].
Alberto Garcia-Aviles, Jose .
JOURNAL OF MASS MEDIA ETHICS, 2014, 29 (04) :258-272
[4]   Research trends on Big Data in Marketing: A text mining and topic modeling based literature analysis [J].
Amado, Alexandra ;
Cortez, Paulo ;
Rita, Paulo ;
Moro, Sergio .
EUROPEAN RESEARCH ON MANAGEMENT AND BUSINESS ECONOMICS, 2018, 24 (01) :1-7
[5]  
Antonio N., 2018, TOURISM MANAGEMENT S
[6]  
Antoun Wissam, 2020, 2020 IEEE International Conference on Informatics, IoT, and Enabling Technologies (ICIoT), P519, DOI 10.1109/ICIoT48696.2020.9089487
[7]   bibliometrix: An R-tool for comprehensive science mapping analysis [J].
Aria, Massimo ;
Cuccurullo, Corrado .
JOURNAL OF INFORMETRICS, 2017, 11 (04) :959-975
[8]  
Arrese Angel., 2016, Journalism Studies, V17, P1051, DOI https://doi.org/10.1080/1461670X.2015.1027788
[9]  
Attfield S., 2011, WSDM WORKSHOP USER M
[10]   Truth or Lie? Automatically Fact Checking News [J].
Azevedo, Lucas .
COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, :807-811