Data Analysis of the Web News Headlines based on Natural Language Processing

被引:0
|
作者
Karna, Hrvoje [1 ,2 ]
Braovic, Maja [3 ]
Vickovic, Linda [3 ]
Krstinic, Damir [3 ]
机构
[1] Minist Def Republ Croatia, Zagreb, Croatia
[2] Univ Split, Split, Croatia
[3] Univ Split, Fac Elect Engn Mech Engn & Naval Architecture, Dept Elect & Comp, Split, Croatia
关键词
data mining; information extraction; natural language processing; news portals; text analysis;
D O I
10.24138/jcomss-2023-0047
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
paper explores the problem of media content data analysis with the focus on the phenomenon of vaccination, closely related to the COVID-19 pandemic. The presented research is an extension of the previous work, but it differs in two main areas. Firstly, the text corpus submitted to the analysis has been considerably increased. Secondly, the previous data analysis was performed on the body part of the posts, while now it is focused on the most prominent part of the news posts, their headlines. This change from body to headline analysis was provoked by significant differences in their characteristics and the fact that most people read only headlines. Described data acquisition uses an advanced content collection approach followed by the modeling process, during which a set of natural language processing algorithms were applied. To enable the comparison, the model uses the same set of algorithms in the modeling phase like in previous work. The main contributions of the work are manifested in: i) approaching the problem from a new perspective, ii) applying more efficient method of data collection, and crucially iii) enabling the comparison of analysis results for individual parts of the content, which ensured a comprehensive insight into the characteristics of news posts.
引用
收藏
页码:158 / 167
页数:10
相关论文
共 50 条
  • [11] A Natural Language processing for semantic web services
    Stanojevic, M
    Vranes, S
    Eurocon 2005: The International Conference on Computer as a Tool, Vol 1 and 2 , Proceedings, 2005, : 229 - 232
  • [12] A Survey on Natural Language Processing for Fake News Detection
    Oshikawa, Ray
    Qian, Jing
    Wang, William Yang
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6086 - 6093
  • [13] News Media Analysis Using Focused Crawl and Natural Language Processing: Case of Lithuanian News Websites
    Krilavicius, Tomas
    Medelis, Zygimantas
    Kapociute-Dzikiene, Jurgita
    Zalandauskas, Tomas
    INFORMATION AND SOFTWARE TECHNOLOGIES, 2012, 319 : 48 - +
  • [14] Teanga: A Linked Data based platform for Natural Language Processing
    Ziad, Housam
    McCrae, John P.
    Buitelaar, Paul
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 2410 - 2415
  • [15] Using natural language processing technology for qualitative data analysis
    Crowston, Kevin
    Allen, Eileen E.
    Heckman, Robert
    INTERNATIONAL JOURNAL OF SOCIAL RESEARCH METHODOLOGY, 2012, 15 (06) : 523 - 543
  • [16] Applying Natural Language Processing Techniques to Generate Open Data Web APIs Documentation
    Gonzalez-Mora, Cesar
    Barros, Cristina
    Garrigos, Irene
    Zubcoff, Jose
    Lloret, Elena
    Mazon, Jose-Norberto
    WEB ENGINEERING, ICWE 2020, 2020, 12128 : 416 - 432
  • [17] News Analytical Toolkit: An Online Natural Language Processing Platform to Analyze News
    McCann, Ian
    Tahmassebi, Amirhessam
    Foo, Simon Y.
    Erlebacher, Gordon
    Meyer-Baese, Anke
    NEXT-GENERATION ANALYST VI, 2018, 10653
  • [18] Hot News Prediction Method Based on Natural Language Processing Technology and Its Application
    Bao, Yiqin
    Sun, Zhengtang
    Zhao, Qiang
    Lin, Tianya
    Zheng, Hao
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2022, 56 (01) : 83 - 94
  • [19] Identifying Fake News on Social Networks Based on Natural Language Processing: Trends and Challenges
    de Oliveira, Nicollas R.
    Pisa, Pedro S.
    Lopez, Martin Andreoni
    de Medeiros, Dianne Scherly V.
    Mattos, Diogo M. F.
    INFORMATION, 2021, 12 (01) : 1 - 32
  • [20] Hot News Prediction Method Based on Natural Language Processing Technology and Its Application
    Zhengtang Yiqin Bao
    Qiang Sun
    Tianya Zhao
    Hao Lin
    Automatic Control and Computer Sciences, 2022, 56 : 83 - 94