Data Analysis of the Web News Headlines based on Natural Language Processing

被引:0
作者
Karna, Hrvoje [1 ,2 ]
Braovic, Maja [3 ]
Vickovic, Linda [3 ]
Krstinic, Damir [3 ]
机构
[1] Minist Def Republ Croatia, Zagreb, Croatia
[2] Univ Split, Split, Croatia
[3] Univ Split, Fac Elect Engn Mech Engn & Naval Architecture, Dept Elect & Comp, Split, Croatia
关键词
data mining; information extraction; natural language processing; news portals; text analysis;
D O I
10.24138/jcomss-2023-0047
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
paper explores the problem of media content data analysis with the focus on the phenomenon of vaccination, closely related to the COVID-19 pandemic. The presented research is an extension of the previous work, but it differs in two main areas. Firstly, the text corpus submitted to the analysis has been considerably increased. Secondly, the previous data analysis was performed on the body part of the posts, while now it is focused on the most prominent part of the news posts, their headlines. This change from body to headline analysis was provoked by significant differences in their characteristics and the fact that most people read only headlines. Described data acquisition uses an advanced content collection approach followed by the modeling process, during which a set of natural language processing algorithms were applied. To enable the comparison, the model uses the same set of algorithms in the modeling phase like in previous work. The main contributions of the work are manifested in: i) approaching the problem from a new perspective, ii) applying more efficient method of data collection, and crucially iii) enabling the comparison of analysis results for individual parts of the content, which ensured a comprehensive insight into the characteristics of news posts.
引用
收藏
页码:158 / 167
页数:10
相关论文
共 50 条
  • [41] Performance Evaluation of Natural Language Processing Algorithms for Sentiment Analysis
    S. H. Annie Silviya
    S. Julia Faith
    R. Seetha
    M. Hemalatha
    SN Computer Science, 5 (6)
  • [42] Natural Language Processing Workflow for Customer Request Analysis in a Company
    Smirnov, Alexander
    Teslya, Nikolay
    Shilov, Nikolay
    Frank, Diethard
    Weidig, Dirk
    Minina, Elena
    Evers, Kathrin
    IFAC PAPERSONLINE, 2021, 54 (01): : 1206 - 1211
  • [43] Sentiment Analysis of Multilingual Tweets Based on Natural Language Processing (NLP)
    Bera, Abhijit
    Ghose, Mrinal Kanti
    Pal, Dibyendu Kumar
    INTERNATIONAL JOURNAL OF SYSTEM DYNAMICS APPLICATIONS, 2021, 10 (04)
  • [44] A Toolkit for Text Extraction and Analysis for Natural Language Processing Tasks
    Sefara, Tshephisho Joseph
    Mbooi, Mahlatse
    Mashile, Katlego
    Rambuda, Thompho
    Rangata, Mapitsi
    5TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, BIG DATA, COMPUTING AND DATA COMMUNICATION SYSTEMS (ICABCD2022), 2022,
  • [45] Collection and Automatic Analysis with Natural Language Processing on a Corpus of Andean Oral Literature Implemented on the Web
    Soria Solis, Ivan
    Castro Buleje, Carlos Yinmel
    Silvera Reynaga, Humberto
    Mamani Macedo, Mauro Felix
    Leon Soncco, Dionicia
    Mautino Guillen, Alejandro Giancarlo
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 4, INTELLISYS 2024, 2024, 1068 : 449 - 463
  • [46] Big data for Natural Language Processing: A streaming approach
    Agerri, Rodrigo
    Artola, Xabier
    Beloki, Zuhaitz
    Rigau, German
    Soroa, Aitor
    KNOWLEDGE-BASED SYSTEMS, 2015, 79 : 36 - 42
  • [47] Data Extraction by Using Natural Language Processing Tool
    More, Sujata D.
    Madankar, Mangala S.
    Chandak, M. B.
    HELIX, 2018, 8 (05): : 3846 - 3848
  • [48] Natural language processing data services for healthcare providers
    Yeung, Joshua Au
    Shek, Anthony
    Searle, Thomas
    Kraljevic, Zeljko
    Dinu, Vlad
    Ratas, Mart
    Al-Agil, Mohammad
    Foy, Aleksandra
    Rafferty, Barbara
    Oliynyk, Vitaliy
    Teo, James T.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [49] Data science in light of natural language processing: An overview
    Zeroual, Imad
    Lakhouaja, Abdelhak
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2017), 2018, 127 : 82 - 91
  • [50] Data augmentation approaches in natural language processing: A survey
    Li, Bohan
    Hou, Yutai
    Che, Wanxiang
    AI OPEN, 2022, 3 : 71 - 90