Data Analysis of the Web News Headlines based on Natural Language Processing

Cited by: 0
Authors
Karna, Hrvoje [1, 2]
Braovic, Maja [3]
Vickovic, Linda [3]
Krstinic, Damir [3]
Affiliations
[1] Minist Def Republ Croatia, Zagreb, Croatia
[2] Univ Split, Split, Croatia
[3] Univ Split, Fac Elect Engn Mech Engn & Naval Architecture, Dept Elect & Comp, Split, Croatia
Keywords
data mining; information extraction; natural language processing; news portals; text analysis
DOI
10.24138/jcomss-2023-0047
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper explores the problem of media content data analysis, focusing on the phenomenon of vaccination, which is closely related to the COVID-19 pandemic. The presented research extends the authors' previous work but differs from it in two main respects. First, the text corpus submitted to the analysis has been considerably enlarged. Second, the previous analysis was performed on the body of the posts, whereas this one focuses on the most prominent part of news posts: their headlines. The change from body to headline analysis was motivated by significant differences in their characteristics and by the fact that most people read only headlines. The described data acquisition uses an advanced content collection approach, followed by a modeling process in which a set of natural language processing algorithms was applied. To enable the comparison, the modeling phase uses the same set of algorithms as the previous work. The main contributions of the work are: i) approaching the problem from a new perspective, ii) applying a more efficient method of data collection, and, crucially, iii) enabling the comparison of analysis results for individual parts of the content, which provides a comprehensive insight into the characteristics of news posts.
Pages: 158-167
Page count: 10