Automatic Detection of Clickbait Headlines Using Semantic Analysis and Machine Learning Techniques

被引:11
|
作者
Bronakowski, Mark [1 ]
Al-khassaweneh, Mahmood [1 ]
Al Bataineh, Ali [2 ]
机构
[1] Lewis Univ, Comp & Math Sci, 1Engineering, Romeoville, IL 60446 USA
[2] Norwich Univ, 2Department Elect & Comp Engn, Northfield, VT 05663 USA
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 04期
关键词
clickbait; classification; machine learning; semantic analysis;
D O I
10.3390/app13042456
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Clickbait headlines are misleading headiness designed to attract attention and entice users to click on the link. Links can host malware, trojans and phishing attacks. Clickbaiting is one of the more subtle methods used by hackers and scammers. For these reasons, clickbait is a serious issue that must be addressed. This paper presents a method for identifying clickbait headlines using semantic analysis and machine learning techniques. The method involves analyzing thirty unique semantic features and exploring six different machine learning classification algorithms individually and in ensemble forms. Results show that the top models have an accuracy of 98% in classifying clickbait headlines. The proposed models can serve as a template for developing practical applications to detect clickbait headlines automatically.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Automatic Semantic Categorization of News Headlines using Ensemble Machine Learning: A Comparative Study
    Bogery, Raghad
    Al Babtain, Nora
    Aslam, Nida
    Alkabour, Nada
    Al Hashim, Yara
    Khan, Irfan Ullah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (11) : 689 - 696
  • [2] Sarcasm Text Detection on News Headlines Using Novel Hybrid Machine Learning Techniques
    Singh, Neha
    ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL, 2024, 13
  • [3] Experimental Evaluation of Clickbait Detection Using Machine Learning Models
    Ahmad, Iftikhar
    Alqarni, Mohammed A.
    Almazroi, Abdulwahab Ali
    Tariq, Abdullah
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2020, 26 (06): : 1335 - 1344
  • [4] Semi-automatic semantic annotation of images using machine learning techniques
    Marques, O
    Barman, N
    SEMANTIC WEB - ISWC 2003, 2003, 2870 : 550 - 565
  • [5] Automatic detection of rock boundaries using a hybrid recurrence quantification analysis and machine learning techniques
    Keyumars Anvari
    Amin Mousavi
    Ahmad Reza Sayadi
    Ewan Sellers
    Ebrahim F. Salmi
    Bulletin of Engineering Geology and the Environment, 2022, 81
  • [6] Automatic detection of rock boundaries using a hybrid recurrence quantification analysis and machine learning techniques
    Anvari, Keyumars
    Mousavi, Amin
    Sayadi, Ahmad Reza
    Sellers, Ewan
    Salmi, Ebrahim F.
    BULLETIN OF ENGINEERING GEOLOGY AND THE ENVIRONMENT, 2022, 81 (10)
  • [7] Clickbait Detection using Deep Learning
    Agrawal, Amol
    PROCEEDINGS ON 2016 2ND INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING TECHNOLOGIES (NGCT), 2016, : 268 - 272
  • [8] Semantic Analysis of NIH Stroke Scale using Machine Learning Techniques
    Yu, Jaehak
    Kim, Damee
    Park, Hongkyu
    Chon, Seung-chul
    Cho, Kang Hee
    Kim, Sun-Jin
    Yu, Sungkyu
    Park, Sejin
    Hong, Seunghee
    2019 INTERNATIONAL CONFERENCE ON PLATFORM TECHNOLOGY AND SERVICE (PLATCON), 2019, : 82 - 86
  • [9] Clickbait Pattern Detection and Classification of News Headlines using Natural Language Processing
    Manjesh, Suraj
    Kanakagiri, Tushar
    Vaishak, P.
    Chettiar, Vivek
    Shobha, G.
    2017 2ND INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS AND INFORMATION TECHNOLOGY FOR SUSTAINABLE SOLUTION (CSITSS-2017), 2017, : 153 - 158
  • [10] Clickbait detection using multiple categorisation techniques
    Pujahari, Abinash
    Sisodia, Dilip Singh
    JOURNAL OF INFORMATION SCIENCE, 2021, 47 (01) : 118 - 128