Sentiment classification for insider threat identification using metaheuristic optimized machine learning classifiers

Cited by: 1
Authors
Mladenovic, Djordje [1]
Antonijevic, Milos [2]
Jovanovic, Luka [2]
Simic, Vladimir [3, 4, 5]
Zivkovic, Miodrag [2]
Bacanin, Nebojsa [2, 6, 7]
Zivkovic, Tamara [8]
Perisic, Jasmina [2]
Affiliations
[1] ICT Coll Vocat Studies, Belgrade 11000, Serbia
[2] Singidunum Univ, Fac Informat & Comp, Belgrade 11000, Serbia
[3] Univ Belgrade, Fac Transport & Traff Engn, Vojvode Stepe 305, Belgrade 11010, Serbia
[4] Yuan Ze Univ, Coll Engn, Dept Ind Engn & Management, Taoyuan City 320315, Taiwan
[5] Korea Univ, Coll Informat, Dept Comp Sci & Engn, Seoul 02841, South Korea
[6] SIMATS, Saveetha Sch Engn, Dept Math, Chennai 602105, Tamilnadu, India
[7] Middle East Univ, MEU Res Unit, Amman 11831, Jordan
[8] Univ Belgrade, Sch Elect Engn, Belgrade 11000, Serbia
Source
SCIENTIFIC REPORTS | 2024 / Volume 14 / Issue 01
Keywords
Insider threat; Natural language processing; Hyperparameter optimization; XGBoost; AdaBoost;
DOI
10.1038/s41598-024-77240-w
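Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract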
This study examines the formidable and complex challenge of insider threats to organizational security, addressing risks such as ransomware incidents, data breaches, and extortion attempts. The research involves six experiments utilizing email, HTTP, and file content data. To combat insider threats, emerging Natural Language Processing techniques are employed in conjunction with powerful Machine Learning classifiers, specifically XGBoost and AdaBoost. The focus is on recognizing the sentiment and context of malicious actions, which are considered less prone to change compared to commonly tracked metrics like location and time of access. To enhance detection, a term frequency-inverse document frequency-based approach is introduced, providing a more robust, adaptable, and maintainable method. Moreover, the study acknowledges the significant impact of hyperparameter selection on classifier performance and employs various contemporary optimizers, including a modified version of the red fox optimization algorithm. The proposed approach undergoes testing in three simulated scenarios using a public dataset, showcasing commendable outcomes.
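The abstract names a term frequency-inverse document frequency (TF-IDF) representation but does not give the weighting variant, tokenization, or feature settings. As a minimal sketch of the general idea only, the block below computes textbook TF-IDF weights (relative term frequency times the natural log of N over document frequency) in plain Python. The function name `tfidf` and the toy messages are hypothetical and are not taken from the paper or its dataset.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return one {term: weight} dict per document, using tf * ln(N / df)."""
    # Assumption: lowercase whitespace tokenization; the paper's
    # preprocessing is not described in this record.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical toy messages for illustration only.
docs = [
    "please send the quarterly report",
    "send the password file tonight",
    "the meeting is moved to friday",
]
vectors = tfidf(docs)
```

Terms that occur in every message (here "the") receive weight zero, while terms confined to a single message (such as "password") receive the largest weights. Vectors of this kind could then be passed to a classifier such as XGBoost or AdaBoost, whose hyperparameters the study tunes with metaheuristic optimizers.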
Pages: 39
Related papers
50 in total
  • [31] Classification of Neurodegenerative Disease Stages using Ensemble Machine Learning Classifiers
    Rohini, M.
    Surendran, D.
    2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING ICRTAC -DISRUP - TIV INNOVATION , 2019, 2019, 165 : 66 - 73
  • [32] Email Spam Classification and Detection using Various Machine Learning Classifiers
    Saraswathi, N.
    Pradeep, S.
    Sathiyavathi, V.
    Sabitha, K.
    Kambattan, K. Rajesh
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [33] Malicious Insider Threat Detection Using Sentiment Analysis of Social Media Topics
    Kenny, Matt
    Pitropakis, Nikolaos
    Sayeed, Sarwar
    Chrysoulas, Christos
    Mylonas, Alexios
    ICT SYSTEMS SECURITY AND PRIVACY PROTECTION, SEC 2024, 2024, 710 : 264 - 278
  • [34] Onto-based sentiment classification using Machine Learning Techniques
    Saranya, K.
    Jayanthy, S.
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [35] ChatGPT Tweets Sentiment Analysis Using Machine Learning and Data Classification
    Sabir A.
    Ali H.A.
    Aljabery M.A.
    Informatica (Slovenia), 2024, 48 (07): : 103 - 112
  • [36] Classification of Sentiment Reviews for Indian Railways Using Machine Learning Methods
    Bagga, Manju
    Aggarwa, Ritu
    Arora, Nitika
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 : 171 - 177
  • [37] Twitter Sentiment Classification Using Machine Learning Techniques for Stock Markets
    Qasem, Mohammed
    Thulasiram, Ruppa
    Thulasiram, Parimala
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 834 - 840
  • [38] Sentiment Analysis for Arabic Reviews using Machine Learning Classification Algorithms
    Sayed, Awny A.
    Elgeldawi, Enas
    Zaki, Alaa M.
    Galal, Ahmed R.
    PROCEEDINGS OF 2020 INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN COMMUNICATION AND COMPUTER ENGINEERING (ITCE), 2020, : 56 - 63
  • [39] Heart disease classification using optimized Machine learning algorithms
    Kadhim M.A.
    Radhi A.M.
    Iraqi Journal for Computer Science and Mathematics, 2023, 4 (02): : 31 - 42
  • [40] An Optimized Framework for Breast Cancer Classification Using Machine Learning
    Michael, Epimack
    Ma, He
    Li, Hong
    Qi, Shouliang
    BIOMED RESEARCH INTERNATIONAL, 2022, 2022