Sentiment classification for insider threat identification using metaheuristic optimized machine learning classifiers

被引:1
|
作者
Mladenovic, Djordje [1 ]
Antonijevic, Milos [2 ]
Jovanovic, Luka [2 ]
Simic, Vladimir [3 ,4 ,5 ]
Zivkovic, Miodrag [2 ]
Bacanin, Nebojsa [2 ,6 ,7 ]
Zivkovic, Tamara [8 ]
Perisic, Jasmina [2 ]
机构
[1] ICT Coll Vocat Studies, Belgrade 11000, Serbia
[2] Singidunum Univ, Fac Informat & Comp, Belgrade 11000, Serbia
[3] Univ Belgrade, Fac Transport & Traff Engn, Vojvode Stepe 305, Belgrade 11010, Serbia
[4] Yuan Ze Univ, Coll Engn, Dept Ind Engn & Management, Taoyuan City 320315, Taiwan
[5] Korea Univ, Coll Informat, Dept Comp Sci & Engn, Seoul 02841, South Korea
[6] SIMATS, Saveetha Sch Engn, Dept Math, Chennai 602105, Tamilnadu, India
[7] Middle East Univ, MEU Res Unit, Amman 11831, Jordan
[8] Univ Belgrade, Sch Elect Engn, Belgrade 11000, Serbia
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Insider threat; Natural language processing; Hyperparameter optimization; XGBoost; AdaBoost;
D O I
10.1038/s41598-024-77240-w
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study examines the formidable and complex challenge of insider threats to organizational security, addressing risks such as ransomware incidents, data breaches, and extortion attempts. The research involves six experiments utilizing email, HTTP, and file content data. To combat insider threats, emerging Natural Language Processing techniques are employed in conjunction with powerful Machine Learning classifiers, specifically XGBoost and AdaBoost. The focus is on recognizing the sentiment and context of malicious actions, which are considered less prone to change compared to commonly tracked metrics like location and time of access. To enhance detection, a term frequency-inverse document frequency-based approach is introduced, providing a more robust, adaptable, and maintainable method. Moreover, the study acknowledges the significant impact of hyperparameter selection on classifier performance and employs various contemporary optimizers, including a modified version of the red fox optimization algorithm. The proposed approach undergoes testing in three simulated scenarios using a public dataset, showcasing commendable outcomes.
引用
收藏
页数:39
相关论文
共 50 条
  • [21] Thumbs up? Sentiment classification using machine learning techniques
    Pang, B
    Lee, L
    Vaithyanathan, S
    PROCEEDINGS OF THE 2002 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2002, : 79 - 86
  • [22] Sentiment Analysis and Classification of Restaurant Reviews using Machine Learning
    Zahoor, Kanwal
    Bawany, Narmeen Zakaria
    Hamid, Soomaiya
    2020 21ST INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2020,
  • [23] Insider threat detection using supervised machine learning algorithmsInsider threat detection using supervised machine learning algorithmsP. Manoharan et al.
    Phavithra Manoharan
    Jiao Yin
    Hua Wang
    Yanchun Zhang
    Wenjie Ye
    Telecommunication Systems, 2024, 87 (4) : 899 - 915
  • [24] Insider Threat Detection Using Supervised Machine Learning Algorithms on an Extremely Imbalanced Dataset
    Sheykhkanloo, Naghmeh Moradpoor
    Hall, Adam
    INTERNATIONAL JOURNAL OF CYBER WARFARE AND TERRORISM, 2020, 10 (02) : 1 - 26
  • [25] Handling imbalance dataset issue in insider threat detection using machine learning methods
    Jaiswal, Ayshwarya
    Dwivedi, Pragya
    Dewang, Rupesh Kumar
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 120
  • [26] Insider Threat Detection using Virtual Machine Introspection
    Crawford, Martin
    Peterson, Gilbert
    PROCEEDINGS OF THE 46TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, 2013, : 1821 - 1830
  • [27] Sentiment Classification for Film Reviews in Gujarati Text Using Machine Learning and Sentiment Lexicons
    Shah, Parita
    Swaminarayan, Priya
    Patel, Maitri
    JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2022, 17 (01) : 1 - 16
  • [28] Phishing Website Detection Using Machine Learning Classifiers Optimized by Feature Selection
    Mehanovic, Dzelila
    Kevric, Jasmin
    TRAITEMENT DU SIGNAL, 2020, 37 (04) : 563 - 569
  • [29] Sentiment classification on product reviews using machine learning and deep learning techniques
    Singh, Neha
    Jaiswal, Umesh Chandra
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2024, 15 (12) : 5726 - 5741
  • [30] Insider Threat Identification Using the Simultaneous Neural Learning of Multi-Source Logs
    Liu, Liu
    Chen, Chao
    Zhang, Jun
    De Vel, Olivier
    Xiang, Yang
    IEEE ACCESS, 2019, 7 : 183162 - 183176