Sentiment classification for insider threat identification using metaheuristic optimized machine learning classifiers

被引:1
|
作者
Mladenovic, Djordje [1 ]
Antonijevic, Milos [2 ]
Jovanovic, Luka [2 ]
Simic, Vladimir [3 ,4 ,5 ]
Zivkovic, Miodrag [2 ]
Bacanin, Nebojsa [2 ,6 ,7 ]
Zivkovic, Tamara [8 ]
Perisic, Jasmina [2 ]
机构
[1] ICT Coll Vocat Studies, Belgrade 11000, Serbia
[2] Singidunum Univ, Fac Informat & Comp, Belgrade 11000, Serbia
[3] Univ Belgrade, Fac Transport & Traff Engn, Vojvode Stepe 305, Belgrade 11010, Serbia
[4] Yuan Ze Univ, Coll Engn, Dept Ind Engn & Management, Taoyuan City 320315, Taiwan
[5] Korea Univ, Coll Informat, Dept Comp Sci & Engn, Seoul 02841, South Korea
[6] SIMATS, Saveetha Sch Engn, Dept Math, Chennai 602105, Tamilnadu, India
[7] Middle East Univ, MEU Res Unit, Amman 11831, Jordan
[8] Univ Belgrade, Sch Elect Engn, Belgrade 11000, Serbia
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Insider threat; Natural language processing; Hyperparameter optimization; XGBoost; AdaBoost;
D O I
10.1038/s41598-024-77240-w
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study examines the formidable and complex challenge of insider threats to organizational security, addressing risks such as ransomware incidents, data breaches, and extortion attempts. The research involves six experiments utilizing email, HTTP, and file content data. To combat insider threats, emerging Natural Language Processing techniques are employed in conjunction with powerful Machine Learning classifiers, specifically XGBoost and AdaBoost. The focus is on recognizing the sentiment and context of malicious actions, which are considered less prone to change compared to commonly tracked metrics like location and time of access. To enhance detection, a term frequency-inverse document frequency-based approach is introduced, providing a more robust, adaptable, and maintainable method. Moreover, the study acknowledges the significant impact of hyperparameter selection on classifier performance and employs various contemporary optimizers, including a modified version of the red fox optimization algorithm. The proposed approach undergoes testing in three simulated scenarios using a public dataset, showcasing commendable outcomes.
引用
收藏
页数:39
相关论文
共 50 条
  • [1] Dimensionality Reduction for Sentiment Classification using Machine Learning Classifiers
    Islam, Mazharul
    Anjum, Aftab
    Ahsan, Tanveer
    Wang, Lin
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 3097 - 3103
  • [2] Insider Threat Detection Using Machine Learning Approach
    Sarhan, Bushra Bin
    Altwaijry, Najwa
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [3] Drug Sentiment Analysis using Machine Learning Classifiers
    Uddin, Mohammed Nazim
    Bin Hafiz, Md Ferdous
    Hossain, Sohrab
    Islam, Shah Mohammad Mominul
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (01) : 92 - 100
  • [4] Optimization of sentiment analysis using machine learning classifiers
    Singh, Jaspreet
    Singh, Gurvinder
    Singh, Rajinder
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2017, 7
  • [5] Breast cancer detection and classification using metaheuristic optimized ensemble extreme learning machine
    Pattnaik R.K.
    Siddique M.
    Mishra S.
    Gelmecha D.J.
    Singh R.S.
    Satapathy S.
    International Journal of Information Technology, 2023, 15 (8) : 4551 - 4563
  • [6] Machine Learning for Insider Threat Detection
    Naicker, Tristan Sashlen
    van Niekerk, Brett
    PROCEEDINGS OF THE 3RD EUROPEAN CONFERENCE ON THE IMPACT OF ARTIFICIAL INTELLIGENCE AND ROBOTICS (ECIAIR 2021), 2021, : 122 - 131
  • [7] Insider threat detection using supervised machine learning algorithms
    Manoharan, Phavithra
    Yin, Jiao
    Wang, Hua
    Zhang, Yanchun
    Ye, Wenjie
    TELECOMMUNICATION SYSTEMS, 2024, 87 (04) : 899 - 915
  • [8] Sentiment Analysis Using Machine Learning Classifiers: Evaluation of Performance
    Rai, Shamantha B.
    Shetty, Sweekriti M.
    Rai, Prakhyath
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2019), 2019, : 21 - 25
  • [9] Classification of Sentiment Analysis Using Machine Learning
    Parikh, Satyen M.
    Shah, Mitali K.
    INNOVATIVE DATA COMMUNICATION TECHNOLOGIES AND APPLICATION, 2020, 46 : 76 - 86
  • [10] Sentiment identification on Twitter using machine learning
    Morales-Castro, Wendy
    Careta, Eduardo Perez
    Rayas, Angelica Hernandez
    Mukhopadhyay, Tirtha Prasad
    Crespo, J. Armando Perez
    Cabrera, Rafael Guzman
    2022 EURO-ASIA CONFERENCE ON FRONTIERS OF COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, FCSIT, 2022, : 28 - 31