A comprehensive investigation of clustering algorithms for User and Entity Behavior Analytics

被引:2
|
作者
Artioli, Pierpaolo [1 ]
Maci, Antonio [1 ]
Magri, Alessio [1 ]
机构
[1] BV TECH SpA, Cybersecur Lab, Milan, Italy
来源
FRONTIERS IN BIG DATA | 2024年 / 7卷
关键词
clustering; data analytics; machine learning; UEBA; unsupervised learning; BIG DATA;
D O I
10.3389/fdata.2024.1375818
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Introduction Government agencies are now encouraging industries to enhance their security systems to detect and respond proactively to cybersecurity incidents. Consequently, equipping with a security operation center that combines the analytical capabilities of human experts with systems based on Machine Learning (ML) plays a critical role. In this setting, Security Information and Event Management (SIEM) platforms can effectively handle network-related events to trigger cybersecurity alerts. Furthermore, a SIEM may include a User and Entity Behavior Analytics (UEBA) engine that examines the behavior of both users and devices, or entities, within a corporate network.Methods In recent literature, several contributions have employed ML algorithms for UEBA, especially those based on the unsupervised learning paradigm, because anomalous behaviors are usually not known in advance. However, to shorten the gap between research advances and practice, it is necessary to comprehensively analyze the effectiveness of these methodologies. This paper proposes a thorough investigation of traditional and emerging clustering algorithms for UEBA, considering multiple application contexts, i.e., different user-entity interaction scenarios.Results and discussion Our study involves three datasets sourced from the existing literature and fifteen clustering algorithms. Among the compared techniques, HDBSCAN and DenMune showed promising performance on the state-of-the-art CERT behavior-related dataset, producing groups with a density very close to the number of users.
引用
收藏
页数:25
相关论文
共 50 条
  • [21] Applying Google Analytics to Understand User Behavior and Smartphone Applications
    Wartman, Elicia C.
    Galvez, Jorge A.
    Simpao, Allan F.
    Richards, Jody M.
    Lockman, Justin
    Schleelein, Laura
    Rehman, Mohammed
    ANESTHESIA AND ANALGESIA, 2014, 119
  • [22] Data Privacy Enhancing in the IoT User/Device Behavior Analytics
    Li, Shancang
    Zhao, Shanshan
    Gope, Prosanta
    Xu, Li Da
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2023, 19 (02)
  • [23] User and Entity Behavior Analysis under Urban Big Data
    Tian, Zhihong
    Luo, Chaochao
    Lu, Hui
    Su, Shen
    Sun, Yanbin
    Zhang, Man
    ACM/IMS Transactions on Data Science, 2020, 1 (03):
  • [24] Churn Prediction analysis Using Various Clustering Algorithms in KNIME Analytics Platform
    Franciska, I.
    Swaminathan, B.
    2017 IEEE 3RD INTERNATIONAL CONFERENCE ON SENSING, SIGNAL PROCESSING AND SECURITY (ICSSS), 2017, : 166 - 170
  • [25] Comprehensive analysis of clustering algorithms: exploring limitations and innovative solutions
    Wani, Aasim Ayaz
    PeerJ Computer Science, 2024, 10 : 1 - 45
  • [26] Comprehensive analysis of clustering algorithms: exploring limitations and innovative solutions
    Wani, Aasim Ayaz
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [27] Study of the Clustering Result Based on User Behavior Feedback
    Rui, Li
    Jian, Li
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2017), 2017, : 371 - 375
  • [28] Modeling Evolving User Behavior via Sequential Clustering
    Boeva, Veselka
    Nordahl, Christian
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 1168 : 12 - 20
  • [29] Research on User Clustering Algorithm Based on Software System User Behavior Trajectory
    Tang, Yanping
    Ma, Bingxian
    Wu, Zheng
    PROCEEDINGS OF 2019 2ND INTERNATIONAL CONFERENCE ON BIG DATA TECHNOLOGIES (ICBDT 2019), 2019, : 11 - 14
  • [30] Prediction of user's behavior based on matrix clustering
    Peng, Yu-Qing
    Xiao, Guo-Xi
    Lin, Tao
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1343 - +