Partial Undersampling of Imbalanced Data for Cyber Threats Detection

被引:6
|
作者
Moniruzzaman, Md [1 ]
Bagirov, A. M. [1 ]
Gondal, Iqbal [2 ]
机构
[1] Federat Univ Australia, Ballarat, Vic, Australia
[2] Internet Commerce Secur Lab ICSL, Ballarat, Vic, Australia
关键词
Cyber threats; Supervised learning; Clustering; Imbalanced data; SMOTE;
D O I
10.1145/3373017.3373026
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Real-time detection of cyber threats is a challenging task in cyber security. With the advancement of technology and ease of access to the internet, more and more individuals and organizations are becoming the target for various cyber attacks such as malware, ransomware, spyware. The target of these attacks is to steal money or valuable information from the victims. Signature-based detection methods fail to keep up with the constantly evolving new threats. Machine learning based detection has drawn more attention of researchers due to its capability of detecting new and modified attacks based on previous attack's behaviour. The number of malicious activities in a certain domain is significantly low compared to the number of normal activities. Therefore, cyber threats detection data sets are imbalanced. In this paper, we proposed a partial undersampling method to deal with imbalanced data for detecting cyber threats.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] Undersampling with Support Vectors for Multi-Class Imbalanced Data Classification
    Krawczyk, Bartosz
    Bellinger, Colin
    Corizzo, Roberto
    Japkowicz, Nathalie
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [22] Neighbourhood-based undersampling approach for handling imbalanced and overlapped data
    Vuttipittayamongkol, Pattaramon
    Elyan, Eyad
    INFORMATION SCIENCES, 2020, 509 : 47 - 70
  • [23] PSU: Particle Stacking Undersampling Method for Highly Imbalanced Big Data
    Jeon, Yong-Seok
    Lim, Dong-Joon
    IEEE ACCESS, 2020, 8 : 131920 - 131927
  • [24] Fuzzy Distance-based Undersampling Technique for Imbalanced Flood Data
    Mahamud, Ku Ruhana Ku
    Zorkeflee, Maisarah
    Din, Aniza Mohamed
    PROCEEDINGS OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2016, 2016, : 509 - 513
  • [25] Efficient hybrid oversampling and intelligent undersampling for imbalanced big data classification
    Vairetti, Carla
    Assadi, Jose Luis
    Maldonado, Sebastian
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246
  • [26] A fuzzy rough set-based undersampling approach for imbalanced data
    Zhang, Xiao
    He, Zhaoqian
    Yang, Yanyan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (07) : 2799 - 2810
  • [27] CSMOUTE: Combined Synthetic Oversampling and Undersampling Technique for Imbalanced Data Classification
    Koziarski, Michal
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [28] Fast Attack Detection Method for Imbalanced Data in Industrial Cyber-Physical Systems
    Huang, Meng
    Li, Tao
    Li, Beibei
    Zhang, Nian
    Huang, Hanyuan
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2023, 13 (04) : 229 - 245
  • [29] Multiclass Classification for Cyber Threats Detection on Twitter
    Hussein, Adnan
    Almazro, Abdulwahab Ali
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (03): : 3853 - 3866
  • [30] A novel progressively undersampling method based on the density peaks sequence for imbalanced data
    Xie, Xiaoying
    Liu, Huawen
    Zeng, Shouzhen
    Lin, Lingbin
    Li, Wen
    KNOWLEDGE-BASED SYSTEMS, 2021, 213