Partial Undersampling of Imbalanced Data for Cyber Threats Detection

被引:6
|
作者
Moniruzzaman, Md [1 ]
Bagirov, A. M. [1 ]
Gondal, Iqbal [2 ]
机构
[1] Federat Univ Australia, Ballarat, Vic, Australia
[2] Internet Commerce Secur Lab ICSL, Ballarat, Vic, Australia
关键词
Cyber threats; Supervised learning; Clustering; Imbalanced data; SMOTE;
D O I
10.1145/3373017.3373026
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Real-time detection of cyber threats is a challenging task in cyber security. With the advancement of technology and ease of access to the internet, more and more individuals and organizations are becoming the target for various cyber attacks such as malware, ransomware, spyware. The target of these attacks is to steal money or valuable information from the victims. Signature-based detection methods fail to keep up with the constantly evolving new threats. Machine learning based detection has drawn more attention of researchers due to its capability of detecting new and modified attacks based on previous attack's behaviour. The number of malicious activities in a certain domain is significantly low compared to the number of normal activities. Therefore, cyber threats detection data sets are imbalanced. In this paper, we proposed a partial undersampling method to deal with imbalanced data for detecting cyber threats.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Evolutionary Undersampling for Imbalanced Big Data Classification
    Triguero, I.
    Galar, M.
    Vluymans, S.
    Cornelis, C.
    Bustince, H.
    Herrera, F.
    Saeys, Y.
    2015 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2015, : 715 - 722
  • [2] Collaborative Detection of Cyber Security Threats in Big Data
    Zhang, Jiange
    Guo, Yuanbo
    Chen, Yue
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2019, 16 (02) : 186 - 193
  • [3] K-Nearest Neighbors Undersampling as Balancing Data for Cyber Troll Detection
    Luqyana, Wanda Athira
    Ahmadie, Beryl Labique
    Supianto, Ahmad Afif
    PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY (SIET 2019), 2019, : 322 - 325
  • [4] Undersampling Instance Selection for Hybrid and Incomplete Imbalanced Data
    Camacho-Nieto, Oscar
    Yanez-Marquez, Cornelio
    Villuendas-Rey, Yenny
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2020, 26 (06) : 698 - 719
  • [5] An Iterative Undersampling of Extremely Imbalanced Data Using CSVM
    Lee, Jong Bum
    Lee, Jee-Hyong
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2014), 2015, 9445
  • [6] Radial-Based Undersampling for imbalanced data classification
    Koziarski, Michal
    PATTERN RECOGNITION, 2020, 102
  • [7] A Membership Probability–Based Undersampling Algorithm for Imbalanced Data
    Gilseung Ahn
    You-Jin Park
    Sun Hur
    Journal of Classification, 2021, 38 : 2 - 15
  • [8] On Properties of Undersampling Bagging and Its Extensions for Imbalanced Data
    Stefanowski, Jerzy
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 407 - 417
  • [9] Relevant information undersampling to support imbalanced data classification
    Hoyos-Osorio, J.
    Alvarez-Meza, A.
    Daza-Santacoloma, G.
    Orozco-Gutierrez, A.
    Castellanos-Dominguez, G.
    NEUROCOMPUTING, 2021, 436 : 136 - 146
  • [10] An approach for classification of highly imbalanced data using weighting and undersampling
    Ashish Anand
    Ganesan Pugalenthi
    Gary B. Fogel
    P. N. Suganthan
    Amino Acids, 2010, 39 : 1385 - 1391