Assessing the Impact of Batch-Based Data Aggregation Techniques for Feature Engineering on Machine Learning-Based Network IDSs

被引:2
作者
Magan-Carrion, Roberto [1 ]
Urda, Daniel [2 ]
Diaz-Cano, Ignacio [3 ]
Dorronsoro, Bernabe [4 ]
机构
[1] Univ Granada, Network Engn & Secur Grp, Dept Signal Theory Commun & Telemat, Granada, Spain
[2] Univ Burgos, Grp Inteligencia Computac Aplicada GICAP, Dept Ingn Informat, Escuela Politecn Super, Av Cantabria S-N, Burgos 09006, Spain
[3] Univ Cadiz, Appl Robot Grp, Dept Automat Elect Comp Architecture & Com Net En, Cadiz, Spain
[4] Univ Cadiz, Dept Comp Engn, Graph Methods Optimizat & Learning GOAL Grp, Cadiz, Spain
来源
14TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN SECURITY FOR INFORMATION SYSTEMS AND 12TH INTERNATIONAL CONFERENCE ON EUROPEAN TRANSNATIONAL EDUCATIONAL (CISIS 2021 AND ICEUTE 2021) | 2022年 / 1400卷
关键词
Machine learning; Feature engineering; NIDS; Cybersecurity; Information security;
D O I
10.1007/978-3-030-87872-6_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Communication networks and systems are continuously threatened by a great variety of cybersecurity attacks coming from new malware that targets old and new systems' vulnerabilities. In this sense, Intrusion Detection Systems (IDSs) and, specifically, Network IDSs (NIDSs) are used to count on robust methods and techniques to detect and classify security attacks. One of the important parts in the assessment of NIDSs, is the Feature Engineering (FE) process, where raw datasets are transformed onto derived ones where both, features and observations are smartly transformed. In this work, the ff4ml framework, which includes the Feature as a Counter (FaaC) FE approach, is used to transform raw features into new ones that are counters of the originals. The FaaC approach aggregates raw observations by time intervals, thus limiting its use to network datasets containing timestamps. This work proposes a batch-based aggregation technique that allows applying FaaC in timestamp-less datasets and analyzes its impact on the performance of Machine Learning (ML)-based NIDSs in comparison to timestamp-based aggregation approaches.
引用
收藏
页码:116 / 125
页数:10
相关论文
共 50 条
[31]   A Survey of Machine Learning-based loT Intrusion Detection Techniques [J].
Long, Jing ;
Fang, Fei ;
Luo, Haibo .
2021 IEEE 6TH INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2021), 2021, :7-12
[32]   Numerical Feature Selection and Hyperbolic Tangent Feature Scaling in Machine Learning-Based Detection of Anomalies in the Computer Network Behavior [J].
Protic, Danijela ;
Stankovic, Miomir ;
Prodanovic, Radomir ;
Vulic, Ivan ;
Stojanovic, Goran M. ;
Simic, Mitar ;
Ostojic, Gordana ;
Stankovski, Stevan .
ELECTRONICS, 2023, 12 (19)
[33]   Impact of the Choice of Cross-Validation Techniques on the Results of Machine Learning-Based Diagnostic Applications [J].
Tougui, Ilias ;
Jilbab, Abdelilah ;
El Mhamdi, Jamal .
HEALTHCARE INFORMATICS RESEARCH, 2021, 27 (03) :189-199
[34]   Deep learning-based feature engineering for stock price movement prediction [J].
Long, Wen ;
Lu, Zhichen ;
Cui, Lingxiao .
KNOWLEDGE-BASED SYSTEMS, 2019, 164 :163-173
[35]   LMFE: Learning-Based Multiscale Feature Engineering in Partial Discharge Detection [J].
Huang, Chao ;
Ding, Shengxian ;
Li, Shihua ;
Liu, Rongjie .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (05) :5848-5856
[36]   Machine Learning-Based Embedding for Discontinuous Time Series Machine Data [J].
Aremu, Oluseun Omotola ;
Hyland-Wood, David ;
McAree, Peter Ross .
2019 IEEE 17TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2019, :1321-1326
[37]   A Machine Learning-based Method for Clustering the Traffic of Linux NATed Network Entities with TCP/IP Feature [J].
Liu, Kehong ;
Wang, Qi ;
Gao, Tianye ;
Ma, Tianxing ;
Zang, Tianning .
PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, :1010-1016
[38]   Machine learning-based genetic feature identification and fatigue life prediction [J].
Zhou, Kun ;
Sun, Xingyue ;
Shi, Shouwen ;
Song, Kai ;
Chen, Xu .
FATIGUE & FRACTURE OF ENGINEERING MATERIALS & STRUCTURES, 2021, 44 (09) :2524-2537
[39]   Feature extraction for machine learning-based intrusion detection in IoT networks [J].
Sarhan, Mohanad ;
Layeghy, Siamak ;
Moustafa, Nour ;
Gallagher, Marcus ;
Portmann, Marius .
DIGITAL COMMUNICATIONS AND NETWORKS, 2024, 10 (01) :205-216
[40]   Machine Learning-Based Feature Mapping for Enhanced Understanding of the Housing Market [J].
Lystbaek, Michael Sahl ;
Srirajan, Tharsika Pakeerathan .
ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2024, 2024, 2141 :530-543