Network Traffic Data Collection for Machine Learning Analysis

被引:0
作者
Chao, James [1 ]
Rodriguez, Ramiro [1 ]
机构
[1] Naval Informat Warfare Ctr Pacif, San Diego, CA 53560 USA
来源
SPIE FUTURE SENSING TECHNOLOGIES 2023 | 2023年 / 12327卷
关键词
network traffic classification; machine learning; data collection;
D O I
10.1117/12.2664375
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Network traffic has increased substantially due to the introduction of advanced network-enabled applications and devices. The introduction of software defined networks (SDNs) and machine learning (ML) has empowered optimizing network operations and network traffic monitoring, resulting in improved complex traffic operations and security with faster malicious intention detections. This paper focuses on network traffic data collection systems, and the data is evaluated using a survey of ML algorithms, depending on the data type (tabular or image). Adhering to system architecture best practices including a decoupled design to integrate with existing network monitoring infrastructures and cybersecurity standards; and online and offline data collection via packet capture (PCAP) standards. For packet based network traffic data analysis, we convert captured data into images and feed into a convolutional neural network to classify the data based on requirements. For statistical based network traffic data analysis, we apply feature engineering on tabular data and feed into various ML systems to classify based on requirements. Finally, We show that the same ML algorithm outperforms publicly available datasets using our collection method.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Diverse Analysis of Data Mining and Machine Learning Algorithms to Secure Computer Network
    Neeraj Kumar
    Upendra Kumar
    Wireless Personal Communications, 2022, 124 : 1033 - 1059
  • [42] Traffic Accident Analysis Using Machine Learning Paradigms
    Chong, Miao
    Abraham, Ajith
    Paprzycki, Marcin
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2005, 29 (01): : 89 - 98
  • [43] Prediction of Twitter Traffic Based on Machine Learning and Data Analytics
    Li, Fuyou
    Zhang, Zitian
    Zhu, Yunpeng
    Zhang, Jie
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, : 443 - 448
  • [44] Specifics of Data Collection and Data Processing during Formation of RailVista Dataset for Machine Learning- and Deep Learning-Based Applications
    Abisheva, Gulsipat
    Goranin, Nikolaj
    Razakhova, Bibigul
    Aidynov, Tolegen
    Satybaldina, Dina
    SENSORS, 2024, 24 (16)
  • [45] A machine learning approach for feature selection traffic classification using security analysis
    Shafiq, Muhammad
    Yu, Xiangzhan
    Bashir, Ali Kashif
    Chaudhry, Hassan Nazeer
    Wang, Dawei
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (10) : 4867 - 4892
  • [46] A machine learning approach for feature selection traffic classification using security analysis
    Muhammad Shafiq
    Xiangzhan Yu
    Ali Kashif Bashir
    Hassan Nazeer Chaudhry
    Dawei Wang
    The Journal of Supercomputing, 2018, 74 : 4867 - 4892
  • [47] QUIC Network Traffic Classification Using Ensemble Machine Learning Techniques
    Almuhammadi, Sultan
    Alnajim, Abdullatif
    Ayub, Mohammed
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [48] Intrusion Detection using Network Traffic Profiling and Machine Learning for IoT
    Rose, Joseph R.
    Swann, Matthew
    Bendiab, Gueltoum
    Shiaeles, Stavros
    Kolokotronis, Nicholas
    PROCEEDINGS OF THE 2021 IEEE 7TH INTERNATIONAL CONFERENCE ON NETWORK SOFTWARIZATION (NETSOFT 2021): ACCELERATING NETWORK SOFTWARIZATION IN THE COGNITIVE AGE, 2021, : 409 - 415
  • [49] Machine learning approaches to network intrusion detection for contemporary internet traffic
    Muhammad U. Ilyas
    Soltan Abed Alharbi
    Computing, 2022, 104 : 1061 - 1076
  • [50] Network Traffic Classification Using Machine Learning for Software Defined Networks
    Kuranage, Menuka Perera Jayasuriya
    Piamrat, Kandaraj
    Hamma, Salima
    MACHINE LEARNING FOR NETWORKING (MLN 2019), 2020, 12081 : 28 - 39