Network Traffic Data Collection for Machine Learning Analysis

被引:0
作者
Chao, James [1 ]
Rodriguez, Ramiro [1 ]
机构
[1] Naval Informat Warfare Ctr Pacif, San Diego, CA 53560 USA
来源
SPIE FUTURE SENSING TECHNOLOGIES 2023 | 2023年 / 12327卷
关键词
network traffic classification; machine learning; data collection;
D O I
10.1117/12.2664375
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Network traffic has increased substantially due to the introduction of advanced network-enabled applications and devices. The introduction of software defined networks (SDNs) and machine learning (ML) has empowered optimizing network operations and network traffic monitoring, resulting in improved complex traffic operations and security with faster malicious intention detections. This paper focuses on network traffic data collection systems, and the data is evaluated using a survey of ML algorithms, depending on the data type (tabular or image). Adhering to system architecture best practices including a decoupled design to integrate with existing network monitoring infrastructures and cybersecurity standards; and online and offline data collection via packet capture (PCAP) standards. For packet based network traffic data analysis, we convert captured data into images and feed into a convolutional neural network to classify the data based on requirements. For statistical based network traffic data analysis, we apply feature engineering on tabular data and feed into various ML systems to classify based on requirements. Finally, We show that the same ML algorithm outperforms publicly available datasets using our collection method.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Machine Learning-Powered Encrypted Network Traffic Analysis: A Comprehensive Survey
    Shen, Meng
    Ye, Ke
    Liu, Xingtong
    Zhu, Liehuang
    Kang, Jiawen
    Yu, Shui
    Li, Qi
    Xu, Ke
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2023, 25 (01): : 791 - 824
  • [22] Network traffic analysis using machine learning: an unsupervised approach to understand and slice your network
    Ons Aouedi
    Kandaraj Piamrat
    Salima Hamma
    J. K. Menuka Perera
    Annals of Telecommunications, 2022, 77 : 297 - 309
  • [23] Data Collection and Exploratory Analysis for Cyber Threat Intelligence Machine Learning Processes
    Wolf, Shaya
    Foster, Rita
    Mack, Andrea
    Priest, Zachary
    Haile, Jed
    2022 9TH SWISS CONFERENCE ON DATA SCIENCE (SDS), 2022, : 7 - 12
  • [24] Research on the reliability of network traffic data collection based on Hadoop
    Zong Feng
    PROCEEDINGS OF THE 2015 JOINT INTERNATIONAL MECHANICAL, ELECTRONIC AND INFORMATION TECHNOLOGY CONFERENCE (JIMET 2015), 2015, 10 : 454 - 457
  • [25] Using Machine Learning to Analyze Network Traffic Anomalies
    Khudoyarova, Anastasia
    Burlakov, Mikhail
    Kupriyashin, Mikhail
    PROCEEDINGS OF THE 2021 IEEE CONFERENCE OF RUSSIAN YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING (ELCONRUS), 2021, : 2344 - 2348
  • [26] Automated Hyperparameter Tuning and Ensemble Machine Learning Approach for Network Traffic Classification
    Chen, Liwei
    Sun, Xiu
    Li, Yuchan
    Jaseemuddin, Muhammad
    Kazi, Baha Uddin
    19TH IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING, BMSB 2024, 2024, : 690 - 695
  • [27] A Practical Model for Traffic Forecasting based on Big Data, Machine-learning, and Network KPIs
    Le, Luong-Vy
    Sinh, Do
    Tung, Li-Ping
    Lin, Bao-Shuh Paul
    2018 15TH IEEE ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2018,
  • [28] Analysis of IoT Device Network Traffic: Thinking Toward Machine Learning<bold> </bold>
    Ferman, Vian Adnan
    Tawfeeq, Mohammed Ali
    MICRO-ELECTRONICS AND TELECOMMUNICATION ENGINEERING, ICMETE 2021, 2022, 373 : 393 - 403
  • [29] Identification of User Application by an External Eavesdropper using Machine Learning Analysis on Network Traffic
    Fathi-Kazerooni, Sina
    Kaymak, Yagiz
    Rojas-Cessa, Roberto
    2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2019,
  • [30] HOPES: An Integrative Digital Phenotyping Platform for Data Collection, Monitoring, and Machine Learning
    Wang, Xuancong
    Vouk, Nikola
    Heaukulani, Creighton
    Buddhika, Thisum
    Martanto, Wijaya
    Lee, Jimmy
    Morris, Robert J. T.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (03)