Data Transformation Schemes for CNN-Based Network Traffic Analysis: A Survey

被引:27
作者
Krupski, Jacek [1 ]
Graniszewski, Waldemar [1 ]
Iwanowski, Marcin [1 ]
机构
[1] Warsaw Univ Technol, Inst Control & Ind Elect, Ul Koszykowa 75, PL-00662 Warsaw, Poland
关键词
network traffic analysis; convolutional neural networks; machine learning; network traffic images; visualization of traffic; DEEP NEURAL-NETWORKS; CLASSIFICATION; INTERNET; FRAMEWORK; ATTACKS;
D O I
10.3390/electronics10162042
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The enormous growth of services and data transmitted over the internet, the bloodstream of modern civilization, has caused a remarkable increase in cyber attack threats. This fact has forced the development of methods of preventing attacks. Among them, an important and constantly growing role is that of machine learning (ML) approaches. Convolutional neural networks (CNN) belong to the hottest ML techniques that have gained popularity, thanks to the rapid growth of computing power available. Thus, it is no wonder that these techniques have started to also be applied in the network traffic classification domain. This has resulted in a constant increase in the number of scientific papers describing various approaches to CNN-based traffic analysis. This paper is a survey of them, prepared with particular emphasis on a crucial but often disregarded aspect of this topic-the data transformation schemes. Their importance is a consequence of the fact that network traffic data and machine learning data have totally different structures. The former is a time series of values-consecutive bytes of the datastream. The latter, in turn, are one-, two- or even three-dimensional data samples of fixed lengths/sizes. In this paper, we introduce a taxonomy of data transformation schemes. Next, we use this categorization to describe various CNN-based analytical approaches found in the literature.
引用
收藏
页数:35
相关论文
共 138 条
  • [41] Hussein A, 2019, 2019 SIXTH INTERNATIONAL CONFERENCE ON SOFTWARE DEFINED SYSTEMS (SDS), P146, DOI [10.1109/sds.2019.8768668, 10.1109/SDS.2019.8768668]
  • [42] An Unsupervised Deep Learning Model for Early Network Traffic Anomaly Detection
    Hwang, Ren-Hung
    Peng, Min-Chun
    Huang, Chien-Wei
    Lin, Po-Ching
    Van-Linh Nguyen
    [J]. IEEE ACCESS, 2020, 8 : 30387 - 30399
  • [43] Jia Fan, 2017, Transactions of Beijing Institute of Technology, V37, P1271, DOI 10.15918/J.TBIT1001-0645.2017.12.011
  • [44] Jia WQ, 2020, PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), P1144, DOI [10.1109/itnec48623.2020.9084918, 10.1109/ITNEC48623.2020.9084918]
  • [45] Jingliang Xue, 2020, Journal of Physics: Conference Series, V1617, DOI 10.1088/1742-6596/1617/1/012071
  • [46] Image-based anomaly detection technique: Algorithm, implementation and effectiveness
    Kim, Seong Soo
    Reddy, A. L. Narasimha
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2006, 24 (10) : 1942 - 1954
  • [47] Kim SS, 2005, IEEE ICC, P168
  • [48] Kim T, 2018, IEEE INT CONF BIG DA, P2960, DOI 10.1109/BigData.2018.8622568
  • [49] Network Prediction with Traffic Gradient Classification using Convolutional Neural Networks
    Ko, Taejin
    Raza, Syed M.
    Dang Thien Binh
    Kim, Moonseong
    Choo, Hyunseung
    [J]. PROCEEDINGS OF THE 2020 14TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM), 2020,
  • [50] Kolcun R., 2020, ARXIV201108605