Explaining Class-of-Service Oriented Network Traffic Classification with Superfeatures

被引:3
作者
Chowdhury, Sayantan [1 ]
Liang, Ben [1 ]
Tizghadam, Ali [2 ]
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
[2] TELUS Communications, Technol Strategy & Business Transformat, Edmonton, AB, Canada
来源
BIG-DAMA'19: PROCEEDINGS OF THE 3RD ACM CONEXT WORKSHOP ON BIG DATA, MACHINE LEARNING AND ARTIFICIAL INTELLIGENCE FOR DATA COMMUNICATION NETWORKS | 2019年
基金
加拿大自然科学与工程研究理事会;
关键词
Traffic classification; class of service; machine learning; explanation framework; Shapley values; NEURAL-NETWORKS; INTERNET;
D O I
10.1145/3359992.3366767
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent studies have demonstrated that machine learning can be useful for application-oriented network traffic classification. However, a network operator may not be able to infer the application of a traffic flow due to the frequent appearance of new applications or due to privacy and other constraints set by regulatory bodies. In this work, we consider traffic flow classification based on the class of service (CoS), using delay sensitivity as an example in this preliminary study. Our focus is on direct CoS classification without first inferring the application. Our experiments with real-world encrypted TCP flows show that this direct approach can be substantially more accurate than a two-step approach that first classifies the flows based on their applications. However, without invoking application labels, the direct approach is more opaque than the two-step approach. Therefore, to provide human understandable interpretation of the trained learning model, we further propose an explanation framework that utilizes groups of superfeatures defined using domain knowledge and their Shapley values in a cooperative game that mimics the learning model. Our experimental results further demonstrate that this explanation framework is consistent and provides important insights into the classification results.
引用
收藏
页码:29 / 34
页数:6
相关论文
共 32 条
  • [21] A Survey of Techniques for Internet Traffic Classification using Machine Learning
    Nguyen, Thuy T. T.
    Armitage, Grenville
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2008, 10 (04): : 56 - 76
  • [22] Pedregosa F, 2011, J MACH LEARN RES, V12, P2825
  • [23] "Why Should I Trust You?" Explaining the Predictions of Any Classifier
    Ribeiro, Marco Tulio
    Singh, Sameer
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1135 - 1144
  • [24] Roughan M., 2004, Proceedings of the 4th ACM SIGCOMM Conference on Internet Measurement, IMC'04, P135, DOI DOI 10.1145/1028788.1028805
  • [25] Shapley LS, 1953, CONTRIBUTIONS THEORY, VII, DOI [10.1515/9781400881970-018, DOI 10.1515/9781400881970-018]
  • [26] Explaining prediction models and individual predictions with feature contributions
    Strumbelj, Erik
    Kononenko, Igor
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (03) : 647 - 665
  • [27] A Framework for QoS-aware Traffic Classification Using Semi-supervised Machine Learning in SDNs
    Wang, Pu
    Lin, Shih-Chun
    Luo, Min
    [J]. PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2016), 2016, : 760 - 765
  • [28] Wang W, 2017, 2017 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), P43, DOI 10.1109/ISI.2017.8004872
  • [29] Internet Traffic Classification Using Constrained Clustering
    Wang, Yu
    Xiang, Yang
    Zhang, Jun
    Zhou, Wanlei
    Wei, Guiyi
    Yang, Laurence T.
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (11) : 2932 - 2943
  • [30] Zander S, 2005, LCN 2005: 30TH CONFERENCE ON LOCAL COMPUTER NETWORKS, PROCEEDINGS, P250