Explaining Class-of-Service Oriented Network Traffic Classification with Superfeatures

被引：3

作者：

Chowdhury, Sayantan ^{[1
]}

Liang, Ben ^{[1
]}

Tizghadam, Ali ^{[2
]}

机构：

[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada

[2] TELUS Communications, Technol Strategy & Business Transformat, Edmonton, AB, Canada

来源：

BIG-DAMA'19: PROCEEDINGS OF THE 3RD ACM CONEXT WORKSHOP ON BIG DATA, MACHINE LEARNING AND ARTIFICIAL INTELLIGENCE FOR DATA COMMUNICATION NETWORKS | 2019年

基金：

加拿大自然科学与工程研究理事会;

关键词：

Traffic classification; class of service; machine learning; explanation framework; Shapley values; NEURAL-NETWORKS; INTERNET;

D O I：

10.1145/3359992.3366767

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent studies have demonstrated that machine learning can be useful for application-oriented network traffic classification. However, a network operator may not be able to infer the application of a traffic flow due to the frequent appearance of new applications or due to privacy and other constraints set by regulatory bodies. In this work, we consider traffic flow classification based on the class of service (CoS), using delay sensitivity as an example in this preliminary study. Our focus is on direct CoS classification without first inferring the application. Our experiments with real-world encrypted TCP flows show that this direct approach can be substantially more accurate than a two-step approach that first classifies the flows based on their applications. However, without invoking application labels, the direct approach is more opaque than the two-step approach. Therefore, to provide human understandable interpretation of the trained learning model, we further propose an explanation framework that utilizes groups of superfeatures defined using domain knowledge and their Shapley values in a cooperative game that mimics the learning model. Our experimental results further demonstrate that this explanation framework is consistent and provides important insights into the classification results.

引用

页码：29 / 34

页数：6

共 32 条

[21] A Survey of Techniques for Internet Traffic Classification using Machine Learning
Nguyen, Thuy T. T.
Armitage, Grenville
[J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2008, 10 (04): : 56 - 76
[22] Pedregosa F, 2011, J MACH LEARN RES, V12, P2825
[23] "Why Should I Trust You?" Explaining the Predictions of Any Classifier
Ribeiro, Marco Tulio
Singh, Sameer
Guestrin, Carlos
[J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1135 - 1144
[24] Roughan M., 2004, Proceedings of the 4th ACM SIGCOMM Conference on Internet Measurement, IMC'04, P135, DOI DOI 10.1145/1028788.1028805
[25] Shapley LS, 1953, CONTRIBUTIONS THEORY, VII, DOI [10.1515/9781400881970-018, DOI 10.1515/9781400881970-018]
[26] Explaining prediction models and individual predictions with feature contributions
Strumbelj, Erik
Kononenko, Igor
[J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (03) : 647 - 665
[27] A Framework for QoS-aware Traffic Classification Using Semi-supervised Machine Learning in SDNs
Wang, Pu
Lin, Shih-Chun
Luo, Min
[J]. PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2016), 2016, : 760 - 765
[28] Wang W, 2017, 2017 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), P43, DOI 10.1109/ISI.2017.8004872
[29] Internet Traffic Classification Using Constrained Clustering
Wang, Yu
Xiang, Yang
Zhang, Jun
Zhou, Wanlei
Wei, Guiyi
Yang, Laurence T.
[J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (11) : 2932 - 2943
[30] Zander S, 2005, LCN 2005: 30TH CONFERENCE ON LOCAL COMPUTER NETWORKS, PROCEEDINGS, P250

← 1 2 3 4 →