Timely Classification and Verification of Network Traffic Using Gaussian Mixture Models

被引:4
作者
Alizadeh, Hassan [1 ]
Vranken, Harald [1 ,2 ]
Zuquete, Andre [3 ]
Miri, Ali [4 ]
机构
[1] Open Univ, Dept Comp Sci, NL-6401 Heerlen, Netherlands
[2] Radboud Univ Nijmegen, Inst Comp & Informat Sci, NL-6500 Nijmegen, Netherlands
[3] Univ Aveiro, Inst Engn Elect & Informat Aveiro IEETA, P-3810193 Aveiro, Portugal
[4] Ryerson Univ, Dept Comp Sci, Toronto, ON M5B 2K3, Canada
关键词
Gaussian mixture model (GMM); traffic classification; traffic anomaly detection; INTRUSION DETECTION; IDENTIFICATION;
D O I
10.1109/ACCESS.2020.2992556
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a novel approach for timely classification and verification of network traffic using Gaussian Mixture Models (GMMs). We generate a separate GMM for each class of applications using component-wise expectation-maximization (CEM) to match the network traffic distribution generated by these applications. We apply our models for both traffic classification, where the goal is to identify the source application from which the traffic originates, by evaluating the maximum posterior probability, and for traffic verification, where the goal is to verify whether the application that claims to be the source of the traffic is as expected, by likelihood testing. Our models use only the first initial packets of truncated flows in order to provide more efficient and timely traffic classification and verification. This allows for triggering timely countermeasures before the end of flows. We demonstrate the effectiveness of our approach by experiments on a public dataset collected from a real network. Our traffic classification approach outperforms other state-of-the-art approaches that are based on machine learning, and achieves up to 97.7 flow classification accuracy when using only 9 first initial packets of flows. We show that 96.6 flow classification accuracy can still be obtained when training the GMMs using only 0.5 of all flows. Our traffic verification approach achieves a minimum Half Total Error Rate (HTER) of 7.65 when using only 6 first initial packets of flows.
引用
收藏
页码:91287 / 91302
页数:16
相关论文
共 67 条
[1]  
Alizadeh H., 2015, P 2015 ACM INT WORKS, P11, DOI DOI 10.1145/2713579.2713586
[2]  
Alizadeh H., 2015 IEEE INT WORKSH, P1, DOI [10.1109/IWMN.2015.7322980, DOI 10.1109/IWMN.2015.7322980]
[3]   Traffic classification for managing Applications' networking profiles [J].
Alizadeh, Hassan ;
Zuquete, Andre .
SECURITY AND COMMUNICATION NETWORKS, 2016, 9 (14) :2557-2575
[4]   PLOTS OF HIGH-DIMENSIONAL DATA [J].
ANDREWS, DF .
BIOMETRICS, 1972, 28 (01) :125-&
[5]  
[Anonymous], 2007, EM ALGORITHM EXTENSI
[6]  
[Anonymous], 2001, P 8 INT C ART INT ST
[7]  
[Anonymous], 2007, Bayesian Statist.
[8]  
[Anonymous], 2012, FEATURE SELECTION KN
[9]  
[Anonymous], P ACM CONEXT C
[10]  
[Anonymous], 2011, Acm T. Intel. Syst. Tec., DOI DOI 10.1145/1961189.1961199