A streaming flow-based technique for traffic classification applied to 12 + 1 years of Internet traffic

被引:0
作者
Valentín Carela-Español
Pere Barlet-Ros
Albert Bifet
Kensuke Fukuda
机构
[1] UPC BarcelonaTech,
[2] HUAWEI Noah’s Ark Lab,undefined
[3] National Institute of Informatics (NII),undefined
来源
Telecommunication Systems | 2016年 / 63卷
关键词
Traffic classification; Machine learning; Stream classification; Hoeffding adaptive tree; Network monitoring;
D O I
暂无
中图分类号
学科分类号
摘要
The continuous evolution of Internet traffic and its applications makes the classification of network traffic a topic far from being completely solved. An essential problem in this field is that most of proposed techniques in the literature are based on a static view of the network traffic (i.e., they build a model or a set of patterns from a static, invariable dataset). However, very little work has addressed the practical limitations that arise when facing a more realistic scenario with an infinite, continuously evolving stream of network traffic flows. In this paper, we propose a streaming flow-based classification solution based on Hoeffding Adaptive Tree, a machine learning technique specifically designed for evolving data streams. The main novelty of our proposal is that it is able to automatically adapt to the continuous evolution of the network traffic without storing any traffic data. We apply our solution to a 12 + 1 year-long dataset from a transit link in Japan, and show that it can sustain a very high accuracy over the years, with significantly less cost and complexity than existing alternatives based on static learning algorithms, such as C4.5.
引用
收藏
页码:191 / 204
页数:13
相关论文
共 24 条
[1]  
Dainotti A(2012)Issues and future directions in traffic classification IEEE Network 26 35-40
[2]  
Pescapè A(2008)A survey of techniques for internet traffic classification using machine learning IEEE on Communications Surveys & Tutorials 10 56-76
[3]  
Claffy KC(2011)Analysis of the impact of sampling on netflow traffic classification Computer Networks 55 1083-1099
[4]  
Nguyen TT(2015)Independent comparison of popular dpi tools for traffic classification Computer Networks 76 75-89
[5]  
Armitage G(2014)Traffic identification engine: An open platform for traffic classification IEEE on Network 28 56-64
[6]  
Carela-Español V(2012)A survey on learning from data streams: current and future trends Progress in Artificial Intelligence 1 45-55
[7]  
Barlet-Ros P(1963)Probability inequalities for sums of bounded random variables Journal of the American Statistical Association 58 13-30
[8]  
Cabellos-Aparicio A(2009)Efficient application identification and the temporal and spatial stability of classification schema Computer Networks 53 790-809
[9]  
Solé-Pareta J(2006)A preliminary performance comparison of five machine learning algorithms for practical ip traffic flow classification ACM SIGCOMM Computer Communication Review Journal 36 5-16
[10]  
Bujlow T(undefined)undefined undefined undefined undefined-undefined