DBStream: A holistic approach to large-scale network traffic monitoring and analysis

被引:18
作者
Baer, Arian [1 ]
Casas, Pedro [2 ]
D'Alconzo, Alessandro [2 ]
Fiadino, Pierdomenico [3 ]
Golab, Lukasz [4 ]
Mellia, Marco [5 ]
Schikuta, Erich [6 ]
机构
[1] FTW Forschungszentrum Telekommunikat Wien, Donau City St 1, A-1220 Vienna, Austria
[2] Austrian Inst Technol GmbH, AIT, Vienna, Austria
[3] EURECAT Technol Ctr Catalonia, Ave Diagonal 177,Planta 9, Barcelona 08018, Spain
[4] Univ Waterloo, 200 Univ Ave West, Waterloo, ON, Canada
[5] Politecn Torino, Corso Duca Abruzzi 24, I-10129 Turin, Italy
[6] Univ Vienna, Waehringerstr 29, A-1090 Vienna, Austria
关键词
Network monitoring; Data stream warehouse; Machine-to-machine traffic; On-line traffic classification; Machine learning; Cellular networks; DEGRADATION; MAPREDUCE; YOUTUBE;
D O I
10.1016/j.comnet.2016.04.020
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the last decade, many systems for the extraction of operational statistics from computer network interconnects have been designed and implemented. Those systems generate huge amounts of data of various formats and in various granularities, from packet level to statistics about whole flows. In addition, the complexity of Internet services has increased drastically with the introduction of cloud infrastructures, Content Delivery Networks (CDNs) and mobile Internet usage, and complexity will continue to increase in the future with the rise of Machine-to-Machine communication and ubiquitous wearable devices. Therefore, current and future network monitoring frameworks cannot rely only on information gathered at a single network interconnect, but must consolidate information from various vantage points distributed across the network. In this paper, we present DBStream, a holistic approach to large-scale network monitoring and analysis applications. After a precise system introduction, we show how its Continuous Execution Language (CEL) can be used to automate several data processing and analysis tasks typical for monitoring operational ISP networks. We discuss the performance of DBStream as compared to MapReduce processing engines and show how intelligent job scheduling can increase its performance even further. Furthermore, we show the versatility of DBStream by explaining how it has been integrated to import and process data from two passive network monitoring systems, namely METAWIN and Tstat. Finally, multiple examples of network monitoring applications are given, ranging from simple statistical analysis to more complex traffic classification tasks applying machine learning techniques using the Weka toolkit. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:5 / 19
页数:15
相关论文
共 50 条
[1]   A Traffic Visualization Framework for Monitoring Large-scale Inter- DataCenter Network [J].
Elbaham, Meryem ;
Nguyen, Kim Khoa ;
Cheriet, Mohammed .
2016 12TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT AND WORKSHOPS(CNSM 2016), 2016, :277-281
[2]   Large-Scale Mobile Traffic Analysis: A Survey [J].
Naboulsi, Diala ;
Fiore, Marco ;
Ribot, Stephane ;
Stanica, Razvan .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (01) :124-161
[3]   Large-scale automated forecasting for network safety and security monitoring [J].
Naveiro, Roi ;
Rodriguez, Simon ;
Rios Insua, David .
APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2019, 35 (03) :431-447
[4]   A Hybrid Approach to Detect Traffic Anomalies in Large-Scale Data Networks [J].
Sun, Xin ;
Sun, Fu-Shing .
2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, :1418-1419
[5]   A Feature Selection Method for Large-Scale Network Traffic Classification Based on Spark [J].
Wang, Yong ;
Ke, Wenlong ;
Tao, Xiaoling .
INFORMATION, 2016, 7 (01)
[6]   Network monitoring for energy efficiency in large-scale networks: the case of the Spanish Academic Network [J].
José Luis García-Dorado ;
Eduardo Magaña ;
Pedro Reviriego ;
Mikel Izal ;
Daniel Morató ;
Juan Antonio Maestro ;
Javier Aracil ;
Jorge E. López de Vergara .
The Journal of Supercomputing, 2012, 62 :1284-1304
[7]   Network monitoring for energy efficiency in large-scale networks: the case of the Spanish Academic Network [J].
Luis Garcia-Dorado, Jose ;
Magana, Eduardo ;
Reviriego, Pedro ;
Izal, Mikel ;
Morato, Daniel ;
Antonio Maestro, Juan ;
Aracil, Javier ;
Lopez de Vergara, Jorge E. .
JOURNAL OF SUPERCOMPUTING, 2012, 62 (03) :1284-1304
[8]   The HaLoop approach to large-scale iterative data analysis [J].
Bu, Yingyi ;
Howe, Bill ;
Balazinska, Magdalena ;
Ernst, Michael D. .
VLDB JOURNAL, 2012, 21 (02) :169-190
[9]   The HaLoop approach to large-scale iterative data analysis [J].
Yingyi Bu ;
Bill Howe ;
Magdalena Balazinska ;
Michael D. Ernst .
The VLDB Journal, 2012, 21 :169-190
[10]   Large-scale holistic approach to Web block classification: assembling the jigsaws of a Web page puzzle [J].
Andrey Kravchenko .
World Wide Web, 2019, 22 :1999-2015