An Efficient Technique for Network Traffic Summarizationusing Multi view Clustering and Statistical Sampling

被引:5
作者
Ahmed, Mohiuddin [1 ]
Mahmood, Abdun Naser [1 ]
Maher, Michael J. [1 ]
机构
[1] UNSW, Sch Engn & Informat Technol, Canberra, ACT, Australia
关键词
Scalable Data Mining; Network Traffic Summarization; Multiview Clustering;
D O I
10.4108/sis.2.5.e4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There is significant interest in the data mining and network management communities to efficiently analyse huge amounts of network traffic, given the amount of network traffic generated even in small networks. Summarization is a primary data mining task for generating a concise yet informative summary of the given data and it is a research challenge to create summary from network traffic data. Existing clustering based summarization techniques lack the ability to create a suitable summary for further data mining tasks such as anomaly detection and require the summary size as an external input. Additionally, for complex and high dimensional network traffic datasets, there is often no single clustering solution that explains the structure of the given data. In this paper, we investigate the use of multiview clustering to create a meaningful summary using original data instances from network traffic data in an efficient manner. We develop a mathematically sound approach to select the summary size using a sampling technique. We compare our proposed approach with regular clustering based summarization incorporating the summary size calculation method and random approach. We validate our proposed approach using the benchmark network traffic dataset and state-of-the-art summary evaluation metrics.
引用
收藏
页码:1 / 9
页数:9
相关论文
共 18 条
[11]  
MacQueen J., 1967, P 5 BERK S MATH STAT
[12]  
Mahmood AN, 2011, C IND ELECT APPL, P2474, DOI 10.1109/ICIEA.2011.5976009
[13]  
McHugh J., 2000, ACM Transactions on Information and Systems Security, V3, P262, DOI 10.1145/382912.382923
[14]  
Portnoy L., 2001, P ACM CSS WORKSHOP D, P5
[15]  
Viet Ha-Thuc, 2008, 2008 IEEE International Conference on Research, Innovation and Vision for the Future in Computing and Communication Technologies (RIVF 2008), P240, DOI 10.1109/RIVF.2008.4586362
[16]  
Wagstaff L., 2005, INTERPLANETARY NETWO, V42
[17]  
Walpole M., FUNDAMENTALS PROBABI
[18]  
Wendel P., 2005, 5 IEEE INT S CLUST C