Parallelized User Clicks Recognition from Massive HTTP Data Based on Dependency Graph Model

被引:4
作者
Fang Cheng [1 ]
Liu Jun [1 ]
Lei Zhenming [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing Key Lab Network Syst Architecture & Conve, Beijing 100876, Peoples R China
关键词
cloud computing; massive data; graph model; web usage mining;
D O I
10.1109/CC.2014.7019836
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
With increasingly complex website structure and continuously advancing web technologies, accurate user clicks recognition from massive HTTP data, which is critical for web usage mining, becomes more difficult. In this paper, we propose a dependency graph model to describe the relationships between web requests. Based on this model, we design and implement a heuristic parallel algorithm to distinguish user clicks with the assistance of cloud computing technology. We evaluate the proposed algorithm with real massive data. The size of the dataset collected from a mobile core network is 228.7GB. It covers more than three million users. The experiment results demonstrate that the proposed algorithm can achieve higher accuracy than previous methods.
引用
收藏
页码:13 / 25
页数:13
相关论文
共 20 条
[1]  
[Anonymous], 2011, IMC 11
[2]  
[Anonymous], INT CONTR AUT 2004 W
[3]   Web User-Session Inference by Means of Clustering Techniques [J].
Bianco, Andrea ;
Mardente, Gianluca ;
Mellia, Marco ;
Munafo, Maurizio ;
Muscariello, Luca .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2009, 17 (02) :405-416
[4]  
Chitraa V, 2011, INT J COMPUTER APPL, P34
[5]  
Domenech JM, 2007, LECT NOTES COMPUT SC, V4881, P695
[6]  
Fu YJ, 2000, LECT NOTES COMPUT SC, V1836, P21
[7]  
Jin Chang, 2010, Proceedings 2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010), P129, DOI 10.1109/ICDMW.2010.14
[8]   Characteristic analysis of internet traffic from the perspective of flows [J].
Kim, Myung-Sup ;
Won, Young J. ;
Hong, James W. .
COMPUTER COMMUNICATIONS, 2006, 29 (10) :1639-1652
[9]  
Lee J., 2007, NEW TRAFFIC MODEL CU
[10]  
Liu J, 2013, CHINA COMMUN, V10, P25, DOI 10.1109/CC.2013.6723876