Extraction of User Profile Based on the Hadoop Framework

Cited by: 0
Authors
Huang Lan [1]
Wang Xiao-wei [1]
Zhai Yan-dong [1]
Yang Bin [1]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130023, Peoples R China
Source
2009 5TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-8 | 2009
Keywords
Hadoop framework; user profile; distributed computing; Web data mining; MapReduce; HDFS
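DOI
Not available
Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract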
With the rapid development of the Internet, the volume of web information has increased dramatically, and users often feel lost amid it. Distributed processing of massive data on a cluster of many machines, and personalized search services based on user profiles, have become hotspots of research and development. This paper first studies the operating mechanism of Hadoop, a typical distributed processing framework from Apache, then implements the extraction of user profiles from a large volume of web log data, and verifies its efficiency through a comparison experiment against a single machine.
Pages: 5301-5306
Page count: 6