A Hybrid Information Filtering Algorithm Based on Distributed Web log Mining

被引:2
作者
Ling Yun [1 ]
Wang Xun [1 ]
Gu Huamao [1 ]
机构
[1] Zhejiang Gongshang Univ, Coll Comp & Informat Engn, Hangzhou, Zhejiang, Peoples R China
来源
THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS | 2008年
关键词
D O I
10.1109/ICCIT.2008.39
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
For distributed large commercial mirror sites, this paper presents a hybrid information filtering algorithm based on distributed web log mining. Based on multi-agent technology, this algorithm preprocesses the web logs of mirror sites, in which the web page's manual rating is replaced by user browsing preference, and then user access matrix is constructed and standardized. On this basis, this paper proposes utilizing web page similarities to predict the rating for pages not having been rated, thus increasing the pages that have been jointly rated among users. This method could effectively solve the sparsity of user ratings in collaborative filtering. Eventually, a hybrid-filtering model is proposed to overcome the drawbacks of the content-based filtering and the collaborative filtering models. The experimental results show that this algorithm is applicable to distributed web server clustering architecture, avoids the inaccuracy and complexity of web page's manual ratings, effectively solves the faults of traditional filtering models and greatly improve the recommendation quality.
引用
收藏
页码:1086 / 1091
页数:6
相关论文
共 11 条
[1]  
AGGARWAL CC, 2001, ACM PODS C
[2]  
BADRUL S, 2000, P EC 00
[3]  
BADRUL S, 2000, ACM WEB KDD WORKSH
[4]   Fab: Content-based, collaborative recommendation [J].
Balabanovic, M ;
Shoham, Y .
COMMUNICATIONS OF THE ACM, 1997, 40 (03) :66-72
[5]   Discovering relevant scientific literature on the Web [J].
Bollacker, KD ;
Lawrence, S ;
Giles, CL .
IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 2000, 15 (02) :42-47
[6]  
Breese J. S., 1998, UAI, P43, DOI 10.5555/2074094.2074100
[7]  
Deng Ai-Lin, 2003, Journal of Software, V14, P1621
[8]   Automatic personalization based on Web usage mining - Web usage mining can help improve the scalability, accuracy, and flexibility of recommender systems. [J].
Mobasher, B ;
Cooley, R ;
Srivastava, J .
COMMUNICATIONS OF THE ACM, 2000, 43 (08) :142-151
[9]  
Sarwar B, 2001, P 10 INT C WORLD WID, P285, DOI 10.1145/371920.372071
[10]  
Srivastava J., 2000, SIGKDD EXPLORATIONS, V1, P12, DOI [DOI 10.1145/846183.846188, 10.1145/846183.846188]