Efficient sequential access pattern mining for web recommendations

Cited by: 12
Authors
Zhou, Baoyao [1]
Hui, Siu [1]
Fong, Alvis [1]
Affiliation
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
DOI
10.3233/KES-2006-10205
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Sequential access pattern mining discovers interesting and frequent user access patterns from web logs. Most previous studies adopted Apriori-like sequential pattern mining techniques, which require multiple expensive scans of the database. More recent algorithms based on the Web Access Pattern tree (WAP-tree) can run an order of magnitude faster than traditional Apriori-like techniques. However, the conditional search strategies used in WAP-tree based mining algorithms require the reconstruction of large numbers of intermediate conditional WAP-trees during mining, which is also very costly. In this paper, we propose an efficient sequential access pattern mining algorithm known as CSB-mine (Conditional Sequence Base mining algorithm). CSB-mine operates directly on the conditional sequence bases of each frequent event, which eliminates the need to construct WAP-trees. This significantly improves the efficiency of the mining process compared with WAP-tree based mining algorithms, especially as the support threshold becomes smaller and the database becomes larger. We describe the CSB-mine algorithm and discuss its performance. We also discuss a sequential access-based web recommender system that incorporates CSB-mine for web recommendations.
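The general idea of mining from conditional sequence bases, rather than from rebuilt trees, can be illustrated with a minimal sketch. This is not the CSB-mine algorithm from the paper, whose details are not given in the abstract; it is a simplified projection-based miner. The function name `mine_sequences`, the choice to project on the first occurrence of an event, and the sample sessions are all assumptions made for illustration.

```python
from collections import Counter


def mine_sequences(sequences, min_support):
    """Return {pattern: support} for every frequent sequential pattern.

    Simplified illustration only: the conditional base of a pattern is
    taken to be the set of suffixes that follow the first occurrence of
    the pattern's last event (an assumption, not the paper's definition).
    """
    patterns = {}

    def grow(prefix, base):
        # Count each event at most once per sequence in the current base.
        counts = Counter()
        for seq in base:
            counts.update(set(seq))

        for event, support in sorted(counts.items()):
            if support < min_support:
                continue
            pattern = prefix + (event,)
            patterns[pattern] = support

            # Build the conditional base for the extended pattern directly
            # from the sequences; no tree structure is constructed.
            projected = []
            for seq in base:
                if event in seq:
                    suffix = seq[seq.index(event) + 1:]
                    if suffix:
                        projected.append(suffix)
            grow(pattern, projected)

    grow((), [tuple(s) for s in sequences])
    return patterns


if __name__ == "__main__":
    # Hypothetical web sessions; each letter stands for a visited page.
    sessions = [["a", "b", "c"], ["a", "c"], ["b", "a", "c"]]
    for pattern, support in mine_sequences(sessions, min_support=2).items():
        print(pattern, support)
```

Each recursive call works only on the suffixes relevant to the current pattern, so the data shrinks as patterns grow and no intermediate tree needs to be rebuilt, which is the cost the abstract attributes to WAP-tree based approaches.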
Pages: 155-168
Page count: 14