Web navigation prediction using multiple evidence combination and domain knowledge

被引:27
作者
Awad, Mamoun A. [1 ]
Khan, Latifur R. [1 ]
机构
[1] Univ Texas Dallas, Dept Comp Sci, Richardson, TX 75083 USA
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS | 2007年 / 37卷 / 06期
关键词
artificial neural networks (ANNs); association rule mining (ARM); Dempster's rule; Markov model; N-gram;
D O I
10.1109/TSMCA.2007.904781
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Predicting users' future requests in the World Wide Web can be applied effectively in many important applications, such as web search, latency reduction, and personalization systems. Such application has traditional tradeoffs between modeling complexity and prediction accuracy. In this paper, we study several hybrid models that combine different classification techniques, namely, Markov models, artificial neural networks (ANNs), and the All-Kth-Markov model, to resolve prediction using Dempster's rule. Such fusion overcomes the inability of the Markov model in predicting beyond the training data, as well as boosts the accuracy of ANN, particularly, when dealing with a large number of classes. We also employ a reduction technique, which uses domain knowledge, to reduce the number of classifiers to improve the predictive accuracy and the prediction time of ANNs. We demonstrate the effectiveness of our hybrid models by comparing our results with widely used techniques, namely, the Markov model, the All-Kth-Markov model, and association rule mining, based on a benchmark data set.
引用
收藏
页码:1054 / 1062
页数:9
相关论文
共 22 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
[Anonymous], ADV LARGE MARGIN CLA
[3]  
[Anonymous], 1998, Proceedings of the 7th international conference on World Wide Web (WWW), DOI [10.1016/S0169-7552(98)00110-X, DOI 10.1016/S0169-7552(98)00110-X]
[4]  
[Anonymous], 1997, Machine Learning
[5]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[6]   Dissimilarity learning for nominal data [J].
Cheng, V ;
Li, CH ;
Kwok, JT ;
Li, CK .
PATTERN RECOGNITION, 2004, 37 (07) :1471-1477
[7]  
CHINEN K, 1997, P 7 ANN C INET KUAL
[8]  
Cooley R., 1999, Knowledge and Information Systems, V1, P5
[9]  
Duchamp D, 1999, USENIX ASSOCIATION PROCEEDINGS OF THE 2ND USENIX SYMPOSIUM ON INTERNET TECHNOLOGIES AND SYSTEMS (USITS'99), P127
[10]  
GRIFFIOEN J, 1994, PROCEEDINGS OF THE SUMMER 1994 USENIX CONFERENCE, P197