Clustering Web Page Sessions Using Sequence Alignment Method

Cited by: 0
Authors
Poornalatha, G. [1 ]
Prakash, S. Raghavendra [1 ]
Affiliations
[1] NITK, Dept Informat Technol, Mangalore, India
Source
COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY | 2011 / Vol. 250
Keywords
clustering; sequence alignment; web usage mining; R-squared measure; dynamic programming;
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
This paper illustrates the clustering of web page sessions to identify users' navigation patterns. In the approach presented here, user sessions of variable length are compared pairwise, the number of alignments between them is found, and the distances are measured. Web page sessions are clustered using a modified k-means algorithm. A couple of web access logs, including the well-known NASA data set, are used to illustrate the effectiveness of the clustering. The R-squared measure is applied to determine the optimal number of clusters, and a chi-squared test is carried out to examine the association between the clustered web page sessions. These two measures show the goodness of the clusters formed.
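The pairwise comparison of variable-length sessions described in the abstract can be sketched with a standard dynamic-programming global alignment. This is only an illustrative sketch, not the authors' method: the abstract does not give the scoring scheme or the distance formula, so the function names `alignment_score` and `session_distance`, the `match`/`mismatch`/`gap` values, and the normalization to the range 0 to 1 are all assumptions.

```python
def alignment_score(a, b, match=2, mismatch=-1, gap=-1):
    """Global alignment score of two page sequences (Needleman-Wunsch style).

    The scoring values are assumed for illustration; the paper's abstract
    does not state the ones it uses.
    """
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    # Aligning a prefix against an empty sequence costs one gap per page.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap
    for i in range(1, rows):
        for j in range(1, cols):
            same = a[i - 1] == b[j - 1]
            diagonal = score[i - 1][j - 1] + (match if same else mismatch)
            score[i][j] = max(diagonal,
                              score[i - 1][j] + gap,   # gap in b
                              score[i][j - 1] + gap)   # gap in a
    return score[-1][-1]


def session_distance(a, b, match=2, mismatch=-1, gap=-1):
    """Assumed normalization: 0.0 for identical sessions, up to 1.0."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 0.0
    best_possible = match * longest
    # Negative scores are clipped so the distance stays within [0, 1].
    clipped = max(alignment_score(a, b, match, mismatch, gap), 0)
    return 1.0 - clipped / best_possible


s1 = ["/index", "/news", "/contact"]
s2 = ["/index", "/contact"]
print(alignment_score(s1, s2), session_distance(s1, s2))
```

A matrix of such pairwise distances is the kind of input a k-means-style clustering step could consume; how the paper's modified k-means actually uses the distances is not stated in the abstract.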
Pages: 479-483
Number of pages: 5