TCSOM: Clustering transactions using self-organizing map

被引:16
作者
He, ZY [1 ]
Xu, XF [1 ]
Deng, SC [1 ]
机构
[1] Harbin Inst Technol, Dept Comp Sci & Engn, Harbin 150001, Peoples R China
关键词
clustering; self-organizing map; transactions; categorical data; data mining;
D O I
10.1007/s11063-005-8016-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-Organizing Map (SOM) networks have been successfully applied as a clustering method to numeric datasets. However, it is not feasible to directly apply SOM for clustering transactional data. This paper proposes the Transactions Clustering using SOM (TCSOM) algorithm for clustering binary transactional data. In the TCSOM algorithm, a normalized Dot Product norm based dissimilarity measure is utilized for measuring the distance between input vector and output neuron. And a modified weight adaptation function is employed for adjusting weights of the winner and its neighbors. More importantly, TCSOM is a one-pass algorithm, which is extremely suitable for data mining applications. Experimental results on real datasets show that TCSOM algorithm is superior to those state-of-the-art transactional data clustering algorithms with respect to clustering accuracy.
引用
收藏
页码:249 / 262
页数:14
相关论文
共 29 条
[1]  
[Anonymous], INFORM FUSION
[2]  
Barbara D., 2002, Proceedings of the Eleventh International Conference on Information and Knowledge Management. CIKM 2002, P582, DOI 10.1145/584792.584888
[3]  
Cristofor D, 2002, J UNIVERS COMPUT SCI, V8, P153
[4]  
DOMINGOS P, 2001, 2001 SIGMOD WORKSH R
[5]  
Flexer A., 2001, Intelligent Data Analysis, V5, P373
[6]  
Ganti Venkatesh., 1999, Int. Conf. Knowledge Discovery and Data Mining, P73, DOI DOI 10.1145/312129.312201
[7]  
Giannotti F., 2002, Principles of Data Mining and Knowledge Discovery. 6th European Conference, PKDD 2002. Proceedings (Lecture Notes in Artificial Intelligence Vol.2431), P175
[8]  
Gibson D., 1998, Proceedings of the Twenty-Fourth International Conference on Very-Large Databases, P311
[9]   ROCK: A robust clustering algorithm for categorical attributes [J].
Guha, S ;
Rastogi, R ;
Shim, K .
15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, :512-521
[10]  
HAN EH, 1997, SIGMOD WORKSH RES IS, P9