A hierarchical clustering algorithm for categorical sequence data

被引:21
作者
Oh, SJ [1 ]
Kim, JY [1 ]
机构
[1] Hanyang Univ, Dept Ind Engn, Seoul 133791, South Korea
关键词
algorithms; hierarchical clustering; sequences; similarity measure;
D O I
10.1016/j.ipl.2004.04.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, there has been enormous growth in the amount of commercial and scientific data, such as protein sequences, retail transactions, and web-logs. In this paper, we study how to cluster these sequence datasets. We propose a new similarity measure to compute the similarity between two sequences and develop a hierarchical clustering algorithm. Using a splice dataset and synthetic datasets, we show that the quality of clusters generated by our proposed approach is better than that of clusters produced by traditional clustering algorithms. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:135 / 140
页数:6
相关论文
共 50 条
[21]   Hierarchical clustering for complex data [J].
Khan, L ;
Luo, F .
INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2005, 14 (05) :791-809
[22]   A hierarchical co-clustering algorithm for high-order heterogeneous data [J].
Yang, Xinxin ;
Huang, Shaobin .
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (01) :200-210
[23]   A hierarchical clustering algorithm based on GiST [J].
Zhou, Bing ;
Wang, He-xing ;
Wang, Cui-rong .
ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2007, 2 :125-+
[24]   A scalable hierarchical algorithm for unsupervised clustering [J].
Boley, D .
DATA MINING FOR SCIENTIFIC AND ENGINEERING APPLICATIONS, 2001, 2 :383-400
[25]   Improving data field hierarchical clustering using Barnes-Hut algorithm [J].
Zhuo, Zhongliu ;
Zhang, Xiaosong ;
Niu, Weina ;
Yang, Guowu ;
Zhang, Jingzhong .
PATTERN RECOGNITION LETTERS, 2016, 80 :113-120
[26]   A hierarchical clustering algorithm for MIMD architecture [J].
Du, ZH ;
Lin, F .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2004, 28 (5-6) :417-419
[27]   Avalanche: A Hierarchical, Divisive Clustering Algorithm [J].
Amalaman, Paul K. ;
Eick, Christoph F. .
MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2015, 2015, 9166 :296-310
[28]   Hierarchical clustering algorithm of the minimum risk [J].
Wang De-xing ;
Xu Jie-long ;
Yuan Hongchun .
MEASUREMENT TECHNOLOGY AND ENGINEERING RESEARCHES IN INDUSTRY, PTS 1-3, 2013, 333-335 :1410-1413
[29]   Hierarchical Clustering and CoClust Algorithm: A Nested Procedure to Analyse Sustainable Heating Data [J].
Di Lascio, F. Marta L. ;
Pappada, Roberta .
COMBINING, MODELLING AND ANALYZING IMPRECISION, RANDOMNESS AND DEPENDENCE, SMPS 2024, 2024, 1458 :85-92
[30]   Heterogeneous Metric Learning of Categorical Data with Hierarchical Couplings [J].
Zhu, Chengzhang ;
Cao, Longbing ;
Liu, Qiang ;
Yin, Jianping ;
Kumar, Vipin .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (07) :1254-1267