A hierarchical clustering algorithm for categorical sequence data

被引:21
|
作者
Oh, SJ [1 ]
Kim, JY [1 ]
机构
[1] Hanyang Univ, Dept Ind Engn, Seoul 133791, South Korea
关键词
algorithms; hierarchical clustering; sequences; similarity measure;
D O I
10.1016/j.ipl.2004.04.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, there has been enormous growth in the amount of commercial and scientific data, such as protein sequences, retail transactions, and web-logs. In this paper, we study how to cluster these sequence datasets. We propose a new similarity measure to compute the similarity between two sequences and develop a hierarchical clustering algorithm. Using a splice dataset and synthetic datasets, we show that the quality of clusters generated by our proposed approach is better than that of clusters produced by traditional clustering algorithms. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:135 / 140
页数:6
相关论文
共 50 条
  • [1] A sequence-element-based hierarchical clustering algorithm for categorical sequence data
    Oh, SJ
    Kim, JY
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2005, 4 (01) : 81 - 96
  • [2] Ordering of categorical data in hierarchical clustering
    Kazimianec, Michail
    DATABASES AND INFORMATION SYSTEMS, 2008, : 401 - 404
  • [3] Hierarchical Sequence Clustering Algorithm for Data Mining
    Chezhian, V. Umadevi
    Subash, Thanappan
    Samy, M. Ragavan
    WORLD CONGRESS ON ENGINEERING, WCE 2011, VOL III, 2011, : 1861 - 1864
  • [4] A hierarchical clustering algorithm for categorical attributes
    Agarwal, Parul
    Alam, M. Afshar
    Biswas, Ranjit
    2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: ICCEA 2010, PROCEEDINGS, VOL 2, 2010, : 365 - 368
  • [5] THUS: An Efficient Two-stage Hierarchical Algorithm for Categorical Data Clustering
    Gao, Xuedong
    Yang, Minghan
    Wei, Guiying
    2018 8TH INTERNATIONAL CONFERENCE ON LOGISTICS, INFORMATICS AND SERVICE SCIENCES (LISS), 2018,
  • [6] Hierarchical division clustering framework for categorical data
    Wei, Wei
    Liang, Jiye
    Guo, Xinyao
    Song, Peng
    Sun, Yijun
    NEUROCOMPUTING, 2019, 341 : 118 - 134
  • [7] DHCC: Divisive hierarchical clustering of categorical data
    Xiong, Tengke
    Wang, Shengrui
    Mayers, Andre
    Monga, Ernest
    DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 24 (01) : 103 - 135
  • [8] Model-Based Hierarchical Clustering for Categorical Data
    Alalyan, Fahdah
    Zamzami, Nuha
    Bouguila, Nizar
    2019 IEEE 28TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2019, : 1424 - 1429
  • [9] QROCK: A quick version of the ROCK algorithm for clustering of categorical data
    Dutta, M
    Mahanta, AK
    Pujari, AK
    PATTERN RECOGNITION LETTERS, 2005, 26 (15) : 2364 - 2373
  • [10] A Hierarchical Clustering for Categorical Data Based on Holo-entropy
    Sun, Haojun
    Chen, Rongbo
    Jin, Shulin
    Qin, Yong
    2015 12TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA), 2015, : 269 - 274