CABOSFV algorithm for high dimensional sparse data clustering

Cited by: 0
Authors
Wu, S [1]
Gao, XD [1]
Affiliation
[1] Univ Sci & Technol Beijing, Sch Management, Beijing 100083, Peoples R China
Source
JOURNAL OF UNIVERSITY OF SCIENCE AND TECHNOLOGY BEIJING | 2004, Vol. 11, No. 03
Keywords
clustering; data mining; sparse; high dimensionality
DOI
Not available
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
An algorithm, Clustering Algorithm Based On Sparse Feature Vector (CABOSFV), was proposed for clustering high dimensional binary sparse data. The algorithm compresses the data effectively using a 'Sparse Feature Vector', which greatly reduces the data scale, and it obtains the clustering result with only one data scan. Both theoretical analysis and empirical tests showed that CABOSFV has low computational complexity. The algorithm finds clusters in large high dimensional datasets efficiently and handles noise effectively.
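The abstract does not spell out the Sparse Feature Vector or the clustering rule, so the following is only a minimal sketch of the general idea it describes: each cluster is kept as a small summary that can be updated without revisiting earlier objects, and every object is assigned in a single scan. The `ClusterSummary` layout, the dissimilarity formula, the `threshold` parameter and the function name `one_pass_cluster` are all assumptions made for illustration, not details confirmed by this record.

```python
from dataclasses import dataclass


@dataclass
class ClusterSummary:
    """Hypothetical compressed summary of one cluster of binary sparse objects."""
    count: int          # number of objects in the cluster
    shared: frozenset   # attributes equal to 1 in every object of the cluster
    mixed: frozenset    # attributes equal to 1 in some objects but not in all

    def merged_with(self, obj: frozenset) -> "ClusterSummary":
        """Return the summary after adding one object, using only the summary."""
        shared = self.shared & obj
        # Attributes on one side only can no longer be shared by all objects.
        mixed = self.mixed | (self.shared ^ obj)
        return ClusterSummary(self.count + 1, shared, mixed)

    def dissimilarity(self) -> float:
        """Assumed measure: mixed attributes relative to cluster size and shared attributes."""
        if not self.shared:
            return float("inf")
        return len(self.mixed) / (self.count * len(self.shared))


def one_pass_cluster(objects, threshold):
    """Assign each object (a set of attribute ids with value 1) in a single scan."""
    clusters = []
    labels = []
    for obj in objects:
        obj = frozenset(obj)
        best_idx, best_cand = None, None
        for i, cluster in enumerate(clusters):
            cand = cluster.merged_with(obj)
            if best_cand is None or cand.dissimilarity() < best_cand.dissimilarity():
                best_idx, best_cand = i, cand
        if best_cand is not None and best_cand.dissimilarity() <= threshold:
            clusters[best_idx] = best_cand      # absorb the object into the best cluster
            labels.append(best_idx)
        else:
            clusters.append(ClusterSummary(1, obj, frozenset()))  # start a new cluster
            labels.append(len(clusters) - 1)
    return labels, clusters


labels, clusters = one_pass_cluster([{1, 2, 3}, {1, 2, 4}, {7, 8}, {7, 8, 9}], threshold=0.5)
print(labels)  # [0, 0, 1, 1]
```

Storing only the non-zero attribute ids keeps each summary small for sparse data, and because the summary of a merged cluster is derived from the existing summary plus the new object, the original objects never need to be read a second time.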
Pages: 283 - 288
Number of pages: 6
Related papers
50 in total
  • [1] CABOSFV algorithm for high dimensional sparse data clustering
    Sen Wu
    Xuedong Gao (Management School)
    Journal of University of Science and Technology Beijing(English Edition), 2004, (03) : 283 - 288
  • [2] Bidirectional CABOSFV for high dimensional sparse data clustering
    Gao, Xuedong
    Yang, Minghan
    Li, Ling
    2016 INTERNATIONAL CONFERENCE ON LOGISTICS, INFORMATICS AND SERVICE SCIENCES (LISS' 2016), 2016,
  • [3] DS_CABOSFV Clustering Algorithm for High Dimensional Data Stream
    Pan, Jing
    4TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST 2012), 2012, : 16 - 19
  • [4] A multi-block clustering algorithm for high dimensional binarized sparse data
    Kosztyan, Zsolt T.
    Telcs, Andras
    Abonyi, Janos
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
  • [5] Parallel clustering algorithm based on sparse index sort of high dimensional data
    Wu, Sen
    Feng, Xiao-Dong
    Wu, Qing-Hai
    Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2011, 31 (SUPPL. 2) : 13 - 18
  • [7] EFFECTIVE CLUSTERING ALGORITHM FOR HIGH-DIMENSIONAL SPARSE DATA BASED ON SOM
    Martinovic, Jan
    Slaninova, Katerina
    Vojacek, Lukas
    Drazdilova, Pavla
    Dvorsky, Jiri
    Vondrak, Ivo
    NEURAL NETWORK WORLD, 2013, 23 (02) : 131 - 147
  • [8] High Dimensional Data Clustering Algorithm Based on Sparse Feature Vector for Categorical Attributes
    Wu, Sen
    Wei, Guiying
    PROCEEDINGS OF 2010 INTERNATIONAL CONFERENCE ON LOGISTICS SYSTEMS AND INTELLIGENT MANAGEMENT, VOLS 1-3, 2010, : 973 - 976
  • [9] High Dimensional Sparse data Clustering Algorithm Based on Concept Feature Vector (CABOCFV)
    Wu, Sen
    Gu, Shujuan
    Gao, Xuedong
    IEEE/SOLI'2008: PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON SERVICE OPERATIONS AND LOGISTICS, AND INFORMATICS, VOLS 1 AND 2, 2008, : 202 - 206
  • [10] Clustering high dimensional sparse transactional data with constraints
    Li, Yanrong
    Gopalan, Raj P.
    2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 692 - +