Feature clustering dimensionality reduction based on affinity propagation

被引:1
作者
Zhang, Yahong [1 ]
Li, Yujian [1 ]
Zhang, Ting [1 ]
Gadosey, Pius Kwao [1 ]
Liu, Zhaoying [1 ]
机构
[1] Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Classification; dimensionality reduction; feature clustering; affinity propagation; INFORMATION;
D O I
10.3233/IDA-163337
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature clustering is a powerful technique for dimensionality reduction. However, existing approaches require the number of clusters to be given in advance or controlled by parameters. In this paper, by combining with affinity propagation (AP), we propose a new feature clustering (FC) algorithm, called APFC, for dimensionality reduction. For a given training dataset, the original features automatically form a bunch of clusters by AP. A new feature can then be extracted from each cluster in three different ways for reducing the dimensionality of the original data. APFC requires no provision of the number of clusters (or extracted features) beforehand. Moreover, it avoids computing the eigenvalues and eigenvectors of covariance matrix which is often necessary in many feature extraction methods. In order to demonstrate the effectiveness and efficiency of APFC, extensive experiments are conducted to compare it with three well-established dimensionality reduction methods on 14 UCI datasets in terms of classification accuracy and computational time.
引用
收藏
页码:309 / 323
页数:15
相关论文
共 30 条
  • [1] Baker L. D., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P96, DOI 10.1145/290941.290970
  • [2] Belkin M, 2002, ADV NEUR IN, V14, P585
  • [3] Brown G, 2012, J MACH LEARN RES, V13, P27
  • [4] Chang C.-C., LIBSVM: a Library for Support Vector Machines
  • [5] Dhillon I. S., 2003, Journal of Machine Learning Research, V3, P1265, DOI 10.1162/153244303322753661
  • [6] Dueck D, 2009, CITESEER
  • [7] Fan RE, 2005, J MACH LEARN RES, V6, P1889
  • [8] Clustering by passing messages between data points
    Frey, Brendan J.
    Dueck, Delbert
    [J]. SCIENCE, 2007, 315 (5814) : 972 - 976
  • [9] He XF, 2005, IEEE I CONF COMP VIS, P1208
  • [10] A Fuzzy Self-Constructing Feature Clustering Algorithm for Text Classification
    Jiang, Jung-Yi
    Liou, Ren-Jia
    Lee, Shie-Jue
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (03) : 335 - 349