Improving Classification Accuracy Using Fuzzy Clustering Coefficients of Variations (FCCV) Feature Selection Algorithm

被引:0
作者
Fong, Simon [1 ]
Liang, Justin [1 ]
Zhuang, Yan [1 ]
机构
[1] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China
来源
2014 IEEE 15TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI) | 2014年
关键词
Feature Selection; Fuzzy Clustering; Classification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the challenges in inferring a classification model with good prediction accuracy is to select the relevant features that contribute to maximum predictive power. Many feature selection techniques have been proposed and studied in the past, but none so far claimed to be the best. In this paper, a novel and efficient feature selection method called Fuzzy Clustering Coefficients of Variation (FCCV) is proposed. FCCV is based on a very simple principle of variance-basis which finds an optimal balance between generalization and over-fitting. Through a computer simulation experiment, 44 datasets with substantially large number of features are tested by FCCV in comparison to four popular feature selection techniques. Results show that FCCV outperformed them in all aspects of averaged performances and speed. By the simplicity of design it is anticipated that FCCV will be a useful alternative of preprocessing method for classification especially with those datasets that are characterized by many features.
引用
收藏
页码:147 / 151
页数:5
相关论文
共 7 条
  • [1] [Anonymous], P WILK INT C COMP SC
  • [2] Bache K., 2013, UCI Machine Learning Repository
  • [3] Eisenstein J., 2004, P 6 INT C MULT INT
  • [4] Fong S., 2013, P 2 INT C BIG DAT SC, P902
  • [5] Selecting Optimal Feature Set in High-Dimensional Data by Swarm Search
    Fong, Simon
    Zhuang, Yan
    Tang, Rui
    Yang, Xin-She
    Deb, Suash
    [J]. JOURNAL OF APPLIED MATHEMATICS, 2013,
  • [6] NEURAL NETWORKS AND THE BIAS VARIANCE DILEMMA
    GEMAN, S
    BIENENSTOCK, E
    DOURSAT, R
    [J]. NEURAL COMPUTATION, 1992, 4 (01) : 1 - 58
  • [7] Kohavi R., 1996, ICML