A fuzzy gaussian rank aggregation ensemble feature selection method for microarray data

被引:5
作者
Venkatesh, B. [1 ]
Anuradha, J. [1 ]
机构
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Vellore, Tamil Nadu, India
关键词
Hybrid feature selection; ensemble method; IBPSO; relief; mRMR; FC and kernel SVM; GENE-EXPRESSION DATA; SUPPORT VECTOR MACHINES; BINARY PSO; CLASSIFICATION; ALGORITHM; OPTIMIZATION; FILTER;
D O I
10.3233/KES-190134
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In Microarray Data, it is complicated to achieve more classification accuracy due to the presence of high dimensions, irrelevant and noisy data. And also It had more gene expression data and fewer samples. To increase the classification accuracy and the processing speed of the model, an optimal number of features need to extract, this can be achieved by applying the feature selection method. In this paper, we propose a hybrid ensemble feature selection method. The proposed method has two phases, filter and wrapper phase in filter phase ensemble technique is used for aggregating the feature ranks of the Relief, minimum redundancy Maximum Relevance (mRMR), and Feature Correlation (FC) filter feature selection methods. This paper uses the Fuzzy Gaussian membership function ordering for aggregating the ranks. In wrapper phase, Improved Binary Particle Swarm Optimization (IBPSO) is used for selecting the optimal features, and the RBF Kernel-based Support Vector Machine (SVM) classifier is used as an evaluator. The performance of the proposed model are compared with state of art feature selection methods using five benchmark datasets. For evaluation various performance metrics such as Accuracy, Recall, Precision, and F1-Score are used. Furthermore, the experimental results show that the performance of the proposed method outperforms the other feature selection methods.
引用
收藏
页码:289 / 301
页数:13
相关论文
共 42 条
  • [1] Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection
    Ang, Jun Chin
    Mirzal, Andri
    Haron, Habibollah
    Hamed, Haza Nuzly Abdull
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (05) : 971 - 989
  • [2] [Anonymous], 2001, Swarm Intelligence
  • [3] [Anonymous], 2006, DATA COMPLEXITY PATT
  • [4] [Anonymous], 2016, MACH LEARN APPL INT, DOI DOI 10.5121/MLAIJ.2016.3201
  • [5] Gene expression cancer classification using modified K-Nearest Neighbors technique
    Ayyad, Sarah M.
    Saleh, Ahmed, I
    Labib, Labib M.
    [J]. BIOSYSTEMS, 2019, 176 : 41 - 51
  • [6] Soft computing techniques for rank aggregation on the World Wide Web
    Beg, MMS
    Ahmad, N
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2003, 6 (01): : 5 - 22
  • [7] An ensemble of filters and classifiers for microarray data classification
    Bolon-Canedo, V.
    Sanchez-Marono, N.
    Alonso-Betanzos, A.
    [J]. PATTERN RECOGNITION, 2012, 45 (01) : 531 - 539
  • [8] A tutorial on Support Vector Machines for pattern recognition
    Burges, CJC
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) : 121 - 167
  • [9] Cervante L, 2012, IEEE C EVOL COMPUTAT
  • [10] Improved binary PSO for feature selection using gene expression data
    Chuang, Li-Yeh
    Chang, Hsueh-Wei
    Tu, Chung-Jui
    Yang, Cheng-Hong
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2008, 32 (01) : 29 - 38