Comparison and Evaluation of the Combinations of Feature Selection and Classifier on Microarray Data

被引:1
|
作者
Yan, Chaokun [1 ]
Zhang, Jun [1 ]
Kang, Xi [1 ]
Gong, Zhengze [1 ]
Wang, Jianlin [1 ]
Zhang, Ge [1 ]
机构
[1] Henan Univ, Sch Comp & Informat Engn, Kaifeng, Peoples R China
来源
2021 IEEE 6TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2021) | 2021年
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Cancer classification prediction; Microarray data; Data analysis; Feature selection; Classification prediction; ALGORITHM; PREDICTION;
D O I
10.1109/ICBDA51983.2021.9403151
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As gene chip technology is widely used in cancer research, a large number of valuable microarray data has been rapidly accumulated. These data have the characteristics of "high-dimensional small samples", in which most genes are unrelated or redundant. For high-dimensional, small-sample, high-noise, and few-sample binary classification datasets, we explore which combination of feature selection method and classifier can achieve the relatively best prediction accuracy, while the number of features included is relatively low. We adopt the standard data analysis procedures: preprocessing the data set, using different feature selection methods to generate feature subsets, and applying different classifiers to predict each feature subset. The results are compared to find out which combination with the relatively high prediction accuracy and the relatively small number of features.
引用
收藏
页码:133 / 137
页数:5
相关论文
共 50 条
  • [31] A Meta-Review of Feature Selection Techniques in the Context of Microarray Data
    Mungloo-Dilmohamud, Zahra
    Jaufeerally-Fakim, Yasmina
    Pena-Reyes, Carlos
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2017, PT I, 2017, 10208 : 33 - 49
  • [32] Comparing Multiobjective Evolutionary Algorithms for Cancer Data Microarray Feature Selection
    Sol Dussaut, Julieta
    Javier Vidal, Pablo
    Ponzoni, Ignacio
    Carolina Olivera, Ana
    2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 149 - 156
  • [33] FEATURE SELECTION BY WEIGHTED-SNR FOR CANCER MICROARRAY DATA CLASSIFICATION
    Hengpraprohm, Supoj
    Chongstitvatana, Prabhas
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (12A): : 4627 - 4635
  • [34] Automated Feature Selection in Microarray Data Analysis using Deep Learning
    Tekade, Pallavi
    Joshi, Ram
    Salunke, Dipmala
    Gore, Shubham
    Shinde, Shaunak
    Bahirat, Divya
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE COMPUTING AND SMART SYSTEMS, ICSCSS 2024, 2024, : 1060 - 1066
  • [35] Feature selection algorithm based on mutual information and lasso for microarray data
    Zhongxin W.
    Gang S.
    Jing Z.
    Jia Z.
    Gang, Sun (ahfysungang@163.com), 1600, Bentham Science Publishers B.V., P.O. Box 294, Bussum, 1400 AG, Netherlands (10): : 278 - 286
  • [36] Feature selection of microarray data using multidimensional graph neural network and supernode hierarchical clustering
    Xie, Weidong
    Zhang, Shoujia
    Wang, Linjie
    Yu, Kun
    Li, Wei
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (03)
  • [37] Importance of feature selection stability in the classifier evaluation on high- dimensional genetic data
    Lukaszuk, Tomasz
    Krawczuk, Jerzy
    PEERJ, 2024, 12
  • [38] Machine learning approaches for classification of colorectal cancer with and without feature selection method on microarray data
    Nazari, Elham
    Aghemiri, Mehran
    Avan, Amir
    Mehrabian, Amin
    Tabesh, Hamed
    GENE REPORTS, 2021, 25
  • [39] On Taxonomy and Evaluation of Feature Selection-Based Learning Classifier System Ensemble Approaches for Data Mining Problems
    Debie, Essam
    Shafi, Kamran
    Merrick, Kathryn
    Lokan, Chris
    COMPUTATIONAL INTELLIGENCE, 2017, 33 (03) : 554 - 578
  • [40] Feature Selection for Cancer Classification on Microarray Expression Data
    Hsu, Hui-Huang
    Lu, Ming-Da
    ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, PROCEEDINGS, 2008, : 153 - 158