Differential prioritization in feature selection and classifier aggregation for multiclass microarray datasets

被引:0
作者
Chia Huey Ooi
Madhu Chetty
Shyh Wei Teng
机构
[1] Monash University,Gippsland School of Information Technology
来源
Data Mining and Knowledge Discovery | 2007年 / 14卷
关键词
Tissue classification; Microarray data analysis; Multiclass classification; Feature selection; Classifier aggregation;
D O I
暂无
中图分类号
学科分类号
摘要
The high dimensionality of microarray datasets endows the task of multiclass tissue classification with various difficulties—the main challenge being the selection of features deemed relevant and non-redundant to form the predictor set for classifier training. The necessity of varying the emphases on relevance and redundancy, through the use of the degree of differential prioritization (DDP) during the search for the predictor set is also of no small importance. Furthermore, there are several types of decomposition technique for the feature selection (FS) problem—all-classes-at-once, one-vs.-all (OVA) or pairwise (PW). Also, in multiclass problems, there is the need to consider the type of classifier aggregation used—whether non-aggregated (a single machine), or aggregated (OVA or PW). From here, first we propose a systematic approach to combining the distinct problems of FS and classification. Then, using eight well-known multiclass microarray datasets, we empirically demonstrate the effectiveness of the DDP in various combinations of FS decomposition types and classifier aggregation methods. Aided by the variable DDP, feature selection leads to classification performance which is better than that of rank-based or equal-priorities scoring methods and accuracies higher than previously reported for benchmark datasets with large number of classes. Finally, based on several criteria, we make general recommendations on the optimal choice of the combination of FS decomposition type and classifier aggregation method for multiclass microarray datasets.
引用
收藏
页码:329 / 366
页数:37
相关论文
共 50 条
  • [31] Gene Selection and Classification of Pancreatic Microarray datasets
    Sserwadda, Abubakhari
    Sarac, Omer Sinan
    [J]. 2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [32] Multiclass MTS for Simultaneous Feature Selection and Classification
    Su, Chao-Ton
    Hsiao, Yu-Hsiang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (02) : 192 - 205
  • [33] MF-GARF: Hybridizing Multiple Filters and GA Wrapper for Feature Selection of Microarray Cancer Datasets
    Saqib, Pakizah
    Qamar, Usman
    Khan, Reda Ayesha
    Aslam, Andleeb
    [J]. 2020 22ND INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): DIGITAL SECURITY GLOBAL AGENDA FOR SAFE SOCIETY!, 2020, : 517 - 524
  • [34] Feature selection in high-dimensional microarray cancer datasets using an improved equilibrium optimization approach
    Balakrishnan, Kulanthaivel
    Dhanalakshmi, Ramasamy
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (28)
  • [35] FEATURE SELECTION FOR DATASETS OF WINE FERMENTATIONS
    Mucherino, Antonio
    Urtubia, Alejandra
    [J]. 10TH INTERNATIONAL CONFERENCE ON MODELING AND APPLIED SIMULATION, MAS 2011, 2011, : 309 - 313
  • [36] Feature Selection with Dynamic Classifier Ensembles
    Kiziloz, Hakan Ezgi
    Deniz, Ayca
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 2038 - 2043
  • [37] An ensemble svm classifier with feature selection
    Hu, Han
    En-en, Ren
    [J]. 2007 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY, PROCEEDINGS, 2007, : 6 - 8
  • [38] Classifier ensemble methods in feature selection
    Kiziloz, Hakan Ezgi
    [J]. NEUROCOMPUTING, 2021, 419 : 97 - 107
  • [39] Prominent feature selection of microarray data
    Yihui Liu School of Computer Science and Information Technology
    [J]. Progress in Natural Science, 2009, 19 (10) : 1365 - 1371
  • [40] FEATURE DISCRETIZATION AND SELECTION IN MICROARRAY DATA
    Ferreira, Artur
    Figueiredo, Mario
    [J]. KDIR 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2011, : 465 - 469