Differential prioritization in feature selection and classifier aggregation for multiclass microarray datasets

被引:0
作者
Chia Huey Ooi
Madhu Chetty
Shyh Wei Teng
机构
[1] Monash University,Gippsland School of Information Technology
来源
Data Mining and Knowledge Discovery | 2007年 / 14卷
关键词
Tissue classification; Microarray data analysis; Multiclass classification; Feature selection; Classifier aggregation;
D O I
暂无
中图分类号
学科分类号
摘要
The high dimensionality of microarray datasets endows the task of multiclass tissue classification with various difficulties—the main challenge being the selection of features deemed relevant and non-redundant to form the predictor set for classifier training. The necessity of varying the emphases on relevance and redundancy, through the use of the degree of differential prioritization (DDP) during the search for the predictor set is also of no small importance. Furthermore, there are several types of decomposition technique for the feature selection (FS) problem—all-classes-at-once, one-vs.-all (OVA) or pairwise (PW). Also, in multiclass problems, there is the need to consider the type of classifier aggregation used—whether non-aggregated (a single machine), or aggregated (OVA or PW). From here, first we propose a systematic approach to combining the distinct problems of FS and classification. Then, using eight well-known multiclass microarray datasets, we empirically demonstrate the effectiveness of the DDP in various combinations of FS decomposition types and classifier aggregation methods. Aided by the variable DDP, feature selection leads to classification performance which is better than that of rank-based or equal-priorities scoring methods and accuracies higher than previously reported for benchmark datasets with large number of classes. Finally, based on several criteria, we make general recommendations on the optimal choice of the combination of FS decomposition type and classifier aggregation method for multiclass microarray datasets.
引用
收藏
页码:329 / 366
页数:37
相关论文
共 50 条
  • [21] An experimental features selection approach in a multiclass classifier
    Reig-Bolano, Ramon
    Marti-Puig, Pere
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE OF THE CATALAN ASSOCIATION FOR ARTIFICIAL INTELLIGENCE, 2013, 256 : 137 - 140
  • [22] Degree of Differential Prioritization Prediction for Multiclass Molecular Classification
    Ooi, Chia Huey
    Chetty, Madhu
    Teng, Shyh Wei
    IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2009, 28 (04): : 45 - 51
  • [23] Ensemble Classifier Based on Interval Modeling for Microarray Datasets
    Bentkowska, Urszula
    Galka, Wojciech
    Mrukowicz, Marcin
    Wojtowicz, Aleksander
    ENTROPY, 2024, 26 (03)
  • [24] A Nested Genetic Algorithm for feature selection in high-dimensional cancer Microarray datasets
    Sayed, Sabah
    Nassef, Mohammad
    Badr, Amr
    Farag, Ibrahim
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 121 : 233 - 243
  • [25] A multi-objective feature selection and classifier ensemble technique for microarray data analysis
    Dash, Rasmita
    Misra, Bijan Bihari
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2018, 20 (02) : 123 - 160
  • [26] A genetic programming-based approach to the classification of multiclass microarray datasets
    Liu, Kun-Hong
    Xu, Chun-Gui
    BIOINFORMATICS, 2009, 25 (03) : 331 - 337
  • [27] Multiclass Lung Cancer Diagnosis by Gene Expression Programming and Microarray Datasets
    Azzawi, Hasseeb
    Hou, Jingyu
    Alanni, Russul
    Xiang, Yong
    Abdu-Aljabar, Rana
    Azzawi, Ali
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2017, 2017, 10604 : 541 - 553
  • [28] Feature selection with limited datasets
    Kupinski, MA
    Giger, ML
    MEDICAL PHYSICS, 1999, 26 (10) : 2176 - 2182
  • [29] Discriminative Least Squares Regression for Multiclass Classification and Feature Selection
    Xiang, Shiming
    Nie, Feiping
    Meng, Gaofeng
    Pan, Chunhong
    Zhang, Changshui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (11) : 1738 - 1754
  • [30] Fuzzy-rough discriminative feature selection and classification algorithm, with application to microarray and image datasets
    Kumar P, Pramod
    Vadakkepat, Prahlad
    Poh, Loh Ai
    APPLIED SOFT COMPUTING, 2011, 11 (04) : 3429 - 3440