Comparative Study of Feature Selection and Classification Techniques for High-Throughput DNA Methylation Data

Authors
Alkuhlani, Alhasan [1]
Nassef, Mohammad [1]
Farag, Ibrahim [1]
Affiliations
[1] Cairo Univ, Dept Comp Sci, Fac Comp & Informat, Giza, Egypt
Source
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016 | 2017 / Vol. 533
Keywords
Microarray; DNA Methylation; Feature selection; Classification; Cross-validation; SUPPORT VECTOR MACHINES; GENE SELECTION; CANCER CLASSIFICATION; MICROARRAY DATA;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
High dimensionality is a common problem in classification. This work investigates a small number of significant features for classifying data from two sample groups. Various feature selection and classification techniques are applied to a collection of four high-throughput DNA methylation microarray data sets. With accuracy as the performance metric, repeated 10-fold cross-validation is used to evaluate the proposed techniques. Combining the Signal-to-Noise Ratio (SNR) and Wilcoxon rank-sum test filter methods with Support Vector Machine-Recursive Feature Elimination (SVM-RFE), an embedded method, resulted in perfect performance. In addition, linear classifiers showed excellent results compared with other classifiers when applied to such data sets.
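The abstract names the SNR filter but does not give its formula. As a rough illustration only, the sketch below ranks features with the commonly used definition SNR = (mean_a - mean_b) / (sd_a + sd_b); whether the paper uses exactly this form is an assumption. The function names `snr_scores` and `top_k_features` and the toy beta-value matrices are hypothetical and are not taken from the paper.

```python
from statistics import mean, stdev


def snr_scores(group_a, group_b):
    """Per-feature signal-to-noise ratio: (mean_a - mean_b) / (sd_a + sd_b).

    Each group is a list of samples; each sample is a list of feature values.
    The formula is an assumed, commonly used definition, not the paper's own.
    """
    n_features = len(group_a[0])
    scores = []
    for j in range(n_features):
        a = [row[j] for row in group_a]
        b = [row[j] for row in group_b]
        denom = stdev(a) + stdev(b)
        # Guard against constant features, which would divide by zero.
        scores.append((mean(a) - mean(b)) / denom if denom > 0 else 0.0)
    return scores


def top_k_features(group_a, group_b, k):
    """Return indices of the k features with the largest absolute SNR."""
    scores = snr_scores(group_a, group_b)
    order = sorted(range(len(scores)), key=lambda j: abs(scores[j]), reverse=True)
    return order[:k]


# Toy methylation beta values (hypothetical): 3 samples per group, 3 features.
tumor = [[0.90, 0.50, 0.20], [0.80, 0.40, 0.30], [0.85, 0.60, 0.25]]
normal = [[0.10, 0.50, 0.22], [0.20, 0.45, 0.28], [0.15, 0.55, 0.24]]

selected = top_k_features(tumor, normal, k=1)
print(selected)
```

In a filter-then-embedded pipeline of the kind the abstract describes, a ranking like this would only shortlist candidate features; the shortlist would then be passed to a wrapper or embedded selector such as SVM-RFE and evaluated inside the cross-validation loop.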
Pages: 793-803
Number of pages: 11