Comparative Study of Feature Selection and Classification Techniques for High-Throughput DNA Methylation Data

被引:0
|
作者
Alkuhlani, Alhasan [1 ]
Nassef, Mohammad [1 ]
Farag, Ibrahim [1 ]
机构
[1] Cairo Univ, Dept Comp Sci, Fac Comp & Informat, Giza, Egypt
来源
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016 | 2017年 / 533卷
关键词
Microarray; DNA Methylation; Feature selection; Classification; Cross-alidation; SUPPORT VECTOR MACHINES; GENE SELECTION; CANCER CLASSIFICATION; MICROARRAY DATA;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The high dimensionality of data is a common problem in classification. In this work, a small number of significant features is investigated to classify data of two sample groups. Various feature selection and classification techniques are applied in a collection of four high-throughput DNA methylation microarray data sets. Using accuracy as a performance metric, the repeated 10-fold cross-validation strategy is implemented to evaluate the different proposed techniques. Combining the Signal to Noise Ratio (SNR) and Wilcoxon rank-sum test filter methods with Support Vector Machine-Recursive Feature Elimination (SVM-RFE) as an embedded method has resulted in a perfect performance. In addition, the linear classifiers showed excellent results compared to others classifiers when applied to such data sets.
引用
收藏
页码:793 / 803
页数:11
相关论文
共 50 条
  • [1] Classification Based on Feature Extraction For Hepatocellular Carcinoma Diagnosis Using High-Throughput Dna Methylation Sequencing Data
    Yang, Zhiyuan
    Jin, Meng
    Zhang, Zhongyang
    Lu, Jianwei
    Hao, Ke
    ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2017, 107 : 412 - 417
  • [2] Feature cluster selection for high-throughput data analysis
    Yu, Lei
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2009, 3 (02) : 177 - 191
  • [3] A Comparative Study of Various Feature Selection Techniques in High-Dimensional data set to Improve Classification Accuracy
    Shroff, Kandarp P.
    Maheta, Hardik H.
    2015 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2015,
  • [4] Attack classification using feature selection techniques: a comparative study
    Ankit Thakkar
    Ritika Lohiya
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 1249 - 1266
  • [5] Attack classification using feature selection techniques: a comparative study
    Thakkar, Ankit
    Lohiya, Ritika
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (01) : 1249 - 1266
  • [6] Assessing Differential Variability of High-Throughput DNA Methylation Data
    Saddiki, Hachem
    Colicino, Elena
    Lesseur, Corina
    CURRENT ENVIRONMENTAL HEALTH REPORTS, 2022, 9 (04) : 625 - 630
  • [7] Assessing Differential Variability of High-Throughput DNA Methylation Data
    Hachem Saddiki
    Elena Colicino
    Corina Lesseur
    Current Environmental Health Reports, 2022, 9 : 625 - 630
  • [8] Representation and classification for high-throughput data
    Wessels, LFA
    Reinders, MJT
    van Welsem, T
    Nederlof, PM
    BIOMEDICAL NANOTECHNOLOGY ARCHITECTURES AND APPLICATIONS, 2002, 4626 : 226 - 237
  • [9] A comparative study of feature selection and classification methods for gene expression data of glioma
    Abusamra, Heba
    4TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS-BIOLOGY AND BIOINFORMATICS (CSBIO2013), 2013, 23 : 5 - 14
  • [10] Feature selection method based on support vector machine and shape analysis for high-throughput medical data
    Liu, Qiong
    Gu, Qiong
    Wu, Zhao
    COMPUTERS IN BIOLOGY AND MEDICINE, 2017, 91 : 103 - 111