Comparative Study of Feature Selection and Classification Techniques for High-Throughput DNA Methylation Data

被引:0
|
作者
Alkuhlani, Alhasan [1 ]
Nassef, Mohammad [1 ]
Farag, Ibrahim [1 ]
机构
[1] Cairo Univ, Dept Comp Sci, Fac Comp & Informat, Giza, Egypt
来源
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016 | 2017年 / 533卷
关键词
Microarray; DNA Methylation; Feature selection; Classification; Cross-alidation; SUPPORT VECTOR MACHINES; GENE SELECTION; CANCER CLASSIFICATION; MICROARRAY DATA;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The high dimensionality of data is a common problem in classification. In this work, a small number of significant features is investigated to classify data of two sample groups. Various feature selection and classification techniques are applied in a collection of four high-throughput DNA methylation microarray data sets. Using accuracy as a performance metric, the repeated 10-fold cross-validation strategy is implemented to evaluate the different proposed techniques. Combining the Signal to Noise Ratio (SNR) and Wilcoxon rank-sum test filter methods with Support Vector Machine-Recursive Feature Elimination (SVM-RFE) as an embedded method has resulted in a perfect performance. In addition, the linear classifiers showed excellent results compared to others classifiers when applied to such data sets.
引用
收藏
页码:793 / 803
页数:11
相关论文
共 50 条
  • [21] Classification for high-throughput data with an optimal subset of principal components
    Song, Joon Jin
    Ren, Yuan
    Yan, Fenglan
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2009, 33 (05) : 408 - 413
  • [22] Feature Selection and Classification for DNA Methylation-Based Subtypes of Substance Misuses
    Zhang, Xinyu
    Hu, Ying
    Justice, Amy
    Krystal, John
    Xu, Ke
    NEUROPSYCHOPHARMACOLOGY, 2017, 42 : S468 - S469
  • [23] A comparative study on the effect of feature selection on classification accuracy
    Karabulut, Esra Mahsereci
    Ozel, Selma Ayse
    Ibrikci, Turgay
    FIRST WORLD CONFERENCE ON INNOVATION AND COMPUTER SCIENCES (INSODE 2011), 2012, 1 : 323 - 327
  • [24] A VARIATIONAL BAYES BETA MIXTURE MODEL FOR FEATURE SELECTION IN DNA METHYLATION STUDIES
    Ma, Zhanyu
    Teschendorff, Andrew E.
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2013, 11 (04)
  • [25] Eye tracking data guided feature selection for image classification
    Zhou, Xuan
    Gao, Xin
    Wang, Jiajun
    Yu, Hui
    Wang, Zhiyong
    Chi, Zheru
    PATTERN RECOGNITION, 2017, 63 : 56 - 70
  • [26] Classification of Schizophrenia by Iterative Random Forest Feature Selection Based on DNA Methylation Array Data
    Hu, Xinyu
    Li, Min
    Wang, Linconghua
    Li, Xingyi
    Wu, Fang-Xiang
    Wang, Jianxin
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 807 - 811
  • [27] Machine Learning and Feature Selection for the Classification of Mental Disorders from Methylation Data
    Bartlett, Christopher L.
    Glatt, Stephen J.
    Bichindaritz, Isabelle
    ARTIFICIAL INTELLIGENCE IN MEDICINE, AIME 2019, 2019, 11526 : 311 - 321
  • [28] Ensemble feature selection for high dimensional data: a new method and a comparative study
    Afef Ben Brahim
    Mohamed Limam
    Advances in Data Analysis and Classification, 2018, 12 : 937 - 952
  • [29] A comparative investigation of modern feature selection and classification approaches for the analysis of mass spectrometry data
    Gromski, Piotr S.
    Xu, Yun
    Correa, Elon
    Ellis, David I.
    Turner, Michael L.
    Goodacre, Royston
    ANALYTICA CHIMICA ACTA, 2014, 829 : 1 - 8
  • [30] Improve Abstract Data with Feature Selection for Classification Techniques
    Nuipian, Vatinee
    Meesad, Phayung
    Boonrawd, Pudsadee
    FUTURE INFORMATION TECHNOLOGY, 2011, 13 : 213 - 217