Feature Selection for SNP Data Based on SVM-RFE and AGA

被引:0
|
作者
Yang, Xutao [1 ]
Wu, Yue [1 ]
Jia, Min [1 ]
Lei, Zhou [1 ]
Liu, Zongtian [1 ]
机构
[1] Shanghai Univ, Dept Comp Engn & Sci, Shanghai, Peoples R China
来源
2011 AASRI CONFERENCE ON APPLIED INFORMATION TECHNOLOGY (AASRI-AIT 2011), VOL 1 | 2011年
关键词
SVM; SVM-RFE; GA; SNP; feature select;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Within the association analysis of high-throughput SNP data, it has been faced two main challenges, the super-high dimension data with less samples and the complex interaction between SNPs of genetic disease. This paper proposes a feature selection method, which combines Support Vector Machine based Recursive Feature Reduction method(SVM-RFE) and Adaptive Genetic Algorithm(AGA). Under the premise of retainning the correlation between SNPs, it significantly reduced the optimization space of critical SNPs using SVM-RFE, and then quickly found the SNPs which distinguish sample type most effectively using AGA. Compared with the Chi-square analysis and modified Relief algorithm, experimental resualt shows that the suspect SNPs selected by this method has a better ability to distinguish the type of samples. This method provides a feasible way for SNPs association study, and screen out an appropriate scale of SNP set for further biomedical research.
引用
收藏
页码:204 / 208
页数:5
相关论文
共 8 条
  • [1] SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation
    Blewitt, Marnie E.
    Gendrel, Anne-Valerie
    Pang, Zhenyi
    Sparrow, Duncan B.
    Whitelaw, Nadia
    Craig, Jeffrey M.
    Apedaile, Anwyn
    Hilton, Douglas J.
    Dunwoodie, Sally L.
    Brockdorff, Neil
    Kay, Graham F.
    Whitelaw, Emma
    [J]. NATURE GENETICS, 2008, 40 (05) : 663 - 669
  • [2] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [3] Cristianini Nello, 2000, An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods, DOI DOI 10.1017/CB09780511801389
  • [4] Gene selection for cancer classification using support vector machines
    Guyon, I
    Weston, J
    Barnhill, S
    Vapnik, V
    [J]. MACHINE LEARNING, 2002, 46 (1-3) : 389 - 422
  • [5] Kira K, 1992, P 9 NAT C ART INT NE
  • [6] [毛勇 MAO Yong], 2007, [模式识别与人工智能, Pattern recognition and artificial intelligence], V20, P211
  • [8] Vapnik V., 1998, STACISTICAL LEARNING