A novel approach to feature extraction from classification models based on information gene pairs

Cited by: 10
Authors
Li, J. [1]
Tang, X. [1]
Liu, J. [1]
Huang, J. [1]
Wang, Y. [1]
Affiliations
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
Keywords
feature extraction; information gene pair; microarray data; cancer classification; genetic algorithm
DOI
10.1016/j.patcog.2007.11.019
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Microarray experiments are now carried out in many laboratories, and microarray data are accumulating rapidly in public repositories. One of the major challenges in analyzing microarray data is how to extract and select effective features for accurate cancer classification. Here we introduce a new feature extraction and selection method based on information gene pairs that change significantly across different tissue samples. Experimental results on five public microarray data sets demonstrate that the feature subset selected by the proposed method performs well and achieves higher classification accuracy on several classifiers. We extensively compare the features selected by the proposed method with those selected by other methods, using different evaluation methods and classifiers. The results confirm that the proposed method performs as well as other methods on the acute lymphoblastic-acute myeloid leukemia, adenocarcinoma and breast cancer data sets while using fewer information genes, and that it significantly improves classification accuracy on the colon cancer and diffuse large B cell lymphoma data sets. (C) 2007 Elsevier Ltd. All rights reserved.
Pages: 1975-1984
Page count: 10
Related papers
27 in total
  • [1] Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling
    Alizadeh, AA
    Eisen, MB
    Davis, RE
    Ma, C
    Lossos, IS
    Rosenwald, A
    Boldrick, JG
    Sabet, H
    Tran, T
    Yu, X
    Powell, JI
    Yang, LM
    Marti, GE
    Moore, T
    Hudson, J
    Lu, LS
    Lewis, DB
    Tibshirani, R
    Sherlock, G
    Chan, WC
    Greiner, TC
    Weisenburger, DD
    Armitage, JO
    Warnke, R
    Levy, R
    Wilson, W
    Grever, MR
    Byrd, JC
    Botstein, D
    Brown, PO
    Staudt, LM
    [J]. NATURE, 2000, 403 (6769) : 503 - 511
  • [2] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [3] MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia
    Armstrong, SA
    Staunton, JE
    Silverman, LB
    Pieters, R
    den Boer, ML
    Minden, MD
    Sallan, SE
    Lander, ES
    Golub, TR
    Korsmeyer, SJ
    [J]. NATURE GENETICS, 2002, 30 (01) : 41 - 47
  • [4] Gene selection and classification from microarray data using kernel machine
    Cho, JH
    Lee, D
    Park, JH
    Lee, IB
    [J]. FEBS LETTERS, 2004, 571 (1-3) : 93 - 98
  • [5] New gene selection method for classification of cancer subtypes considering within-class variation
    Cho, JH
    Lee, D
    Park, JY
    Lee, IB
    [J]. FEBS LETTERS, 2003, 551 (1-3) : 3 - 7
  • [6] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
    Golub, TR
    Slonim, DK
    Tamayo, P
    Huard, C
    Gaasenbeek, M
    Mesirov, JP
    Coller, H
    Loh, ML
    Downing, JR
    Caligiuri, MA
    Bloomfield, CD
    Lander, ES
    [J]. SCIENCE, 1999, 286 (5439) : 531 - 537
  • [7] Gene-expression profiles in hereditary breast cancer.
    Hedenfalk, I
    Duggan, D
    Chen, YD
    Radmacher, M
    Bittner, M
    Simon, R
    Meltzer, P
    Gusterson, B
    Esteller, M
    Kallioniemi, OP
    Wilfond, B
    Borg, Å
    Trent, J
    Raffeld, M
    Yakhini, Z
    Ben-Dor, A
    Dougherty, E
    Kononen, J
    Bubendorf, L
    Fehrle, W
    Pittaluga, S
    Gruvberger, S
    Loman, N
    Johannsson, O
    Olsson, H
    Sauter, G
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2001, 344 (08) : 539 - 548
  • [8] Holland J. H., 1992, ADAPTATION NATURAL A, DOI 10.7551/MITPRESS/1090.001.0001
  • [9] Jaeger J., 2003, PACIFIC S BIOCOMPUTI, V8, P53, DOI 10.1142/9789812776303_0006
  • [10] Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks
    Khan, J
    Wei, JS
    Ringnér, M
    Saal, LH
    Ladanyi, M
    Westermann, F
    Berthold, F
    Schwab, M
    Antonescu, CR
    Peterson, C
    Meltzer, PS
    [J]. NATURE MEDICINE, 2001, 7 (06) : 673 - 679