Swarm intelligence based wavelet coefficient feature selection for mass spectral classification: An application to proteomics data

被引:9
作者
Zhao, Weixiang [1 ]
Davis, Cristina E. [1 ]
机构
[1] Univ Calif Davis, Dept Mech & Aeronaut Engn, Davis, CA 95616 USA
关键词
Wavelet; Swarm intelligence; Ant colony algorithm; Mass spectrometry; Feature selection; Support vector machine; PLS REGRESSION; OVARIAN-CANCER; NIR SPECTRA; SPECTROMETRY; TRANSFORM; DECOMPOSITION; OPTIMIZATION; PREDICTION; NETWORKS; SYSTEM;
D O I
10.1016/j.aca.2009.08.008
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper introduces the ant colony algorithm, a novel swarm intelligence based optimization method, to select appropriate wavelet coefficients from mass spectral data as a new feature selection method for ovarian cancer diagnostics. By determining the proper parameters for the ant colony algorithm (ACA) based searching algorithm, we perform the feature searching process for 100 times with the number of selected features fixed at 5. The results of this study show: (I) the classification accuracy based on the five selected wavelet coefficients can reach up to 100% for all the training, validating and independent testing sets; (2) the eight most popular selected wavelet coefficients of the 100 runs can provide 100% accuracy for the training set, 100% accuracy for the validating set, and 98.8% accuracy for the independent testing set, which suggests the robustness and accuracy of the proposed feature selection method: and (3) the mass spectral data corresponding to the eight popular wavelet coefficients can be located by reverse wavelet transformation and these located mass spectral data still maintain high classification accuracies (100% for the training set, 97.6% for the validating set, and 98.8% for the testing set) and also provide sufficient physical and medical meaning for future ovarian cancer mechanism studies. Furthermore, the corresponding mass spectral data (potential biomarkers) are in good agreement with other studies which have used the same sample set. Together these results suggest this feature extraction strategy will benefit the development of intelligent and real-time spectroscopy instrumentation based diagnosis and monitoring systems. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:15 / 23
页数:9
相关论文
共 34 条
[1]  
Addison PS, 2002, The illustrated wavelet transform handbook: introductory theory and applications in science
[2]   Ovarian cancer detection by logical analysis of proteomic data [J].
Alexe, G ;
Alexe, S ;
Liotta, LA ;
Petricoin, E ;
Reiss, M ;
Hammer, PL .
PROTEOMICS, 2004, 4 (03) :766-783
[3]  
[Anonymous], 2006, GENETIC ENG BIOTECHN
[4]  
CAMPBELL J, 2004, APPL MULTIVARIATE PR
[5]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[6]   Use of direct headspace-mass spectrometry coupled with chemometrics to predict aroma properties in Australian Riesling wine [J].
Cozzolino, Daniel ;
Smyth, Heather E. ;
Cynkar, Wies ;
Janik, Les ;
Dambergs, Robert G. ;
Gishen, Mark .
ANALYTICA CHIMICA ACTA, 2008, 621 (01) :2-7
[7]   ORTHONORMAL BASES OF COMPACTLY SUPPORTED WAVELETS [J].
DAUBECHIES, I .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 1988, 41 (07) :909-996
[8]   Ant system: Optimization by a colony of cooperating agents [J].
Dorigo, M ;
Maniezzo, V ;
Colorni, A .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1996, 26 (01) :29-41
[9]   Wavelet analysis of the baseline noise in HPLC [J].
Felinger, A ;
Káré, M .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2004, 72 (02) :225-232
[10]   Analysis of mammogram classification using a wavelet transform decomposition [J].
Ferreira, CBR ;
Borges, DL .
PATTERN RECOGNITION LETTERS, 2003, 24 (07) :973-982