Breast cancer diagnosis using genetic programming generated feature

被引:85
作者
Guo, H [1 ]
Nandi, AK [1 ]
机构
[1] Univ Liverpool, Dept Elect & Elect Engn, Signal Proc & Commun Grp, Liverpool L69 3GJ, Merseyside, England
关键词
feature extraction; genetic programming; Fisher discriminant analysis; pattern recognition;
D O I
10.1016/j.patcog.2005.10.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel method for breast cancer diagnosis using the feature generated by genetic programming (GP). We developed a new feature extraction measure (modified Fisher linear discriminant analysis (MFLDA)) to overcome the limitation of Fisher criterion. GP as an evolutionary mechanism provides a training structure to generate features. A modified Fisher criterion is developed to help GP optimize features that allow pattern vectors belonging to different categories to distribute compactly and disjoint regions. First, the MFLDA is experimentally compared with some classical feature extraction methods (principal component analysis, Fisher linear discriminant analysis, alternative Fisher linear discriminant analysis). Second, the feature generated by GP based on the modified Fisher criterion is compared with the features generated by GP using Fisher criterion and an alternative Fisher criterion in terms of the classification performance. The classification is carried out by a simple classifier (minimum distance classifier). Finally, the same feature generated by GP is compared with a original feature set as the inputs to multi-layer perceptrons and support vector machine. Results demonstrate the capability of this method to transform information from high-dimensional feature space into one-dimensional space and automatically discover the relationship among data, to improve classification accuracy. (c) 2005 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:980 / 987
页数:8
相关论文
共 19 条
[1]  
[Anonymous], 1998, DATA MINING METHODS
[2]  
Blake C.L., 1998, UCI repository of machine learning databases
[3]   A comparison of linear genetic programming and neural networks in medical data mining [J].
Brameier, M ;
Banzhaf, W .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2001, 5 (01) :17-26
[4]   Alternative linear discriminant classifier [J].
Chen, SC ;
Yang, XB .
PATTERN RECOGNITION, 2004, 37 (07) :1545-1547
[5]   Two variations on Fisher's linear discriminant for pattern recognition [J].
Cooke, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (02) :268-273
[6]   Feature generation using genetic programming with application to fault classification [J].
Guo, H ;
Jack, LB ;
Nandi, AK .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2005, 35 (01) :89-99
[7]   Hierarchical classification and feature reduction for fast face detection with support vector machines [J].
Heisele, B ;
Serre, T ;
Prentice, S ;
Poggio, T .
PATTERN RECOGNITION, 2003, 36 (09) :2007-2017
[8]   A comparison of methods for multiclass support vector machines [J].
Hsu, CW ;
Lin, CJ .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (02) :415-425
[9]   Genetic algorithms for feature selection in machine condition monitoring with vibration signals [J].
Jack, LB ;
Nandi, AK .
IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2000, 147 (03) :205-212
[10]   Application of genetic programming for multicategory pattern classification [J].
Kishore, JK ;
Patnaik, LM ;
Mani, V ;
Agrawal, VK .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2000, 4 (03) :242-258