Feature subset selection for improving the performance of false positive reduction in lung nodule CAD

被引:61
作者
Boeroezky, Lilla [1 ]
Zhao, Luyin [1 ]
Lee, K. P. [1 ]
机构
[1] Philips Res N Amer, Briarcliff Manor, NY 10510 USA
来源
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE | 2006年 / 10卷 / 03期
关键词
computer-aided analysis; genetic algorithms (GAs); medical decision making; supervised machine learning; support vector machines;
D O I
10.1109/TITB.2006.872063
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a feature subset selection method based on genetic algorithms to improve the performance of false positive reduction in lung nodule computer-aided detection (CAD). It is coupled with a classifier based on support vector machines. The proposed approach determines automatically the optimal size of the feature set, and chooses the most relevant features from a feature pool. Its performance was tested using a lung nodule database (52 true nodules and 443 false ones) acquired by multislice CT scans. From 23 features calculated for each detected structure, the suggested method determined ten to be the optimal feature subset size, and selected the most relevant ten features. A support vector machine classifier trained with the optimal feature subset resulted in 100% sensitivity and 56.4% specificity using an independent validation set. Experiments show significant improvement achieved by a system incorporating the proposed method over a system without it. This approach can be also applied to other machine learning problems; e.g. computer-aided diagnosis of lung nodules.
引用
收藏
页码:504 / 511
页数:8
相关论文
共 26 条
[1]  
American Cancer Society, 2005, CANC FACTS FIG 2005
[2]  
Batista G., 2004, ACM SIGKDD EXPL NEWS, V6, P20, DOI [DOI 10.1145/1007730.1007735, 10.1145/1007730.1007735]
[3]  
BEGG R, 2003, P C CONV TECHN AS PA, V1, P354
[4]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[5]  
Doak J., 1992, Technical Report CSE-92-18
[6]   Current status and future potential of computer-aided diagnosis in medical imaging [J].
Doi, K .
BRITISH JOURNAL OF RADIOLOGY, 2005, 78 :S3-S19
[7]  
Eshelman L. J., 1991, FDN GENETIC ALGORITH, V1, P265, DOI DOI 10.1016/B978-0-08-050684-5.50020-3
[8]  
Fan RE, 2005, J MACH LEARN RES, V6, P1889
[9]  
GE Z, 2004, P MED IM 2004 IM PRO
[10]   Statistical pattern recognition: A review [J].
Jain, AK ;
Duin, RPW ;
Mao, JC .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (01) :4-37