A support vector machine classifier reduces interscanner variation in the HRCT classification of regional disease pattern in diffuse lung disease: Comparison to a Bayesian classifier

被引:21
作者
Chang, Yongjun [1 ]
Lim, Jonghyuck [1 ]
Kim, Namkug [1 ]
Seo, Joon Beom [1 ]
Lynch, David A. [2 ]
机构
[1] Univ Ulsan, Coll Med, Dept Radiol, Seoul 138736, South Korea
[2] Natl Jewish Med & Res Ctr, Dept Radiol, Denver, CO 80206 USA
关键词
diffuse lung disease; interscanner variation; support vector machine; naive Bayesian classifier; multicenter trial; HIGH-RESOLUTION CT; TEXTURE CLASSIFICATION; COMPUTED-TOMOGRAPHY; QUANTIFICATION; EMPHYSEMA; DIFFERENTIATION; MDCT; SVM; ANN;
D O I
10.1118/1.4802214
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Purpose: To investigate the effect of using different computed tomography (CT) scanners on the accuracy of high-resolution CT (HRCT) images in classifying regional disease patterns in patients with diffuse lung disease, support vector machine (SVM) and Bayesian classifiers were applied to multicenter data. Methods: Two experienced radiologists marked sets of 600 rectangular 20 x 20 pixel regions of interest (ROIs) on HRCT images obtained from two scanners (GE and Siemens), including 100 ROIs for each of local patterns of lungs-normal lung and five of regional pulmonary disease patterns (ground-glass opacity, reticular opacity, honeycombing, emphysema, and consolidation). Each ROI was assessed using 22 quantitative features belonging to one of the following descriptors: histogram, gradient, run-length, gray level co-occurrence matrix, low-attenuation area cluster, and top-hat transform. For automatic classification, a Bayesian classifier and a SVM classifier were compared under three different conditions. First, classification accuracies were estimated using data from each scanner. Next, data from the GE and Siemens scanners were used for training and testing, respectively, and vice versa. Finally, all ROI data were integrated regardless of the scanner type and were then trained and tested together. All experiments were performed based on forward feature selection and fivefold cross-validation with 20 repetitions. Results: For each scanner, better classification accuracies were achieved with the SVM classifier than the Bayesian classifier (92% and 82%, respectively, for the GE scanner; and 92% and 86%, respectively, for the Siemens scanner). The classification accuracies were 82%/72% for training with GE data and testing with Siemens data, and 79%/72% for the reverse. The use of training and test data obtained from the HRCT images of different scanners lowered the classification accuracy compared to the use of HRCT images from the same scanner. For integrated ROI data obtained from both scanners, the classification accuracies with the SVM and Bayesian classifiers were 92% and 77%, respectively. The selected features resulting from the classification process differed by scanner, with more features included for the classification of the integrated HRCT data than for the classification of the HRCT data from each scanner. For the integrated data, consisting of HRCT images of both scanners, the classification accuracy based on the SVM was statistically similar to the accuracy of the data obtained from each scanner. However, the classification accuracy of the integrated data using the Bayesian classifier was significantly lower than the classification accuracy of the ROI data of each scanner. Conclusions: The use of an integrated dataset along with a SVM classifier rather than a Bayesian classifier has benefits in terms of the classification accuracy of HRCT images acquired with more than one scanner. This finding is of relevance in studies involving large number of images, as is the case in a multicenter trial with different scanners. (c) 2013 American Association of Physicists in Medicine.
引用
收藏
页数:12
相关论文
共 27 条
[11]   STATISTICAL AND STRUCTURAL APPROACHES TO TEXTURE [J].
HARALICK, RM .
PROCEEDINGS OF THE IEEE, 1979, 67 (05) :786-804
[12]  
Hastie T., 2001, ELEMENTS STAT LEARNI
[13]  
Joachims T., EUR C MACH LEARN, P137, DOI DOI 10.1007/BFB0026683
[14]   MEASUREMENT OF PULMONARY PARENCHYMAL ATTENUATION - USE OF SPIROMETRIC GATING WITH QUANTITATIVE CT [J].
KALENDER, WA ;
RIENMULLER, R ;
SEISSLER, W ;
BEHR, J ;
WELKE, M ;
FICHTE, H .
RADIOLOGY, 1990, 175 (01) :265-268
[15]   Automatic detection and quantification of ground-glass opacities on high-resolution CT using multiple neural networks: Comparison with a density mask [J].
Kauczor, HU ;
Heitmann, K ;
Heussel, CP ;
Marwede, D ;
Uthmann, T ;
Thelen, M .
AMERICAN JOURNAL OF ROENTGENOLOGY, 2000, 175 (05) :1329-1334
[16]  
Kim N., SPIE MED IM 2008 P, V6914
[17]   Development of an Automatic Classification System for Differentiation of Obstructive Lung Disease using HRCT [J].
Kim, Namkug ;
Seo, Joon Beom ;
Lee, Youngjoo ;
Lee, June Goo ;
Kim, Song Soo ;
Kang, Suk-Ho .
JOURNAL OF DIGITAL IMAGING, 2009, 22 (02) :136-148
[18]   Utility of high-resolution CT for management of diffuse lung disease: Results of a survey of US pulmonary physicians [J].
Scatarige, JC ;
Diette, GB ;
Haponik, EF ;
Merriman, B ;
Fishman, EK .
ACADEMIC RADIOLOGY, 2003, 10 (02) :167-175
[19]   The connection between regularization operators and support vector kernels [J].
Smola, AJ ;
Scholkopf, B ;
Muller, KR .
NEURAL NETWORKS, 1998, 11 (04) :637-649
[20]  
Sonka M., 2014, Cengage Learning