A support vector machine classifier reduces interscanner variation in the HRCT classification of regional disease pattern in diffuse lung disease: Comparison to a Bayesian classifier

被引:21
作者
Chang, Yongjun [1 ]
Lim, Jonghyuck [1 ]
Kim, Namkug [1 ]
Seo, Joon Beom [1 ]
Lynch, David A. [2 ]
机构
[1] Univ Ulsan, Coll Med, Dept Radiol, Seoul 138736, South Korea
[2] Natl Jewish Med & Res Ctr, Dept Radiol, Denver, CO 80206 USA
关键词
diffuse lung disease; interscanner variation; support vector machine; naive Bayesian classifier; multicenter trial; HIGH-RESOLUTION CT; TEXTURE CLASSIFICATION; COMPUTED-TOMOGRAPHY; QUANTIFICATION; EMPHYSEMA; DIFFERENTIATION; MDCT; SVM; ANN;
D O I
10.1118/1.4802214
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Purpose: To investigate the effect of using different computed tomography (CT) scanners on the accuracy of high-resolution CT (HRCT) images in classifying regional disease patterns in patients with diffuse lung disease, support vector machine (SVM) and Bayesian classifiers were applied to multicenter data. Methods: Two experienced radiologists marked sets of 600 rectangular 20 x 20 pixel regions of interest (ROIs) on HRCT images obtained from two scanners (GE and Siemens), including 100 ROIs for each of local patterns of lungs-normal lung and five of regional pulmonary disease patterns (ground-glass opacity, reticular opacity, honeycombing, emphysema, and consolidation). Each ROI was assessed using 22 quantitative features belonging to one of the following descriptors: histogram, gradient, run-length, gray level co-occurrence matrix, low-attenuation area cluster, and top-hat transform. For automatic classification, a Bayesian classifier and a SVM classifier were compared under three different conditions. First, classification accuracies were estimated using data from each scanner. Next, data from the GE and Siemens scanners were used for training and testing, respectively, and vice versa. Finally, all ROI data were integrated regardless of the scanner type and were then trained and tested together. All experiments were performed based on forward feature selection and fivefold cross-validation with 20 repetitions. Results: For each scanner, better classification accuracies were achieved with the SVM classifier than the Bayesian classifier (92% and 82%, respectively, for the GE scanner; and 92% and 86%, respectively, for the Siemens scanner). The classification accuracies were 82%/72% for training with GE data and testing with Siemens data, and 79%/72% for the reverse. The use of training and test data obtained from the HRCT images of different scanners lowered the classification accuracy compared to the use of HRCT images from the same scanner. For integrated ROI data obtained from both scanners, the classification accuracies with the SVM and Bayesian classifiers were 92% and 77%, respectively. The selected features resulting from the classification process differed by scanner, with more features included for the classification of the integrated HRCT data than for the classification of the HRCT data from each scanner. For the integrated data, consisting of HRCT images of both scanners, the classification accuracy based on the SVM was statistically similar to the accuracy of the data obtained from each scanner. However, the classification accuracy of the integrated data using the Bayesian classifier was significantly lower than the classification accuracy of the ROI data of each scanner. Conclusions: The use of an integrated dataset along with a SVM classifier rather than a Bayesian classifier has benefits in terms of the classification accuracy of HRCT images acquired with more than one scanner. This finding is of relevance in studies involving large number of images, as is the case in a multicenter trial with different scanners. (c) 2013 American Association of Physicists in Medicine.
引用
收藏
页数:12
相关论文
共 27 条
[1]  
Basu A., 2003, 36 ANN HAW INT C SYS
[2]   The semivariogram in comparison to the co-occurrence matrix for classification of image texture [J].
Carr, JR ;
de Miranda, FP .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 1998, 36 (06) :1945-1952
[3]   Obstructive lung diseases: Texture classification for differentiation at CT [J].
Chabat, F ;
Yang, GZ ;
Hansell, DM .
RADIOLOGY, 2003, 228 (03) :871-877
[4]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[5]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[6]   A quantification of the lung surface area in emphysema using computed tomography [J].
Coxson, HO ;
Rogers, RM ;
Whittall, KP ;
D'Yachkova, Y ;
Paré, PD ;
Sciurba, FC ;
Hogg, JC .
AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 1999, 159 (03) :851-856
[7]   Usual interstitial pneumonia - Quantitative assessment of high-resolution computed tomography findings by computer-assisted texture-based image analysis [J].
Delorme, S ;
KellerReichenbecher, MA ;
Zuna, I ;
Schlegel, W ;
vanKaick, G .
INVESTIGATIVE RADIOLOGY, 1997, 32 (09) :566-574
[8]  
Fujisaki Tatsuya, 2004, Radiat Med, V22, P233
[9]   Emphysema Quantification in Inflation-Fixed Lungs Using Low-Dose Computed Tomography and 3He Magnetic Resonance Imaging [J].
Gierada, David S. ;
Woods, Jason C. ;
Jacob, Richard E. ;
Bierhals, Andrew J. ;
Choong, Cliff K. ;
Bartel, Seth T. ;
Chang, Yulin V. ;
Das, Nitin A. ;
Hong, Cheng ;
Lutey, Barbara A. ;
Ritter, Jon H. ;
Pilgram, Thomas K. ;
Cooper, Joel D. ;
Patterson, G. Alexander ;
Battafarano, Richard J. ;
Meyers, Bryan F. ;
Yablonskiy, Dmitriy A. ;
Conradi, Mark S. .
JOURNAL OF COMPUTER ASSISTED TOMOGRAPHY, 2010, 34 (05) :773-779
[10]   CHRONIC DIFFUSE INTERSTITIAL LUNG-DISEASE - DIAGNOSTIC-VALUE OF CHEST RADIOGRAPHY AND HIGH-RESOLUTION CT [J].
GRENIER, P ;
VALEYRE, D ;
CLUZEL, P ;
BRAUNER, MW ;
LENOIR, S ;
CHASTANG, C .
RADIOLOGY, 1991, 179 (01) :123-132