A comparison of ground truth estimation methods

Cited by: 16
Authors
Biancardi, Alberto M. [1]
Jirapatnakul, Artit C. [1]
Reeves, Anthony P. [1]
Affiliations
[1] Cornell Univ, Ithaca, NY 14850 USA
Keywords
CAD development; Algorithm validation; Volumetric measurement; Diagnosis; Response to therapy; IMAGE DATABASE CONSORTIUM; PULMONARY NODULES; CT; SEGMENTATION; ANNOTATION; SIZE; LIDC
DOI
10.1007/s11548-009-0401-3
Chinese Library Classification number
R318 [Biomedical engineering]
Subject classification code
0831
Abstract
Purpose: Knowledge of the exact shape of a lesion, or ground truth (GT), is necessary for the development of diagnostic tools by means of algorithm validation, measurement metric analysis, and accurate size estimation. Four methods that estimate GTs from multiple readers' documentation by considering the spatial location of voxels were compared: thresholded Probability-Map at 0.50 (TPM0.50) and at 0.75 (TPM0.75), simultaneous truth and performance level estimation (STAPLE), and truth estimate from self distances (TESD).
Methods: A subset of the publicly available Lung Image Database Consortium archive was used, selecting pulmonary nodules documented by all four radiologists. The pair-wise similarities between the estimated GTs were analyzed by computing the respective Jaccard coefficients. Then, with respect to the readers' marking volumes, the estimated volumes were ranked and the sign test of the differences between them was performed.
Results: (a) The rank variations among the four methods and the volume differences between STAPLE and TESD are not statistically significant; (b) TPM0.50 estimates are statistically larger; (c) TPM0.75 estimates are statistically smaller; (d) there is some spatial disagreement in the estimates, as shown by the one-sided 90% confidence intervals between TPM0.75 and TPM0.50, TPM0.75 and STAPLE, TPM0.75 and TESD, TPM0.50 and STAPLE, TPM0.50 and TESD, and STAPLE and TESD, which are, respectively: [0.67, 1.00], [0.67, 1.00], [0.77, 1.00], [0.93, 1.00], [0.85, 1.00], [0.85, 1.00].
Conclusions: The method used to estimate the GT is important. The differences indicate that STAPLE and TESD, notwithstanding a few weaknesses, appear to be equally viable as GT estimators, while the increased availability of computing power is decreasing the appeal of TPMs. Ultimately, the choice between these two methods depends on the specific characteristics of the marked data with respect to the two elements that differentiate the approaches: the relative reliabilities of the readers and the reliability of the region boundaries.
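As an illustration of the two simplest ideas named in the abstract, the following is a minimal sketch of a thresholded probability map and of the Jaccard coefficient between two estimates. It is not the authors' implementation: the function names, the toy one-dimensional reader masks, and the use of an inclusive threshold (>=) are assumptions made for the example only.

```python
import numpy as np

def probability_map(masks):
    """Fraction of readers that marked each voxel (values in [0, 1])."""
    stack = np.stack([np.asarray(m, dtype=bool) for m in masks])
    return stack.mean(axis=0)

def thresholded_pmap(masks, threshold):
    """Binary estimate: voxels marked by at least `threshold` of the readers.

    The inclusive comparison (>=) is an assumption of this sketch.
    """
    return probability_map(masks) >= threshold

def jaccard(a, b):
    """Jaccard coefficient |A and B| / |A or B| of two binary masks."""
    a = np.asarray(a, dtype=bool)
    b = np.asarray(b, dtype=bool)
    union = np.logical_or(a, b).sum()
    if union == 0:
        return 1.0  # two empty masks are treated as identical
    return float(np.logical_and(a, b).sum() / union)

# Hypothetical markings of the same six voxels by four readers.
readers = [
    [0, 1, 1, 1, 1, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 0, 1, 1, 1, 0],
    [0, 0, 1, 1, 0, 0],
]

tpm_050 = thresholded_pmap(readers, 0.50)  # at least 2 of 4 readers
tpm_075 = thresholded_pmap(readers, 0.75)  # at least 3 of 4 readers
similarity = jaccard(tpm_050, tpm_075)
```

In this toy case the 0.50 estimate covers four voxels and the 0.75 estimate covers two, so the lower threshold yields the larger region, which matches the direction of the volume differences reported in the results.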
Pages: 295-305
Number of pages: 11