Performance of a Deep-Learning Neural Network Model in Assessing Skeletal Maturity on Pediatric Hand Radiographs

被引:338
作者
Larson, David B. [1 ]
Chen, Matthew C. [2 ]
Lungren, Matthew P. [1 ]
Halabi, Safwan S. [1 ]
Stence, Nicholas V. [4 ]
Langlotz, Curtis P. [1 ,3 ]
机构
[1] Stanford Univ, Sch Med, Dept Radiol, 300 Pasteur Dr, Stanford, CA 94305 USA
[2] Stanford Univ, Sch Med, Dept Comp Sci, 300 Pasteur Dr, Stanford, CA 94305 USA
[3] Stanford Univ, Sch Med, Dept Biomed Informat, 300 Pasteur Dr, Stanford, CA 94305 USA
[4] Childrens Hosp Colorado, Dept Radiol, Aurora, CO USA
关键词
BONE-AGE ASSESSMENT; GREULICH; CHILDREN; PYLE; RELIABILITY; FUTURE; TANNER; SYSTEM;
D O I
10.1148/radiol.2017170236
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Purpose: To compare the performance of a deep-learning bone age assessment model based on hand radiographs with that of expert radiologists and that of existing automated models. Materials and Methods: The institutional review board approved the study. A total of 14 036 clinical hand radiographs and corresponding reports were obtained from two children's hospitals to train and validate the model. For the first test set, composed of 200 examinations, the mean of bone age estimates from the clinical report and three additional human reviewers was used as the reference standard. Overall model performance was assessed by comparing the root mean square (RMS) and mean absolute difference (MAD) between the model estimates and the reference standard bone ages. Ninety-five percent limits of agreement were calculated in a pairwise fashion for all reviewers and the model. The RMS of a second test set composed of 913 examinations from the publicly available Digital Hand Atlas was compared with published reports of an existing automated model. Results: The mean difference between bone age estimates of the model and of the reviewers was 0 years, with a mean RMS and MAD of 0.63 and 0.50 years, respectively. The estimates of the model, the clinical report, and the three reviewers were within the 95% limits of agreement. RMS for the Digital Hand Atlas data set was 0.73 years, compared with 0.61 years of a previously reported model. Conclusion: A deep-learning convolutional neural network model can estimate skeletal maturity with accuracy similar to that of an expert radiologist and to that of existing automated models.
引用
收藏
页码:313 / 322
页数:10
相关论文
共 33 条
[1]  
[Anonymous], 1983, ASSESSMENT SKELETAL
[2]   Bone age assessment practices in infants and older children among Society for Pediatric Radiology members [J].
Breen, Micheal A. ;
Tsai, Andy ;
Stamm, Aymeric ;
Kleinman, Paul K. .
PEDIATRIC RADIOLOGY, 2016, 46 (09) :1269-1274
[3]   Bone age assessment: a large scale comparison of the Greulich and Pyle, and Tanner and Whitehouse (TW2) methods [J].
Bull, RK ;
Edwards, PD ;
Kemp, PM ;
Fry, S ;
Hughes, IA .
ARCHIVES OF DISEASE IN CHILDHOOD, 1999, 81 (02) :172-173
[4]   Machine Learning for Medical Imaging1 [J].
Erickson, Bradley J. ;
Korfiatis, Panagiotis ;
Akkus, Zeynettin ;
Kline, Timothy L. .
RADIOGRAPHICS, 2017, 37 (02) :505-515
[5]   Dermatologist-level classification of skin cancer with deep neural networks [J].
Esteva, Andre ;
Kuprel, Brett ;
Novoa, Roberto A. ;
Ko, Justin ;
Swetter, Susan M. ;
Blau, Helen M. ;
Thrun, Sebastian .
NATURE, 2017, 542 (7639) :115-+
[6]   Computer-aided estimation of skeletal age and comparison with bone age evaluations by the method of Greulich-Pyle and Tanner-Whitehouse [J].
Frisch, H ;
Riedl, S ;
Waldhor, T .
PEDIATRIC RADIOLOGY, 1996, 26 (03) :226-231
[7]   Bone age assessment of children using a digital hand atlas [J].
Gertych, Arkadiusz ;
Zhang, Aifeng ;
Sayre, James ;
Pospiech-Kurkowska, Sywia ;
Huang, H. K. .
COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2007, 31 (4-5) :322-331
[8]   Deep Learning in Medical Imaging: Overview and Future Promise of an Exciting New Technique [J].
Greenspan, Hayit ;
van Ginneken, Bram ;
Summers, Ronald M. .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (05) :1153-1159
[9]  
Greulich W.W., 1971, RADIOGRAPHIC ATLAS S
[10]   The reliability of bone age determination in central European children using the Greulich and Pyle method [J].
Groell, R ;
Lindbichler, F ;
Riepl, T ;
Gherra, L ;
Roposch, A ;
Fotter, R .
BRITISH JOURNAL OF RADIOLOGY, 1999, 72 (857) :461-464