Development and Validation of a Deep Learning-based Automatic Detection Algorithm for Active Pulmonary Tuberculosis on Chest Radiographs

被引:154
作者
Hwang, Eui Jin [1 ]
Park, Sunggyun [2 ]
Jin, Kwang-Nam [3 ]
Kim, Jung Im [4 ]
Choi, So Young [5 ]
Lee, Jong Hyuk [1 ]
Goo, Jin Mo [1 ]
Aum, Jaehong [2 ]
Yim, Jae-Joon [6 ]
Park, Chang Min [1 ]
Kim, Dong Hyeon [7 ]
Kim, Dong Hyeon [7 ]
Woo, Sungmin [8 ]
Choi, Wonseok [7 ]
Hwang, In Pyung [7 ]
Song, Yong Sub [7 ]
Lim, Jiyeon [7 ]
Kim, Hyungjin [7 ]
Wi, Jae Yeon [9 ]
Oh, Su Suk [10 ]
Kang, Mi-Jin [11 ]
Woo, Chris
机构
[1] Seoul Natl Univ, Coll Med, Dept Radiol, 101 Daehak Ro, Seoul 03080, South Korea
[2] Lunit Inc, Seoul, South Korea
[3] Seoul Natl Univ, Dept Radiol, Boramae Med Ctr, Seoul, South Korea
[4] Kyung Hee Univ Hosp Gangdong, Dept Radiol, Seoul, South Korea
[5] Eulji Univ, Med Ctr, Dept Radiol, Daejon, South Korea
[6] Seoul Natl Univ, Coll Med, Dept Internal Med, Div Pulm & Crit Care Med, Seoul, South Korea
[7] Seoul Natl Univ Hosp, Coll Med, Dept Radiol, Seoul, South Korea
[8] Armed Forces Daejon Hosp, Dept Radiol, Daejon, South Korea
[9] Asan Med Ctr, Dept Radiol, Seoul, South Korea
[10] Seoul Natl Univ Hosp, Dept Radiol, Seoul, South Korea
[11] Inje Univ, Sanggyepaik Hosp, Dept Radiol, Seoul, South Korea
关键词
tuberculosis; chest radiograph; deep learning; computer-aided detection; COMPUTER-AIDED DETECTION; CT COLONOGRAPHY; 2ND READER; CLASSIFICATION; NODULES;
D O I
10.1093/cid/ciy967
中图分类号
R392 [医学免疫学]; Q939.91 [免疫学];
学科分类号
100102 ;
摘要
Background Detection of active pulmonary tuberculosis on chest radiographs (CRs) is critical for the diagnosis and screening of tuberculosis. An automated system may help streamline the tuberculosis screening process and improve diagnostic performance. Methods We developed a deep learning-based automatic detection (DLAD) algorithm using 54c221 normal CRs and 6768 CRs with active pulmonary tuberculosis that were labeled and annotated by 13 board-certified radiologists. The performance of DLAD was validated using 6 external multicenter, multinational datasets. To compare the performances of DLAD with physicians, an observer performance test was conducted by 15 physicians including nonradiology physicians, board-certified radiologists, and thoracic radiologists. Image-wise classification and lesion-wise localization performances were measured using area under the receiver operating characteristic (ROC) curves and area under the alternative free-response ROC curves, respectively. Sensitivities and specificities of DLAD were calculated using 2 cutoffs (high sensitivity [98%] and high specificity [98%]) obtained through in-house validation. Results DLAD demonstrated classification performance of 0.977-1.000 and localization performance of 0.973-1.000. Sensitivities and specificities for classification were 94.3%-100% and 91.1%-100% using the high-sensitivity cutoff and 84.1%-99.0% and 99.1%-100% using the high-specificity cutoff. DLAD showed significantly higher performance in both classification (0.993 vs 0.746-0.971) and localization (0.993 vs 0.664-0.925) compared to all groups of physicians. Conclusions Our DLAD demonstrated excellent and consistent performance in the detection of active pulmonary tuberculosis on CR, outperforming physicians, including thoracic radiologists. A deep learning-based algorithm outperformed radiologists in detecting active pulmonary tuberculosis on chest radiographs and thus may play an important role in diagnosis and screening of tuberculosis in select situations, contributing to the reduction of the high burden of tuberculosis worldwide.
引用
收藏
页码:739 / 747
页数:9
相关论文
共 31 条
[1]  
[Anonymous], 2017, COMPUTING RES REPOSI
[2]  
[Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
[3]  
[Anonymous], 2013, SYST SCREEN ACT TUB
[4]   Diagnostic Accuracy of Computer-Aided Detection of Pulmonary Tuberculosis in Chest Radiographs: A Validation Study from Sub-Saharan Africa [J].
Breuninger, Marianne ;
van Ginneken, Bram ;
Philipsen, Rick H. H. M. ;
Mhimbira, Francis ;
Hella, Jerry J. ;
Lwilla, Fred ;
van den Hombergh, Jan ;
Ross, Amanda ;
Jugheli, Levan ;
Wagner, Dirk ;
Reither, Klaus .
PLOS ONE, 2014, 9 (09)
[5]   Understanding the Software Development Practices of Blockchain Projects: A Survey [J].
Chakraborty, Partha ;
Shahriyar, Rifat ;
Iqbal, Anindya ;
Bosu, Amiangshu .
PROCEEDINGS OF THE 12TH ACM/IEEE INTERNATIONAL SYMPOSIUM ON EMPIRICAL SOFTWARE ENGINEERING AND MEASUREMENT (ESEM 2018), 2018,
[6]  
den Boon S, 2006, INT J TUBERC LUNG D, V10, P876
[7]   RECEIVER OPERATING CHARACTERISTIC RATING ANALYSIS - GENERALIZATION TO THE POPULATION OF READERS AND PATIENTS WITH THE JACKKNIFE METHOD [J].
DORFMAN, DD ;
BERBAUM, KS ;
METZ, CE .
INVESTIGATIVE RADIOLOGY, 1992, 27 (09) :723-731
[8]   Dermatologist-level classification of skin cancer with deep neural networks [J].
Esteva, Andre ;
Kuprel, Brett ;
Novoa, Roberto A. ;
Ko, Justin ;
Swetter, Susan M. ;
Blau, Helen M. ;
Thrun, Sebastian .
NATURE, 2017, 542 (7639) :115-+
[9]   Observer Performance in the Detection and Classification of Malignant Hepatic Nodules and Masses with CT Image-Space Denoising and Iterative Reconstruction [J].
Fletcher, Joel G. ;
Yu, Lifeng ;
Li, Zhoubo ;
Manduca, Armando ;
Blezek, Daniel J. ;
Hough, David M. ;
Venkatesh, Sudhakar K. ;
Brickner, Gregory C. ;
Cernigliaro, Joseph C. ;
Hara, Amy K. ;
Fidler, Jeff L. ;
Lake, David S. ;
Shiung, Maria ;
Lewis, David ;
Leng, Shuai ;
Augustine, Kurt E. ;
Carter, Rickey E. ;
Holmes, David R., III ;
McCollough, Cynthia H. .
RADIOLOGY, 2015, 276 (02) :465-478
[10]   Single reading with computer-aided detection for screening mammography [J].
Gilbert, Fiona J. ;
Astley, Susan M. ;
Gillan, Maureen G. C. ;
Agbaje, Olorunsola F. ;
Wallis, Matthew G. ;
James, Jonathan ;
Boggis, Caroline R. M. ;
Duffy, Stephen W. .
NEW ENGLAND JOURNAL OF MEDICINE, 2008, 359 (16) :1675-1684