Pneumothorax detection in chest radiographs: optimizing artificial intelligence system for accuracy and confounding bias reduction using in-image annotations in algorithm training

被引:0
作者
Rueckel, Johannes [1 ]
Huemmer, Christian [2 ]
Fieselmann, Andreas [2 ]
Ghesu, Florin-Cristian [3 ]
Mansoor, Awais [3 ]
Schachtner, Balthasar [1 ,4 ]
Wesp, Philipp [1 ]
Trappmann, Lena [1 ]
Munawwar, Basel [1 ]
Ricke, Jens [1 ]
Ingrisch, Michael [1 ]
Sabel, Bastian O. [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Dept Radiol, Univ Hosp, Marchioninistr 15, D-81377 Munich, Germany
[2] Siemens Healthineers, Xray Prod, Forchheim, Germany
[3] Siemens Healthineers, Digital Technol & Innovat, Princeton, NJ USA
[4] German Ctr Lung Res DZL, Comprehens Pneumol Ctr CPC M, Munich, Germany
关键词
Artificial intelligence; Chest radiography; Pneumothorax; Chest tubes; AREAS; SIZE;
D O I
10.1007/s00330-021-07833-w
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Objectives Diagnostic accuracy of artificial intelligence (AI) pneumothorax (PTX) detection in chest radiographs (CXR) is limited by the noisy annotation quality of public training data and confounding thoracic tubes (TT). We hypothesize that in-image annotations of the dehiscent visceral pleura for algorithm training boosts algorithm's performance and suppresses confounders. Methods Our single-center evaluation cohort of 3062 supine CXRs includes 760 PTX-positive cases with radiological annotations of PTX size and inserted TTs. Three step-by-step improved algorithms (differing in algorithm architecture, training data from public datasets/clinical sites, and in-image annotations included in algorithm training) were characterized by area under the receiver operating characteristics (AUROC) in detailed subgroup analyses and referenced to the well-established "CheXNet" algorithm. Results Performances of established algorithms exclusively trained on publicly available data without in-image annotations are limited to AUROCs of 0.778 and strongly biased towards TTs that can completely eliminate algorithm's discriminative power in individual subgroups. Contrarily, our final "algorithm 2" which was trained on a lower number of images but additionally with in-image annotations of the dehiscent pleura achieved an overall AUROC of 0.877 for unilateral PTX detection with a significantly reduced TT-related confounding bias. Conclusions We demonstrated strong limitations of an established PTX-detecting AI algorithm that can be significantly reduced by designing an AI system capable of learning to both classify and localize PTX. Our results are aimed at drawing attention to the necessity of high-quality in-image localization in training data to reduce the risks of unintentionally biasing the training process of pathology-detecting AI algorithms.
引用
收藏
页码:7888 / 7900
页数:13
相关论文
共 31 条
[1]   Turning a Blind Eye: Explicit Removal of Biases and Variation from Deep Neural Network Embeddings [J].
Alvi, Mohsan ;
Zisserman, Andrew ;
Nellaker, Christoffer .
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 :556-572
[2]  
[Anonymous], 2017, ARXIV171010501CS
[3]  
Ball CG, 2010, CAN J SURG, V53, P184
[4]   Spontaneous pneumothorax: time to rethink management? [J].
Bintcliffe, Oliver J. ;
Hallifax, Rob J. ;
Edey, Anthony ;
Feller-Kopman, David ;
Lee, Y. C. Gary ;
Marquette, Charles H. ;
Tschopp, Jean-Marie ;
West, Douglas ;
Rahman, Najib M. ;
Maskell, Nick A. .
LANCET RESPIRATORY MEDICINE, 2015, 3 (07) :578-588
[5]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845
[6]   Crowdsourcing pneumothorax annotations using machine learning annotations on the NIH chest X-ray dataset [J].
Filice, Ross W. ;
Stein, Anouk ;
Wu, Carol C. ;
Arteaga, Veronica A. ;
Borstelmann, Stephen ;
Gaddikeri, Ramya ;
Galperin-Aizenberg, Maya ;
Gill, Ritu R. ;
Godoy, Myrna C. ;
Hobbs, Stephen B. ;
Jeudy, Jean ;
Lakhani, Paras C. ;
Laroia, Archana ;
Nayak, Sundeep M. ;
Parekh, Maansi R. ;
Prasanna, Prasanth ;
Shah, Palmi ;
Vummidi, Dharshan ;
Yaddanapudi, Kavitha ;
Shih, George .
JOURNAL OF DIGITAL IMAGING, 2020, 33 (02) :490-496
[7]   The Prostate, Lung, Colorectal and Ovarian (PLCO) Cancer Screening Trial of the National Cancer Institute: History, organization, and status [J].
Gohagan, JK ;
Prorok, PC ;
Hayes, RB ;
Kramer, BS .
CONTROLLED CLINICAL TRIALS, 2000, 21 (06) :251S-272S
[8]  
Guendel S., 2019, ARXIV PREPRINT ARXIV
[9]  
Gündel S, 2019, LECT NOTES COMPUT SC, V11401, P757, DOI 10.1007/978-3-030-13469-3_88
[10]   Densely Connected Convolutional Networks [J].
Huang, Gao ;
Liu, Zhuang ;
van der Maaten, Laurens ;
Weinberger, Kilian Q. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269