Association of Artificial Intelligence-Aided Chest Radiograph Interpretation With Reader Performance and Efficiency

被引:46
作者
Ahn, Jong Seok [3 ]
Ebrahimian, Shadi [1 ,2 ,4 ]
McDermott, Shaunagh [1 ,2 ]
Lee, Sanghyup [3 ]
Naccarato, Laura [1 ,2 ]
Di Capua, John F. [1 ,2 ]
Wu, Markus Y. [1 ,2 ]
Zhang, Eric W. [1 ,2 ]
Muse, Victorine [1 ,2 ]
Miller, Benjamin [1 ,2 ,5 ]
Sabzalipour, Farid [1 ,2 ,5 ]
Bizzo, Bernardo C. [1 ,2 ,5 ]
Dreyer, Keith J. [1 ,2 ,5 ]
Kaviani, Parisa [1 ,2 ]
Digumarthy, Subba R. [1 ,2 ]
Kalra, Mannudeep K. [1 ,2 ,5 ]
机构
[1] Massachusetts Gen Hosp, Dept Radiol, Div Thorac Imaging, 75 Blossom Ct, Boston, MA 02114 USA
[2] Harvard Med Sch, 75 Blossom Ct, Boston, MA 02114 USA
[3] Lunit Inc, Seoul, South Korea
[4] Icahn Sch Med Mt Sinai, Elmhurst Hosp Ctr, Internal Med, Elmhurst, NY USA
[5] Mass Gen Brigham, Data Sci Off, Boston, MA USA
关键词
EMERGENCY-DEPARTMENT; VARIABILITY; DIAGNOSIS; MEDICINE;
D O I
10.1001/jamanetworkopen.2022.29289
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
IMPORTANCE The efficient and accurate interpretation of radiologic images is paramount. OBJECTIVE To evaluate whether a deep learning-based artificial intelligence (AI) engine used concurrently can improve reader performance and efficiency in interpreting chest radiograph abnormalities. DESIGN, SETTING, AND PARTICIPANTS This multicenter cohort study was conducted from April to November 2021 and involved radiologists, including attending radiologists, thoracic radiology fellows, and residents, who independently participated in 2 observer performance test sessions. The sessions included a reading session with Al and a session without Al, in a randomized crossover manner with a 4-week washout period in between. The Al produced a heat map and the image-level probability of the presence of the referrable lesion. The data used were collected at 2 quaternary academic hospitals in Boston, Massachusetts: Beth Israel Deaconess Medical Center (The Medical Information Mart for Intensive Care Chest X-Ray [MIMIC-CXR]) and Massachusetts General Hospital (MGH). MAIN OUTCOMES AND MEASURES The ground truths for the labels were created via consensual reading by 2 thoracic radiologists. Each reader documented their findings in a customized report template, in which the 4 target chest radiograph findings and the reader confidence of the presence of each finding was recorded. The time taken for reporting each chest radiograph was also recorded. Sensitivity, specificity, and area under the receiver operating characteristic curve (AUROC) were calculated for each target finding. RESULTS A total of 6 radiologists (2 attending radiologists, 2 thoracic radiology fellows, and 2 residents) participated in the study. The study involved a total of 497 frontal chest radiographs-247 from the MIMIC-CXR data set (demographic data for patients were not available) and 250 chest radiographs from MGH (mean [SD] age, 63 [16] years; 133 men [53.2%])-from adult patients with and without 4 target findings (pneumonia, nodule, pneumothorax, and pleural effusion). The target findings were found in 351 of 497 chest radiographs. The Al was associated with higher sensitivity for all findings compared with the readers (nodule, 0.816 [95% CI, 0.732-0.882] vs 0.567 [95% CI, 0.524-0.611]; pneumonia, 0.887 [95% CI, 0.834-0.928] vs 0.673 [95% CI, 0.632-0.714]; pleural effusion, 0.872 [95% CI, 0.808-0.921] vs 0.889 [95% CI, 0.862-0.917]; pneumothorax, 0.988 [95% CI, 0.932-1.000] vs 0.792 [95% CI, 0.756-0.827]). Al-aided interpretation was associated with significantly improved reader sensitivities for all target findings, without negative impacts on the specificity. Overall, the AUROCs of readers improved for all 4 target findings, with significant improvements in detection of pneumothorax and nodule. The reporting time with Al was 10% lower than without Al (40.8 vs 36.9 seconds; difference, 3.9 seconds; 95% CI, 2.9-5.2 seconds; P < .001). CONCLUSIONS AND RELEVANCE These findings suggest that Al-aided interpretation was associated with improved reader performance and efficiency for identifying major thoracic findings on a chest radiograph.
引用
收藏
页数:15
相关论文
共 41 条
  • [11] Effectiveness of Computer-Aided Detection in Community Mammography Practice
    Fenton, Joshua J.
    Abraham, Linn
    Taplin, Stephen H.
    Geller, Berta M.
    Carney, Patricia A.
    D'Orsi, Carl
    Elmore, Joann G.
    Barlow, William E.
    [J]. JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2011, 103 (15) : 1152 - 1161
  • [12] Characterizing mammographic images by using generic texture features
    Haeberle, Lothar
    Wagner, Florian
    Fasching, Peter A.
    Jud, Sebastian M.
    Heusinger, Katharina
    Loehberg, Christian R.
    Hein, Alexander
    Bayer, Christian M.
    Hack, Carolin C.
    Lux, Michael P.
    Binder, Katja
    Elter, Matthias
    Muenzenmayer, Christian
    Schulz-Wendtland, Ruediger
    Meier-Meitinger, Martina
    Adamietz, Boris R.
    Uder, Michael
    Beckmann, Matthias W.
    Wittenberg, Thomas
    [J]. BREAST CANCER RESEARCH, 2012, 14 (02)
  • [13] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [14] An Artificial Intelligence-Based Chest X-ray Model on Human Nodule Detection Accuracy From a Multicenter Study
    Homayounieh, Fatemeh
    Digumarthy, Subba
    Ebrahimian, Shadi
    Rueckel, Johannes
    Hoppe, Boj Friedrich
    Sabel, Bastian Oliver
    Conjeti, Sailesh
    Ridder, Karsten
    Sistermanns, Markus
    Wang, Lei
    Preuhs, Alexander
    Ghesu, Florin
    Mansoor, Awais
    Moghbel, Mateen
    Botwin, Ariel
    Singh, Ramandeep
    Cartmell, Samuel
    Patti, John
    Huemmer, Christian
    Fieselmann, Andreas
    Joerger, Clemens
    Mirshahzadeh, Negar
    Muse, Victorine
    Kalra, Mannudeep
    [J]. JAMA NETWORK OPEN, 2021, 4 (12)
  • [15] Comparison of Baseline, Bone-Subtracted, and Enhanced Chest Radiographs for Detection of Pneumothorax
    Homayounieh, Fatemeh
    Digumarthy, Subba R.
    Febbo, Jennifer A.
    Garrana, Sherief
    Nitiwarangkul, Chayanin
    Singh, Ramandeep
    Khera, Ruhani Doda
    Gilman, Matthew
    Kalra, Mannudeep K.
    [J]. CANADIAN ASSOCIATION OF RADIOLOGISTS JOURNAL-JOURNAL DE L ASSOCIATION CANADIENNE DES RADIOLOGISTES, 2021, 72 (03): : 519 - 524
  • [16] Deep Learning for Detecting Pneumothorax on Chest Radiographs after Needle Biopsy: Clinical Implementation
    Hong, Wonju
    Hwang, Eui Jin
    Lee, Jong Hyuk
    Park, Jongsoo
    Goo, Jin Mo
    Park, Chang Min
    [J]. RADIOLOGY, 2022, 303 (02)
  • [17] Deep learning algorithm for surveillance of pneumothorax after lung biopsy: a multicenter diagnostic cohort study
    Hwang, Eui Jin
    Hong, Jung Hee
    Lee, Kyung Hee
    Kim, Jung Im
    Nam, Ju Gang
    Kim, Da Som
    Choi, Hyewon
    Yoo, Seung Jin
    Goo, Jin Mo
    Park, Chang Min
    [J]. EUROPEAN RADIOLOGY, 2020, 30 (07) : 3660 - 3671
  • [18] Development and Validation of a Deep Learning-Based Automated Detection Algorithm for Major Thoracic Diseases on Chest Radiographs
    Hwang, Eui Jin
    Park, Sunggyun
    Jin, Kwang-Nam
    Kim, Jung Im
    Choi, So Young
    Lee, Jong Hyuk
    Goo, Jin Mo
    Aum, Jaehong
    Yim, Jae-Joon
    Cohen, Julien G.
    Ferretti, Gilbert R.
    Park, Chang Min
    Kim, Dong Hyeon
    Woo, Sungmin
    Choi, Wonseok
    Hwang, In Pyung
    Song, Yong Sub
    Lim, Jiyeon
    Kim, Hyungjin
    Wi, Jae Yeon
    Oh, Su Suk
    Kang, Mi-Jin
    Lee, Nyoung Keun
    Yoo, Jin Young
    Suh, Young Joo
    [J]. JAMA NETWORK OPEN, 2019, 2 (03) : e191095
  • [19] Intraobserver and interobserver agreement of the interpretation of pediatric chest radiographs
    Johnson J.
    Kline J.A.
    [J]. Emergency Radiology, 2010, 17 (4) : 285 - 290
  • [20] Kim M, 2020, Arxiv, DOI arXiv:2007.07506