Learning from the machine: AI assistance is not an effective learning tool for resident education in chest x-ray interpretation

被引:8
作者
Chassagnon, Guillaume [1 ,2 ]
Billet, Nicolas [1 ]
Rutten, Caroline [1 ]
Toussaint, Thibault [1 ]
de Linval, Quentin Cassius [1 ]
Collin, Megane [1 ]
Lemouchi, Leila [1 ]
Homps, Margaux [1 ]
Hedjoudje, Mohamed [1 ]
Ventre, Jeanne [3 ]
Gregory, Jules [2 ,4 ]
Canniff, Emma [1 ]
Regnard, Nor-Eddine [3 ,5 ]
Bennani, Souhail [1 ,3 ]
Revel, Marie-Pierre [1 ,2 ]
机构
[1] Hop Cochin, AP HP, Radiol Dept, 27 Rue Faubourg St Jacques, F-75014 Paris, France
[2] Univ Paris, 27 Rue Faubourg St Jacques,85 Blvd St Germain, F-75006 Paris, France
[3] Gleamer, 117 Quai Valmy, F-75010 Paris, France
[4] Hop Beaujon, Radiol Dept, FHU MOSA, 100 Bd Gen Leclerc, F-92110 Clichy, France
[5] Reseau Imagerie Sud Francilien, 254 Ter Ave Henri Barbusse, F-91210 Draveil, France
关键词
Diagnosis; Computer-Assisted; Radiography; Thoracic; Education; Medical; FEEDBACK;
D O I
10.1007/s00330-023-10043-1
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
ObjectivesTo assess whether a computer-aided detection (CADe) system could serve as a learning tool for radiology residents in chest X-ray (CXR) interpretation.MethodsEight radiology residents were asked to interpret 500 CXRs for the detection of five abnormalities, namely pneumothorax, pleural effusion, alveolar syndrome, lung nodule, and mediastinal mass. After interpreting 150 CXRs, the residents were divided into 2 groups of equivalent performance and experience. Subsequently, group 1 interpreted 200 CXRs from the "intervention dataset" using a CADe as a second reader, while group 2 served as a control by interpreting the same CXRs without the use of CADe. Finally, the 2 groups interpreted another 150 CXRs without the use of CADe. The sensitivity, specificity, and accuracy before, during, and after the intervention were compared.ResultsBefore the intervention, the median individual sensitivity, specificity, and accuracy of the eight radiology residents were 43% (range: 35-57%), 90% (range: 82-96%), and 81% (range: 76-84%), respectively. With the use of CADe, residents from group 1 had a significantly higher overall sensitivity (53% [n = 431/816] vs 43% [n = 349/816], p < 0.001), specificity (94% [i = 3206/3428] vs 90% [n = 3127/3477], p < 0.001), and accuracy (86% [n = 3637/4244] vs 81% [n = 3476/4293], p < 0.001), compared to the control group. After the intervention, there were no significant differences between group 1 and group 2 regarding the overall sensitivity (44% [n = 309/696] vs 46% [n = 317/696], p = 0.666), specificity (90% [n = 2294/2541] vs 90% [n = 2285/2542], p = 0.642), or accuracy (80% [n = 2603/3237] vs 80% [n = 2602/3238], p = 0.955).ConclusionsAlthough it improves radiology residents' performances for interpreting CXRs, a CADe system alone did not appear to be an effective learning tool and should not replace teaching.
引用
收藏
页码:8241 / 8250
页数:10
相关论文
共 21 条
  • [1] Workload for radiologists during on-call hours: dramatic increase in the past 15 years
    Bruls, R. J. M.
    Kwee, R. M.
    [J]. INSIGHTS INTO IMAGING, 2020, 11 (01)
  • [2] Using debriefing and feedback in simulation to improve participant performance: an educator's perspective
    Burns, Claire L.
    [J]. INTERNATIONAL JOURNAL OF MEDICAL EDUCATION, 2015, 6 : 118 - 120
  • [3] Artificial intelligence: from challenges to clinical implementation
    Chassagnon, G.
    Dohan, A.
    [J]. DIAGNOSTIC AND INTERVENTIONAL IMAGING, 2020, 101 (12) : 763 - 764
  • [4] Deep Learning-based Approach for Automated Assessment of Interstitial Lung Disease in Systemic Sclerosis on CT Images
    Chassagnon, Guillaume
    Vakalopoulou, Maria
    Regent, Alexis
    Zacharaki, Evangelia, I
    Aviram, Galit
    Martin, Charlotte
    Marini, Rafael
    Bus, Norbert
    Jerjir, Naim
    Mekinian, Arsene
    Hua-Huy, Thong
    Monnier-Cholley, Laurence
    Benmostefa, Nouria
    Mouthon, Luc
    Dinh-Xuan, Anh-Tuan
    Paragios, Nikos
    Revel, Marie-Pierre
    [J]. RADIOLOGY-ARTIFICIAL INTELLIGENCE, 2020, 2 (04) : 1 - 10
  • [5] AI-driven quantification, staging and outcome prediction of COVID-19 pneumonia
    Chassagnon, Guillaume
    Vakalopoulou, Maria
    Battistella, Enzo
    Christodoulidis, Stergios
    Trieu-Nghi Hoang-Thi
    Dangeard, Severine
    Deutsch, Eric
    Andre, Fabrice
    Guillo, Enora
    Halm, Nara
    El Hajj, Stefany
    Bompard, Florian
    Neveu, Sophie
    Hani, Chahinez
    Saab, Ines
    Campredon, Alienor
    Koulakian, Hasmik
    Bennani, Souhail
    Freche, Gael
    Barat, Maxime
    Lombard, Aurelien
    Fournier, Laure
    Monnier, Hippolyte
    Grand, Teodor
    Gregory, Jules
    Nguyen, Yann
    Khalil, Antoine
    Mahdjoub, Elyas
    Brillet, Pierre-Yves
    Ba, Stephane Tran
    Bousson, Valerie
    Mekki, Ahmed
    Carlier, Robert-Yves
    Revel, Marie-Pierre
    Paragios, Nikos
    [J]. MEDICAL IMAGE ANALYSIS, 2021, 67
  • [6] Artificial intelligence applications for thoracic imaging
    Chassagnon, Guillaume
    Vakalopoulou, Maria
    Paragios, Nikos
    Revel, Marie-Pierre
    [J]. EUROPEAN JOURNAL OF RADIOLOGY, 2020, 123
  • [7] Computer-aided Detection of Lung Cancer on Chest Radiographs: Effect on Observer Performance
    de Hoop, Bartjan
    De Boo, Diederik W.
    Gietema, Hester A.
    van Hoorn, Frans
    Mearadji, Banafsche
    Schijf, Laura
    van Ginneken, Bram
    Prokop, Mathias
    Schaefer-Prokop, Cornelia
    [J]. RADIOLOGY, 2010, 257 (02) : 532 - 540
  • [8] Competency in chest radiography - A comparison of medical students, residents, and fellows
    Eisen, LA
    Berger, JS
    Hegde, A
    Schneider, RF
    [J]. JOURNAL OF GENERAL INTERNAL MEDICINE, 2006, 21 (05) : 460 - 465
  • [9] FEEDBACK IN CLINICAL MEDICAL-EDUCATION
    ENDE, J
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1983, 250 (06): : 777 - 781
  • [10] Radiology residents' skill level in chest x-ray reading
    Fabre, C.
    Proisy, M.
    Chapuis, C.
    Jouneau, S.
    Lentz, P-A
    Meunier, C.
    Mahe, G.
    Lederlin, M.
    [J]. DIAGNOSTIC AND INTERVENTIONAL IMAGING, 2018, 99 (06) : 361 - 370