Deep Learning for Chest X-ray Diagnosis: Competition Between Radiologists with or Without Artificial Intelligence Assistance

被引:4
作者
Guo, Lili [1 ]
Zhou, Changsheng [2 ]
Xu, Jingxu [3 ]
Huang, Chencui [3 ]
Yu, Yizhou [3 ]
Lu, Guangming [2 ]
机构
[1] Nanjing Med Univ, Affiliated Huaian 1 Peoples Hosp, Dept Radiol, Huaian 223300, Peoples R China
[2] Nanjing Univ, Jinling Hosp, Med Sch, Dept Med Imaging, Nanjing 210002, Peoples R China
[3] Beijing Deepwise & League PHD Technol Co Ltd, Deepwise AI Lab, Beijing 100080, Peoples R China
来源
JOURNAL OF IMAGING INFORMATICS IN MEDICINE | 2024年 / 37卷 / 03期
基金
国家重点研发计划;
关键词
Chest X-ray; Artificial intelligence; Competition; Deep learning; Score; NETWORK;
D O I
10.1007/s10278-024-00990-6
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
This study aimed to assess the performance of a deep learning algorithm in helping radiologist achieve improved efficiency and accuracy in chest radiograph diagnosis. We adopted a deep learning algorithm to concurrently detect the presence of normal findings and 13 different abnormalities in chest radiographs and evaluated its performance in assisting radiologists. Each competing radiologist had to determine the presence or absence of these signs based on the label provided by the AI. The 100 radiographs were randomly divided into two sets for evaluation: one without AI assistance (control group) and one with AI assistance (test group). The accuracy, false-positive rate, false-negative rate, and analysis time of 111 radiologists (29 senior, 32 intermediate, and 50 junior) were evaluated. A radiologist was given an initial score of 14 points for each image read, with 1 point deducted for an incorrect answer and 0 points given for a correct answer. The final score for each doctor was automatically calculated by the backend calculator. We calculated the mean scores of each radiologist in the two groups (the control group and the test group) and calculated the mean scores to evaluate the performance of the radiologists with and without AI assistance. The average score of the 111 radiologists was 597 (587-605) in the control group and 619 (612-626) in the test group (P < 0.001). The time spent by the 111 radiologists on the control and test groups was 3279 (2972-3941) and 1926 (1710-2432) s, respectively (P < 0.001). The performance of the 111 radiologists in the two groups was evaluated by the area under the receiver operating characteristic curve (AUC). The radiologists showed better performance on the test group of radiographs in terms of normal findings, pulmonary fibrosis, heart shadow enlargement, mass, pleural effusion, and pulmonary consolidation recognition, with AUCs of 1.0, 0.950, 0.991, 1.0, 0.993, and 0.982, respectively. The radiologists alone showed better performance in aortic calcification (0.993), calcification (0.933), cavity (0.963), nodule (0.923), pleural thickening (0.957), and rib fracture (0.987) recognition. This competition verified the positive effects of deep learning methods in assisting radiologists in interpreting chest X-rays. AI assistance can help to improve both the efficacy and efficiency of radiologists.
引用
收藏
页码:922 / 934
页数:13
相关论文
共 26 条
[1]   Medical and Legal Implications of MRI Scout Imaging in a Surgical Patient with Case Presentation [J].
Albano, Stephen ;
Ramnot, Ajay ;
Siddiqi, Javed ;
Mahato, Deependra .
CUREUS JOURNAL OF MEDICAL SCIENCE, 2020, 12 (01)
[2]   Automated Triaging of Adult Chest Radiographs with Deep Artificial Neural Networks (vol 291, pg 196, 2019) [J].
Annarumma, Mauro ;
Withey, Samuel J. ;
Bakewell, Robert J. ;
Pesce, Emanuele ;
Goh, Vicky ;
Montana, Giovanni .
RADIOLOGY, 2019, 291 (01) :272-272
[3]   Atlas-based rib-bone detection in chest X-rays [J].
Candemir, Sema ;
Jaeger, Stefan ;
Antani, Sameer ;
Bagci, Ulas ;
Folio, Les R. ;
Xu, Ziyue ;
Thoma, George .
COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2016, 51 :32-39
[4]  
Cao Xue-Fang, 2021, Chronic Dis Transl Med, V7, P35, DOI 10.1016/j.cdtm.2021.02.001
[5]   Successful Identification of Nasopharyngeal Carcinoma in Nasopharyngeal Biopsies Using Deep Learning [J].
Chuang, Wen-Yu ;
Chang, Shang-Hung ;
Yu, Wei-Hsiang ;
Yang, Cheng-Kun ;
Yeh, Chi-Ju ;
Ueng, Shir-Hwa ;
Liu, Yu-Jen ;
Chen, Tai-Di ;
Chen, Kuang-Hua ;
Hsieh, Yi-Yin ;
Hsia, Yi ;
Wang, Tong-Hong ;
Hsueh, Chuen ;
Kuo, Chang-Fu ;
Yeh, Chao-Yuan .
CANCERS, 2020, 12 (02)
[6]   The impact of workplace violence on job satisfaction, job burnout, and turnover intention: the mediating role of social support [J].
Duan, Xiaojian ;
Ni, Xin ;
Shi, Lei ;
Zhang, Leijing ;
Ye, Yuan ;
Mu, Huitong ;
Li, Zhe ;
Liu, Xin ;
Fan, Lihua ;
Wang, Yongchen .
HEALTH AND QUALITY OF LIFE OUTCOMES, 2019, 17 (1)
[7]   How Clinicians Perceive Artificial Intelligence-Assisted Technologies in Diagnostic Decision Making: Mixed Methods Approach [J].
Hah, Hyeyoung ;
Goldin, Deana Shevit .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (12)
[8]  
Huang SC, 2020, NPJ DIGIT MED, V3, DOI 10.1038/s41746-020-0266-y
[9]   Artificial Intelligence and Early Detection of Pancreatic Cancer: 2020 Summative Review [J].
Kenner, Barbara ;
Chari, Suresh T. ;
Kelsen, David ;
Klimstra, David S. ;
Pandol, Stephen J. ;
Rosenthal, Michael ;
Rustgi, Anil K. ;
Taylor, James A. ;
Yala, Adam ;
Abul-Husn, Noura ;
Andersen, Dana K. ;
Bernstein, David ;
Brunak, Soren ;
Canto, Marcia Irene ;
Eldar, Yonina C. ;
Fishman, Elliot K. ;
Fleshman, Julie ;
Go, Vay Liang W. ;
Holt, Jane M. ;
Field, Bruce ;
Goldberg, Ann ;
Hoos, William ;
Iacobuzio-Donahue, Christine ;
Li, Debiao ;
Lidgard, Graham ;
Maitra, Anirban ;
Matrisian, Lynn M. ;
Poblete, Sung ;
Rothschild, Laura ;
Sander, Chris ;
Schwartz, Lawrence H. ;
Shalit, Uri ;
Srivastava, Sudhir ;
Wolpin, Brian .
PANCREAS, 2021, 50 (03) :251-279
[10]   Retrospective analysis of clinical problems concerning acute pancreatitis in one treatment center [J].
Koziel, Dorota ;
Kozlowska, Monika ;
Deneke, Jan ;
Matykiewicz, Jaroslaw ;
Gluszek, Stanislaw .
PRZEGLAD GASTROENTEROLOGICZNY, 2013, 8 (05) :320-326