Artificial Intelligence Model Assisting Thyroid Nodule Diagnosis and Management: A Multicenter Diagnostic Study

被引:14
作者
Ha, Eun Ju [1 ]
Lee, Jeong Hoon [1 ]
Lee, Da Hyun [1 ]
Moon, Jayoung [1 ]
Lee, Haein [1 ]
Kim, You Na [1 ]
Kim, Minji [1 ]
Na, Dong Gyu [2 ]
Kim, Ji-hoon [3 ,4 ]
机构
[1] Ajou Univ, Dept Radiol, Sch Med, Suwon 16499, South Korea
[2] Univ Ulsan, GangNeung Asan Hosp, Coll Med, Dept Radiol, Gangneung si 25440, Gangwon Do, South Korea
[3] Seoul Natl Univ, Seoul Natl Univ Hosp, Dept Radiol, Coll Med, Seoul 03080, South Korea
[4] Seoul Natl Univ, Seoul Natl Univ Hosp, Dept Radiol, Coll Med,Unit 11610, 101 Daehak Ro, Seoul 03080, South Korea
基金
新加坡国家研究基金会;
关键词
artificial intelligence; thyroid nodule; thyroid cancer; ultrasonography; multicenter diagnostic study; RADIOLOGY CONSENSUS STATEMENT; IMAGING-BASED MANAGEMENT; DATA SYSTEM; KOREAN SOCIETY;
D O I
10.1210/clinem/dgad503
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Context: It is not clear how to integrate artificial intelligence (AI)-based models into diagnostic workflows. Objective: To develop and validate a deep-learning-based AI model (AI-Thyroid) for thyroid cancer diagnosis, and to explore how this improves diagnostic performance.Methods: The system was trained using 19 711 images of 6163 patients in a tertiary hospital (Ajou University Medical Center; AUMC). It was validated using 11 185 images of 4820 patients in 24 hospitals (test set 1) and 4490 images of 2367 patients in AUMC (test set 2). The clinical implications were determined by comparing the findings of six physicians with different levels of experience (group 1: 4 trainees, and group 2: 2 faculty radiologists) before and after AI-Thyroid assistance.Results: The area under the receiver operating characteristic (AUROC) curve of AI-Thyroid was 0.939. The AUROC, sensitivity, and specificity were 0.922, 87.0%, and 81.5% for test set 1 and 0.938, 89.9%, and 81.6% for test set 2. The AUROCs of AI-Thyroid did not differ significantly according to the prevalence of malignancies (>15.0% vs =15.0%, P = .226). In the simulated scenario, AI-Thyroid assistance changed the AUROC, sensitivity, and specificity from 0.854 to 0.945, from 84.2% to 92.7%, and from 72.9% to 86.6% (all P < .001) in group 1, and from 0.914 to 0.939 (P = .022), from 78.6% to 85.5% (P = .053) and from 91.9% to 92.5% (P = .683) in group 2. The interobserver agreement improved from moderate to substantial in both groups.Conclusion: AI-Thyroid can improve diagnostic performance and interobserver agreement in thyroid cancer diagnosis, especially in less-experienced physicians.
引用
收藏
页码:527 / 535
页数:9
相关论文
共 22 条
  • [1] Artificial Intelligence in Thyroid Field-A Comprehensive Review
    Bini, Fabiano
    Pica, Andrada
    Azzimonti, Laura
    Giusti, Alessandro
    Ruinelli, Lorenzo
    Marinozzi, Franco
    Trimboli, Pierpaolo
    [J]. CANCERS, 2021, 13 (19)
  • [2] The average receiver operating characteristic curve in multireader multicase imaging studies
    Chen, W.
    Samuelson, F. W.
    [J]. BRITISH JOURNAL OF RADIOLOGY, 2014, 87 (1040)
  • [3] A review of medical image data augmentation techniques for deep learning applications
    Chlap, Phillip
    Min, Hang
    Vandenberg, Nym
    Dowling, Jason
    Holloway, Lois
    Haworth, Annette
    [J]. JOURNAL OF MEDICAL IMAGING AND RADIATION ONCOLOGY, 2021, 65 (05) : 545 - 563
  • [4] Interobserver and Intraobserver Variations in Ultrasound Assessment of Thyroid Nodules
    Choi, Seon Hyeong
    Kim, Eun-Kyung
    Kwak, Jin Young
    Kim, Min Jung
    Son, Eun Ju
    [J]. THYROID, 2010, 20 (02) : 167 - 172
  • [5] Diagnostic Performance of the Modified Korean Thyroid Imaging Reporting and Data System for Thyroid Malignancy: A Multicenter Validation Study
    Chung, Sae Rom
    Ahn, Hye Shin
    Choi, Young Jun
    Lee, Ji Ye
    Yoo, Roh-Eul
    Lee, Yoo Jin
    Kim, Jee Young
    Sung, Jin Yong
    Kim, Ji-Hoon
    Baek, Jung Hwan
    [J]. KOREAN JOURNAL OF RADIOLOGY, 2021, 22 (09) : 1579 - 1586
  • [6] Cibas Edmund S, 2017, J Am Soc Cytopathol, V6, P217, DOI 10.1016/j.jasc.2017.09.002
  • [7] Generalized Roe and Metz receiver operating characteristic model: analytic link between simulated decision scores and empirical AUC variances and covariances
    Gallas, Brandon D.
    Hillis, Stephen L.
    [J]. JOURNAL OF MEDICAL IMAGING, 2014, 1 (03)
  • [8] Deep learning based classification of ultrasound images for thyroid nodules: a large scale of pilot study
    Guan, Qing
    Wang, Yunjun
    Du, Jiajun
    Qin, Yu
    Lu, Hongtao
    Xiang, Jun
    Wang, Fen
    [J]. ANNALS OF TRANSLATIONAL MEDICINE, 2019, 7 (07)
  • [9] Development of a machine learning-based fine-grained risk stratification system for thyroid nodules using predefined clinicoradiological features
    Ha, Eun Ju
    Lee, Jeong Hoon
    Lee, Da Hyun
    Na, Dong Gyu
    Kim, Ji-hoon
    [J]. EUROPEAN RADIOLOGY, 2023, 33 (05) : 3211 - 3221
  • [10] 2021 Korean Thyroid Imaging Reporting and Data System and Imaging-Based Management of Thyroid Nodules: Korean Society of Thyroid Radiology Consensus Statement and Recommendations
    Ha, Eun Ju
    Chung, Sae Rom
    Na, Dong Gyu
    Ahn, Hye Shin
    Chung, Jin
    Lee, Ji Ye
    Park, Jeong Seon
    Yoo, Roh-Eul
    Baek, Jung Hwan
    Baek, Sun Mi
    Cho, Seong Whi
    Choi, Yoon Jung
    Hahn, Soo Yeon
    Jung, So Lyung
    Kim, Ji-Hoon
    Kim, Seul Kee
    Kim, Soo Jin
    Lee, Chang Yoon
    Lee, Ho Kyu
    Lee, Jeong Hyun
    Lee, Young Hen
    Lim, Hyun Kyung
    Shin, Jung Hee
    Sim, Jung Suk
    Sung, Jin Young
    Yoon, Jung Hyun
    Choi, Miyoung
    [J]. KOREAN JOURNAL OF RADIOLOGY, 2021, 22 (12) : 2094 - 2123