Thy-Wise: An interpretable machine learning model for the evaluation of thyroid nodules

被引:16
作者
Jin, Zhe [1 ]
Pei, Shufang [2 ]
Ouyang, Lizhu [3 ]
Zhang, Lu [1 ]
Mo, Xiaokai [1 ]
Chen, Qiuying [1 ]
You, Jingjing [1 ]
Chen, Luyan [1 ]
Zhang, Bin [1 ]
Zhang, Shuixing [1 ]
机构
[1] Jinan Univ, Affiliated Hosp 1, Dept Radiol, Guangzhou, Guangdong, Peoples R China
[2] Guangdong Prov Peoples Hosp, Guangdong Acad Med Sci, Dept Ultrasound, Guangzhou, Guangdong, Peoples R China
[3] Southern Med Univ, Shunde Hosp, Dept Ultrasound, Foshan, Peoples R China
基金
中国博士后科学基金;
关键词
diagnosis; machine learning; random forest; thyroid nodules; ultrasonography; ACR TI-RADS; RISK STRATIFICATION; ARTIFICIAL-INTELLIGENCE; DIAGNOSIS; ULTRASONOGRAPHY; PREDICTIONS; MALIGNANCY; MANAGEMENT; ACCURACY;
D O I
10.1002/ijc.34248
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Current risk stratification systems for thyroid nodules suffer from low specificity and high biopsy rates. Recently, machine learning (ML) is introduced to assist thyroid nodule diagnosis but lacks interpretability. Here, we developed and validated ML models on 3965 thyroid nodules, as compared to the American College of Radiology Thyroid Imaging, Reporting and Data System (ACR TI-RADS). Subsequently, a SHapley Additive exPlanation (SHAP) algorithm was leveraged to interpret the results of the best-performing ML model. Clinical characteristics including thyroid-function tests were collected from medical records. Five ACR TI-RADS ultrasonography (US) categories plus nodule size were assessed by experienced radiologists. Random forest (RF), support vector machine (SVM) and extreme gradient boosting (XGBoost) were used to build US-only and US-clinical ML models. The ML models and ACR TI-RADS were compared in terms of diagnostic performance and unnecessary biopsy rate. Among the ML models, the US-only RF model (hereafter, Thy-Wise) achieved the optimal performance. Compared to ACR TI-RADS, Thy-Wise showed higher accuracy (82.4% vs 74.8% for the internal validation; 82.1% vs 73.4% for external validation) and specificity (78.7% vs 68.3% for internal validation; 78.5% vs 66.9% for external validation) while maintaining sensitivity (91.7% vs 91.2% for internal validation; 91.9% vs 91.1% for external validation), as well as reduced unnecessary biopsies (15.3% vs 32.3% for internal validation; 15.7% vs 47.3% for external validation). The SHAP-based interpretation of Thy-Wise enables clinicians to better understand the reasoning behind the diagnosis, which may facilitate the clinical translation of this model.
引用
收藏
页码:2229 / 2243
页数:15
相关论文
共 45 条
[1]   An Interpretable Prediction Model for Identifying N7-Methylguanosine Sites Based on XGBoost and SHAP [J].
Bi, Yue ;
Xiang, Dongxu ;
Ge, Zongyuan ;
Li, Fuyi ;
Jia, Cangzhi ;
Song, Jiangning .
MOLECULAR THERAPY-NUCLEIC ACIDS, 2020, 22 :362-372
[2]   Both thyroid autoimmunity and increased serum TSH are independent risk factors for malignancy in patients with thyroid nodules [J].
Boi, F. ;
Minerba, L. ;
Lai, M. L. ;
Marziani, B. ;
Figus, B. ;
Spanu, F. ;
Borghero, A. ;
Mariotti, S. .
JOURNAL OF ENDOCRINOLOGICAL INVESTIGATION, 2013, 36 (05) :313-320
[3]   Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission [J].
Caruana, Rich ;
Lou, Yin ;
Gehrke, Johannes ;
Koch, Paul ;
Sturm, Marc ;
Elhadad, Noemie .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :1721-1730
[4]   Performance of Five Ultrasound Risk Stratification Systems in Selecting Thyroid Nodules for FNA [J].
Castellana, Marco ;
Castellana, Carlo ;
Treglia, Giorgio ;
Giorgino, Francesco ;
Giovanella, Luca ;
Russ, Gilles ;
Trimboli, Pierpaolo .
JOURNAL OF CLINICAL ENDOCRINOLOGY & METABOLISM, 2020, 105 (05)
[5]   Biomarkers of thyroid function and autoimmunity for predicting high-risk groups of thyroid cancer: a nested case-control study [J].
Cho, Young Ae ;
Kong, Sun-Young ;
Shin, Aesun ;
Lee, Jeonghee ;
Lee, Eun Kyung ;
Lee, You Jin ;
Kim, Jeongseon .
BMC CANCER, 2014, 14
[6]   Explainable Machine Learning Model for Predicting GI Bleed Mortality in the Intensive Care Unit [J].
Deshmukh, Farah ;
Merchant, Shamel S. .
AMERICAN JOURNAL OF GASTROENTEROLOGY, 2020, 115 (10) :1657-1668
[7]   The Diagnosis and Management of Thyroid Nodules A Review [J].
Durante, Cosimo ;
Grani, Giorgio ;
Lamartina, Livia ;
Filetti, Sebastian ;
Mandel, Susan J. ;
Cooper, David S. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2018, 319 (09) :914-924
[8]   Novel functions of thyroid hormone receptor mutants: Beyond nucleus-initiated transcription [J].
Furuya, Fumihiko ;
Ying, Hao ;
Zhao, Li ;
Cheng, Sheue-yann .
STEROIDS, 2007, 72 (02) :171-179
[9]   Very high prevalence of thyroid nodules detected by high frequency (13 MHz) ultrasound examination [J].
Guth, S. ;
Theune, U. ;
Aberle, J. ;
Galach, A. ;
Bamberger, C. M. .
EUROPEAN JOURNAL OF CLINICAL INVESTIGATION, 2009, 39 (08) :699-706
[10]  
Ha EJ, 2021, ULTRASONOGRAPHY, V40, P23