Utilizing machine learning for early screening of thyroid nodules: a dual-center cross-sectional study in China

被引:0
作者
Weng, Shuwei [1 ,2 ]
Ding, Chen [3 ]
Hu, Die [1 ,2 ]
Chen, Jin [1 ,2 ]
Liu, Yang [4 ]
Liu, Wenwu [1 ,2 ]
Chen, Yang [1 ,2 ]
Guo, Xin [1 ,2 ]
Cao, Chenghui [1 ,2 ]
Yi, Yuting [1 ,2 ]
Yang, Yanyi [5 ,6 ]
Peng, Daoquan [1 ,2 ]
机构
[1] Cent South Univ, Xiangya Hosp 2, Dept Cardiol, Changsha, Hunan, Peoples R China
[2] Res Inst Blood Lipid & Atherosclerosis, Changsha, Hunan, Peoples R China
[3] Soochow Univ, Affiliated Hosp 4, Suzhou Dushu Lake Hosp, Dept Cardiol,Med Ctr, Suzhou, Jiangsu, Peoples R China
[4] Third Mil Med Univ, Xinqiao Hosp,Army Med Univ, Chongqing Clin Res Ctr Kidney & Urol Dis, Dept Nephrol,Key Lab Prevent & Treatment Chron Kid, Chongqing, Peoples R China
[5] Cent South Univ, Xiangya Hosp 2, Hlth Management Ctr, Changsha, Hunan, Peoples R China
[6] Hunan Prov Clin Med Res Ctr Intelligent Management, Changsha, Hunan, Peoples R China
来源
FRONTIERS IN ENDOCRINOLOGY | 2024年 / 15卷
基金
中国国家自然科学基金;
关键词
thyroid nodule; machine learning; early screening; urine iodine; ensemble learning methods; IODINE INTAKE; ASSOCIATION; MANAGEMENT; DIAGNOSIS; AGE;
D O I
10.3389/fendo.2024.1385167
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background Thyroid nodules, increasingly prevalent globally, pose a risk of malignant transformation. Early screening is crucial for management, yet current models focus mainly on ultrasound features. This study explores machine learning for screening using demographic and biochemical indicators.Methods Analyzing data from 6,102 individuals and 61 variables, we identified 17 key variables to construct models using six machine learning classifiers: Logistic Regression, SVM, Multilayer Perceptron, Random Forest, XGBoost, and LightGBM. Performance was evaluated by accuracy, precision, recall, F1 score, specificity, kappa statistic, and AUC, with internal and external validations assessing generalizability. Shapley values determined feature importance, and Decision Curve Analysis evaluated clinical benefits.Results Random Forest showed the highest internal validation accuracy (78.3%) and AUC (89.1%). LightGBM demonstrated robust external validation performance. Key factors included age, gender, and urinary iodine levels, with significant clinical benefits at various thresholds. Clinical benefits were observed across various risk thresholds, particularly in ensemble models.Conclusion Machine learning, particularly ensemble methods, accurately predicts thyroid nodule presence using demographic and biochemical data. This cost-effective strategy offers valuable insights for thyroid health management, aiding in early detection and potentially improving clinical outcomes. These findings enhance our understanding of the key predictors of thyroid nodules and underscore the potential of machine learning in public health applications for early disease screening and prevention.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] The Association of Thyroid Nodules with Metabolic Status: A Cross-Sectional SPECT-China Study
    Chen, Yi
    Zhu, Chunfang
    Chen, Yingchao
    Wang, Ningjian
    Li, Qin
    Han, Bing
    Zhao, Li
    Chen, Chi
    Zhai, Hualing
    Lu, Yingli
    INTERNATIONAL JOURNAL OF ENDOCRINOLOGY, 2018, 2018
  • [2] Association of adiposity with thyroid nodules: a cross-sectional study of a healthy population in Beijing, China
    Yang, Hui-xia
    Zhong, Yu
    Lv, Wei-hua
    Zhang, Feng
    Yu, Hong
    BMC ENDOCRINE DISORDERS, 2019, 19 (01)
  • [3] The Effect of Iodine Status on the Risk of Thyroid Nodules: A Cross-Sectional Study in Zhejiang, China
    Lou, Xiaoming
    Wang, Xiaofeng
    Wang, Zhifang
    Mao, Guangming
    Zhu, Wenming
    Wang, Yuanyang
    Pan, Xuejiao
    Chen, Zhijian
    Mo, Zhe
    INTERNATIONAL JOURNAL OF ENDOCRINOLOGY, 2020, 2020
  • [4] Correlation Analysis of Breast and Thyroid Nodules: A Cross-Sectional Study
    Chen, Jingtai
    Xu, Zhou
    Hou, Lingmi
    Tang, Yunhui
    Qian, Shuangqiang
    Pu, Hongyu
    Tang, Juan
    Gao, Yanchun
    INTERNATIONAL JOURNAL OF GENERAL MEDICINE, 2021, 14 : 3999 - 4010
  • [5] Thyroid Screening During Early Pregnancy and the Need for Trimester Specific Reference Ranges: A Cross-Sectional Study in Lahore, Pakistan
    Talat, Afnan
    Khan, Aleena A.
    Nasreen, Samia
    Wass, John A.
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2019, 11 (09)
  • [6] Prevalence of Thyroid Nodules and Thyroid Cancer in Individuals with a First-Degree Family History of Non-Medullary Thyroid Cancer: A Cross-Sectional Study Based on Sonographic Screening
    Grani, Giorgio
    Lamartina, Livia
    Montesano, Teresa
    Giacomelli, Laura
    Biffoni, Marco
    Trulli, Fabiana
    Filetti, Sebastiano
    Durante, Cosimo
    THYROID, 2022, 32 (11) : 1392 - 1401
  • [7] The Prevalence of Single and Multiple Thyroid Nodules and Its Association with Metabolic Diseases in Chinese: A Cross-Sectional Study
    Zou, Bing
    Sun, Li
    Wang, Xin
    Chen, Zongtao
    INTERNATIONAL JOURNAL OF ENDOCRINOLOGY, 2020, 2020
  • [8] Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China
    Ma, Han
    Xu, Cheng-fu
    Shen, Zhe
    Yu, Chao-hui
    Li, You-ming
    BIOMED RESEARCH INTERNATIONAL, 2018, 2018
  • [9] Machine learning for early diagnosis of Kawasaki disease in acute febrile children: retrospective cross-sectional study in China
    Zheng, Wei
    Zhu, Shiben
    Wang, Xuelian
    Chen, Cuixuan
    Zhen, Zifeng
    Xu, Yi
    Mo, Xiaolan
    Tse, Gary
    Li, Xufang
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [10] Machine learning for screening and predicting the availability of medications for children: a cross-sectional survey study
    Guo, Jing-yan
    FRONTIERS IN PEDIATRICS, 2024, 12