Utilizing machine learning for early screening of thyroid nodules: a dual-center cross-sectional study in China

被引:0
作者
Weng, Shuwei [1 ,2 ]
Ding, Chen [3 ]
Hu, Die [1 ,2 ]
Chen, Jin [1 ,2 ]
Liu, Yang [4 ]
Liu, Wenwu [1 ,2 ]
Chen, Yang [1 ,2 ]
Guo, Xin [1 ,2 ]
Cao, Chenghui [1 ,2 ]
Yi, Yuting [1 ,2 ]
Yang, Yanyi [5 ,6 ]
Peng, Daoquan [1 ,2 ]
机构
[1] Cent South Univ, Xiangya Hosp 2, Dept Cardiol, Changsha, Hunan, Peoples R China
[2] Res Inst Blood Lipid & Atherosclerosis, Changsha, Hunan, Peoples R China
[3] Soochow Univ, Affiliated Hosp 4, Suzhou Dushu Lake Hosp, Dept Cardiol,Med Ctr, Suzhou, Jiangsu, Peoples R China
[4] Third Mil Med Univ, Xinqiao Hosp,Army Med Univ, Chongqing Clin Res Ctr Kidney & Urol Dis, Dept Nephrol,Key Lab Prevent & Treatment Chron Kid, Chongqing, Peoples R China
[5] Cent South Univ, Xiangya Hosp 2, Hlth Management Ctr, Changsha, Hunan, Peoples R China
[6] Hunan Prov Clin Med Res Ctr Intelligent Management, Changsha, Hunan, Peoples R China
来源
FRONTIERS IN ENDOCRINOLOGY | 2024年 / 15卷
基金
中国国家自然科学基金;
关键词
thyroid nodule; machine learning; early screening; urine iodine; ensemble learning methods; IODINE INTAKE; ASSOCIATION; MANAGEMENT; DIAGNOSIS; AGE;
D O I
10.3389/fendo.2024.1385167
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background Thyroid nodules, increasingly prevalent globally, pose a risk of malignant transformation. Early screening is crucial for management, yet current models focus mainly on ultrasound features. This study explores machine learning for screening using demographic and biochemical indicators.Methods Analyzing data from 6,102 individuals and 61 variables, we identified 17 key variables to construct models using six machine learning classifiers: Logistic Regression, SVM, Multilayer Perceptron, Random Forest, XGBoost, and LightGBM. Performance was evaluated by accuracy, precision, recall, F1 score, specificity, kappa statistic, and AUC, with internal and external validations assessing generalizability. Shapley values determined feature importance, and Decision Curve Analysis evaluated clinical benefits.Results Random Forest showed the highest internal validation accuracy (78.3%) and AUC (89.1%). LightGBM demonstrated robust external validation performance. Key factors included age, gender, and urinary iodine levels, with significant clinical benefits at various thresholds. Clinical benefits were observed across various risk thresholds, particularly in ensemble models.Conclusion Machine learning, particularly ensemble methods, accurately predicts thyroid nodule presence using demographic and biochemical data. This cost-effective strategy offers valuable insights for thyroid health management, aiding in early detection and potentially improving clinical outcomes. These findings enhance our understanding of the key predictors of thyroid nodules and underscore the potential of machine learning in public health applications for early disease screening and prevention.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Risk factors associated with the prevalence of thyroid nodules in adults in Northeast China: a cross-sectional population-based study
    Yan, Yudie
    Dong, Junhe
    Li, Shufeng
    Yang, Guochun
    Huang, Kunbo
    Tian, Wen
    Su, Jingtong
    Zhang, Zhen
    BMJ OPEN, 2023, 13 (10):
  • [22] Adiposity and the risk of thyroid nodules with a high-suspicion sonographic pattern: a large cross-sectional epidemiological study
    Lai, Xingjian
    Zhang, Bo
    Wang, Yong
    Jiang, Yuxin
    Li, Jianchu
    Gao, Luying
    Wang, Ying
    JOURNAL OF THORACIC DISEASE, 2019, 11 (12) : 5014 - 5022
  • [23] Prevalence and Associated Factors of Thyroid Nodules Among 52,003 Chinese 'Healthy' Individuals in Beijing: A Retrospective Cross-Sectional Study
    Xu, Jin
    Lau, Phyllis
    Ma, Yong
    Zhao, Na
    Yu, Xuezhong
    Zhu, Huadong
    Li, Yi
    RISK MANAGEMENT AND HEALTHCARE POLICY, 2024, 17 : 181 - 189
  • [24] Prevalence and risk factors of thyroid nodules in breast cancer women with different clinicopathological characteristics: a cross-sectional study
    Ma, Chen-yu
    Liang, Xin-yu
    Ran, Liang
    Hu, Lei
    Zeng, Fan-ling
    She, Rui-ling
    Feng, Jun-han
    Jiang, Zhi-yu
    Li, Zhao-xing
    Qu, Xiu-quan
    Peng, Bai-qing
    Wu, Kai-nan
    Kong, Ling-quan
    CLINICAL & TRANSLATIONAL ONCOLOGY, 2024, 26 (09) : 2380 - 2387
  • [25] Iodine Nutrition and the Prevalence Status of Thyroid Nodules in the Population: a Cross-sectional Survey in Heilongjiang Province, China
    Tian, Chunyuan
    Bu, Ye
    Ji, Chunlei
    Shi, Mengqi
    Zhang, Liwei
    Kong, Dejiao
    Dong, Xiaoqiu
    Liu, Ying
    BIOLOGICAL TRACE ELEMENT RESEARCH, 2021, 199 (09) : 3181 - 3189
  • [26] Diagnosing growing pains in children by using machine learning: a cross-sectional multicenter study
    Akal, Fuat
    Batu, Ezgi D.
    Sonmez, Hafize Emine
    Karadag, Serife G.
    Demir, Ferhat
    Ayaz, Nuray Aktay
    Sozeri, Betul
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (12) : 3601 - 3614
  • [27] Diagnosing growing pains in children by using machine learning: a cross-sectional multicenter study
    Fuat Akal
    Ezgi D. Batu
    Hafize Emine Sonmez
    Şerife G. Karadağ
    Ferhat Demir
    Nuray Aktay Ayaz
    Betül Sözeri
    Medical & Biological Engineering & Computing, 2022, 60 : 3601 - 3614
  • [28] Prevalence and associated metabolic factors for thyroid nodules: a cross-sectional study in Southwest of China with more than 120 thousand populations
    Xu, Li
    Zeng, Fanling
    Wang, Yutong
    Bai, Ye
    Shan, Xuefeng
    Kong, Lingxi
    BMC ENDOCRINE DISORDERS, 2021, 21 (01)
  • [29] Machine learning algorithms to predict mild cognitive impairment in older adults in China: A cross-sectional study
    Song, Yanliqing
    Yuan, Quan
    Liu, Haoqiang
    Gu, KeNan
    Liu, Yue
    JOURNAL OF AFFECTIVE DISORDERS, 2025, 368 : 117 - 126
  • [30] Knowledge, attitude, and practice towards thyroid nodules and cancer among patients: a cross-sectional study
    Li, Wei
    Deng, Jian
    Xiong, Wei
    Zhong, Yangyan
    Cao, Hong
    Jiang, Guoqin
    FRONTIERS IN PUBLIC HEALTH, 2023, 11