Utilizing machine learning for early screening of thyroid nodules: a dual-center cross-sectional study in China

被引:0
作者
Weng, Shuwei [1 ,2 ]
Ding, Chen [3 ]
Hu, Die [1 ,2 ]
Chen, Jin [1 ,2 ]
Liu, Yang [4 ]
Liu, Wenwu [1 ,2 ]
Chen, Yang [1 ,2 ]
Guo, Xin [1 ,2 ]
Cao, Chenghui [1 ,2 ]
Yi, Yuting [1 ,2 ]
Yang, Yanyi [5 ,6 ]
Peng, Daoquan [1 ,2 ]
机构
[1] Cent South Univ, Xiangya Hosp 2, Dept Cardiol, Changsha, Hunan, Peoples R China
[2] Res Inst Blood Lipid & Atherosclerosis, Changsha, Hunan, Peoples R China
[3] Soochow Univ, Affiliated Hosp 4, Suzhou Dushu Lake Hosp, Dept Cardiol,Med Ctr, Suzhou, Jiangsu, Peoples R China
[4] Third Mil Med Univ, Xinqiao Hosp,Army Med Univ, Chongqing Clin Res Ctr Kidney & Urol Dis, Dept Nephrol,Key Lab Prevent & Treatment Chron Kid, Chongqing, Peoples R China
[5] Cent South Univ, Xiangya Hosp 2, Hlth Management Ctr, Changsha, Hunan, Peoples R China
[6] Hunan Prov Clin Med Res Ctr Intelligent Management, Changsha, Hunan, Peoples R China
来源
FRONTIERS IN ENDOCRINOLOGY | 2024年 / 15卷
基金
中国国家自然科学基金;
关键词
thyroid nodule; machine learning; early screening; urine iodine; ensemble learning methods; IODINE INTAKE; ASSOCIATION; MANAGEMENT; DIAGNOSIS; AGE;
D O I
10.3389/fendo.2024.1385167
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background Thyroid nodules, increasingly prevalent globally, pose a risk of malignant transformation. Early screening is crucial for management, yet current models focus mainly on ultrasound features. This study explores machine learning for screening using demographic and biochemical indicators.Methods Analyzing data from 6,102 individuals and 61 variables, we identified 17 key variables to construct models using six machine learning classifiers: Logistic Regression, SVM, Multilayer Perceptron, Random Forest, XGBoost, and LightGBM. Performance was evaluated by accuracy, precision, recall, F1 score, specificity, kappa statistic, and AUC, with internal and external validations assessing generalizability. Shapley values determined feature importance, and Decision Curve Analysis evaluated clinical benefits.Results Random Forest showed the highest internal validation accuracy (78.3%) and AUC (89.1%). LightGBM demonstrated robust external validation performance. Key factors included age, gender, and urinary iodine levels, with significant clinical benefits at various thresholds. Clinical benefits were observed across various risk thresholds, particularly in ensemble models.Conclusion Machine learning, particularly ensemble methods, accurately predicts thyroid nodule presence using demographic and biochemical data. This cost-effective strategy offers valuable insights for thyroid health management, aiding in early detection and potentially improving clinical outcomes. These findings enhance our understanding of the key predictors of thyroid nodules and underscore the potential of machine learning in public health applications for early disease screening and prevention.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Development and validation of a new nomogram for self-reported OA based on machine learning: a cross-sectional study
    Chen, Jiexin
    Zheng, Qiongbing
    Lan, Youmian
    Li, Meijing
    Lin, Ling
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [42] Exploring Salivary Iodine Concentration as a Biomarker for Iodine Status and Thyroid Nodules in Females From Different Water Iodine Areas: a Cross-sectional Study
    Zhang, Ying
    Li, Xiuwei
    Wang, Jianqiang
    Ma, Wei
    Wang, Haiyan
    Wang, Jinpeng
    Xu, Jing
    AMERICAN JOURNAL OF CLINICAL NUTRITION, 2024, 120 (01) : 162 - 169
  • [43] Differentiating pathologic parathyroid glands from thyroid nodules on neck ultrasound: the PARATH-US cross-sectional study
    Yazgi, Dolly
    Richa, Carine
    Salenave, Sylvie
    Kamenicky, Peter
    Bourouina, Amel
    Clavier, Lorraine
    Dupeux, Margot
    Papon, Jean-Francois
    Young, Jacques
    Chanson, Philippe
    Maione, Luigi
    LANCET REGIONAL HEALTH-EUROPE, 2023, 35
  • [44] Nutritional Screening and Assessment, and Quality of Life in Children with Cancer: A Cross-Sectional Study in Mainland China
    Ouyang, Na
    Lu, Xia
    Cai, Ruiqing
    Liu, Meiling
    Liu, Ke
    JOURNAL OF PEDIATRIC NURSING-NURSING CARE OF CHILDREN & FAMILIES, 2021, 57 : 99 - 105
  • [45] Prevalence and influences of preschoolers' sedentary behaviors in early learning centers: a cross-sectional study
    Tucker, Patricia
    Vanderloo, Leigh M.
    Burke, Shauna M.
    Irwin, Jennifer D.
    Johnson, Andrew M.
    BMC PEDIATRICS, 2015, 15
  • [46] Prevalence of gestational diabetes mellitus and results of the screening tests at a tertiary referral center: A cross-sectional study
    Ozgu-Erdinc, A. Seval
    Sert, Umit Yasemin
    Buyuk, Gul Nihal
    Engin-Ustun, Yaprak
    DIABETES & METABOLIC SYNDROME-CLINICAL RESEARCH & REVIEWS, 2019, 13 (01) : 74 - 77
  • [47] Prediction of Suicidal Ideation among Korean Adults Using Machine Learning: A Cross-Sectional Study
    Oh, Bumjo
    Yun, Je-Yeon
    Yeo, Eun Chong
    Kim, Dong-Hoi
    Kim, Jin
    Cho, Bum-Joo
    PSYCHIATRY INVESTIGATION, 2020, 17 (04) : 331 - 340
  • [48] Iodine nutrition status and thyroid disorders: a cross-sectional study from the Xinjiang Autonomous Region of China
    Guo, Y.
    Zynat, J.
    Xu, Z.
    Wang, X.
    Osiman, R.
    Zhao, H.
    Tuhuti, A.
    Abdunaimu, M.
    Wang, H.
    Jin, X.
    Xing, S.
    EUROPEAN JOURNAL OF CLINICAL NUTRITION, 2016, 70 (11) : 1332 - 1336
  • [49] Using advanced machine learning algorithms to predict academic major completion: A cross-sectional study
    Kordbagheri, Alireza
    Kordbagheri, Mohammadreza
    Tayim, Natalie
    Fakhrou, Abdulnaser
    Davoudi, Mohammadreza
    Computers in Biology and Medicine, 2025, 184
  • [50] A feasibility study on utilizing machine learning technology to reduce the costs of gastric cancer screening in Taizhou, China
    Yan, Si-yan
    Fu, Xin-yu
    Tang, Shen-Ping
    Qi, Rong-bin
    Liang, Jia-wei
    Mao, Xin-li
    Ye, Li-ping
    Li, Shao-wei
    DIGITAL HEALTH, 2024, 10