Prediction of complications in diabetes mellitus using machine learning models with transplanted topic model features

被引:0
|
作者
Han, Benedict Choonghyun [1 ]
Kim, Jimin [2 ]
Choi, Jinwook [3 ,4 ]
机构
[1] Seoul Natl Univ, Interdisciplinary Program Bioengn, 1 Gwanak Ro, Seoul 08826, South Korea
[2] Seoul Natl Univ, English Language & Literature, 1 Gwanak Ro, Seoul 08826, South Korea
[3] Seoul Natl Univ, Coll Med, Dept Biomed Engn, 101 Daehak Ro, Seoul 03080, South Korea
[4] Seoul Natl Univ, Inst Med & Biol Engn, Med Res Ctr, 103 Daehak Ro, Seoul 03080, South Korea
基金
新加坡国家研究基金会;
关键词
Diabetes Mellitus; Latent Dirichlet allocation; Machine learning; Topic modeling;
D O I
10.1007/s13534-023-00322-7
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Purpose: This study aims to predict the progression of Diabetes Mellitus (DM) from the clinical notes through machine learning based on latent Dirichlet allocation (LDA) topic modeling. Particularly, 174,427 clinical notes of DM patients were collected from the electronic medical record (EMR) system of the Seoul National University Hospital outpatient clinic. Method: We developed a model to predict the development of DM complications. Topics developed by the topic model were exploited as the key feature of our machine-learning model. The proposed model generalized a correlation between topic structures and complications. Results: The model provided acceptable predictive performance for all four types of complications (diabetic retinopathy, diabetic nephropathy, nonalcoholic fatty liver disease, and cerebrovascular accident). Upon employing extreme gradient boosting (XGBoost), we obtained the F1 scores of the predictions for each complication type as 0.844, 0.921, 0.831, and 0.762. Conclusion: This study shows that a machine learning project based on topic modeling can effectively predict the progress of a disease. Furthermore, a unique way of topic model transplanting, which matches the dimension of the topic structures of the two data sets, is presented.
引用
收藏
页码:163 / 171
页数:9
相关论文
共 50 条
  • [1] Prediction of complications in diabetes mellitus using machine learning models with transplanted topic model features
    Benedict Choonghyun Han
    Jimin Kim
    Jinwook Choi
    Biomedical Engineering Letters, 2024, 14 : 163 - 171
  • [2] The early prediction of gestational diabetes mellitus by machine learning models
    Kaya, Yeliz
    Butun, Zafer
    Celik, Ozer
    Salik, Ece Akca
    Tahta, Tugba
    Yavuz, Arzu Altun
    BMC PREGNANCY AND CHILDBIRTH, 2024, 24 (01)
  • [3] Cardiovascular complications in a diabetes prediction model using machine learning: a systematic review
    Kee, Ooi Ting
    Harun, Harmiza
    Mustafa, Norlaila
    Murad, Nor Azian Abdul
    Chin, Siok Fong
    Jaafar, Rosmina
    Abdullah, Noraidatulakma
    CARDIOVASCULAR DIABETOLOGY, 2023, 22 (01)
  • [4] Cardiovascular complications in a diabetes prediction model using machine learning: a systematic review
    Ooi Ting Kee
    Harmiza Harun
    Norlaila Mustafa
    Nor Azian Abdul Murad
    Siok Fong Chin
    Rosmina Jaafar
    Noraidatulakma Abdullah
    Cardiovascular Diabetology, 22
  • [5] Multivariable prediction model of complications derived from diabetes mellitus using machine learning on scarce highly unbalanced data
    Colmenares-Mejia, Claudia C.
    Rincon-Acuna, Juan C.
    Cely, Andres
    Gonzalez-Velez, Abel E.
    Castillo, Andrea
    Murcia, Jossie
    Isaza-Ruget, Mario A.
    INTERNATIONAL JOURNAL OF DIABETES IN DEVELOPING COUNTRIES, 2024, 44 (03) : 528 - 538
  • [6] Prediction model for gestational diabetes mellitus using the XG Boost machine learning algorithm
    Hu, Xiaoqi
    Hu, Xiaolin
    Yu, Ya
    Wang, Jia
    FRONTIERS IN ENDOCRINOLOGY, 2023, 14
  • [7] Prediction of Diabetes Mellitus Progression Using Supervised Machine Learning
    Chauhan, Apoorva S.
    Varre, Mathew S.
    Izuora, Kenneth
    Trabia, Mohamed B.
    Dufek, Janet S.
    SENSORS, 2023, 23 (10)
  • [8] An early prediction model for gestational diabetes mellitus created using machine learning algorithms
    Yang, Zhifen
    Shi, Xiaoyue
    Wang, Shengpu
    Du, Lijia
    Zhang, Xiaoying
    Zhang, Kun
    Zhang, Yongqiang
    Ma, Jinlong
    Zheng, Rui
    INTERNATIONAL JOURNAL OF GYNECOLOGY & OBSTETRICS, 2025,
  • [9] Predicting complications of diabetes mellitus using advanced machine learning algorithms
    Ljubic, Branimir
    Hai, Ameen Abdel
    Stanojevic, Marija
    Diaz, Wilson
    Polimac, Daniel
    Pavlovski, Martin
    Obradovic, Zoran
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (09) : 1343 - 1351
  • [10] Predictive models for diabetes mellitus using machine learning techniques
    Lai, Hang
    Huang, Huaxiong
    Keshavjee, Karim
    Guergachi, Aziz
    Gao, Xin
    BMC ENDOCRINE DISORDERS, 2019, 19 (01)