LSTM-Based Prediction Model for Tuberculosis Among HIV-Infected Patients Using Structured Electronic Medical Records: A Retrospective Machine Learning Study

被引:1
|
作者
Chen, Jingfang [1 ,2 ]
Liu, Linlin [3 ]
Huang, Junxiong [1 ]
Jiang, Youli [4 ]
Yin, Chengliang [1 ]
Zhang, Lukun [5 ]
Li, Zhihuan [1 ]
Lu, Hongzhou [1 ,5 ]
机构
[1] Macau Univ Sci & Technol, Fac Med, Taipa 999078, Macau, Peoples R China
[2] Third Peoples Hosp Shenzhen, Dept Res & Teaching, Shenzhen 518112, Peoples R China
[3] Univ South China, Sch Nursing, Hengyang Med Sch, Hengyang 421001, Peoples R China
[4] Peoples Hosp Longhua, Dept Neurol, Shenzhen 518109, Peoples R China
[5] Third Peoples Hosp Shenzhen, Natl Clin Res Ctr Infect Dis, Dept Infect Dis, Shenzhen 518112, Peoples R China
来源
JOURNAL OF MULTIDISCIPLINARY HEALTHCARE | 2024年 / 17卷
关键词
Prediction models; HIV; Tuberculosis; Machine Learning; Artificial Intelligence; CHINA;
D O I
10.2147/JMDH.S467877
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Both HIV and TB are chronic infectious diseases requiring long-term treatment and follow-up, resulting in extensive electronic medical records. With the exponential growth of health and medical big data, effectively extracting and analyzing these data has become the research hotspot. As a fundamental aspect of artificial intelligence, machine learning has been extensively applied in medical research, encompassing diagnosis, treatment, patient monitoring, drug development, and epidemiological investigations. This significantly enhances medical information systems and facilitates the interoperability of medical data. Methods: In our study, we analyzed longitudinal data from the electronic health records of 4540 patients, gathered from the National Clinical Research Center for Infectious Diseases in Shenzhen, China, spanning from 2017 to 2021. Initially, we employed the finetuned ChatGLM to structure the electronic medical records. Subsequently, we utilized a multi-layer perceptron to classify each patient and determined the presence of tuberculosis in HIV patients. Using machine learning-based natural language processing, we structured these records to build a specialized database for HIV and TB co-infection. We studied the epidemiological characteristics, focusing on incidence patterns, patient characteristics, and influencing factors, to uncover the transmission characteristics of these diseases in Shenzhen. Additionally, we used Long Short-Term Memory to create a predictive model for TB co-infection among HIV patients, based on their medical records. This model predicted the risk of TB co-infection, providing scientific evidence for clinical decision- making and enabling early detection and precise intervention. Results: Based on the refined ChatGLM model tailored for structured electronic health records, the accuracy of symptom extraction consistently surpassed 0.95 precision. Key symptoms such as diarrhea and normal showed precision rates exceeding 0.90. High scores were also achieved in recall and F1 scores. Among 4540 HIV patients, 758 were diagnosed with concurrent tuberculosis, indicating a 16.7% co-infection rate, while syphilis co-infection affected 25.1%, underscoring the prevalence of concurrent infections among HIV patients. Utilizing electronic health records, a Multilayer Perceptron classifier was developed as a benchmark against Long Short-Term Memory to predict high-risk groups for HIV and tuberculosis co-infections. The Multilayer Perceptron classifier demonstrated predictive ability with AUROC values ranging from 0.616 to 0.682 on the test set, suggesting opportunities for further optimization and generalization despite its accuracy in identifying HIV-TB co-infections. In tuberculosis intelligent diagnosis based on laboratory results, the Long Short-Term Memory showed consistent performance across 5-fold cross-validation, with AUROC values ranging from 0.827 to 0.850, indicating reliability and consistency in tuberculosis prediction. Furthermore, by optimizing classification thresholds, the model achieved an overall accuracy of 81.18% in distinguishing HIV co-infected tuberculosis from simple HIV infection. Conclusion: Combining the Multilayer Perceptron classifier with Long Short-Term Memory represented an advanced approach for effectively extracting electronic health records and utilizing it for disease prediction. This underscored the superior performance of deep learning techniques in managing both structured and unstructured medical data. Models leveraging laboratory time-series data demonstrated notably better performance compared to those relying solely on electronic health records for predicting tuberculosis incidence. This emphasized the benefits of deep learning in handling intricate medical data and provided valuable insights for healthcare providers exploring the use of deep learning in disease prediction and management.
引用
收藏
页码:3557 / 3573
页数:17
相关论文
共 31 条
  • [1] Prediction of the risk of cytopenia in hospitalized HIV/AIDS patients using machine learning methods based on electronic medical records
    Huang, Liling
    Xie, Bo
    Zhang, Kai
    Xu, Yuanlong
    Su, Lingsong
    Lv, Yu
    Lu, Yangjie
    Qin, Jianqiu
    Pang, Xianwu
    Qiu, Hong
    Li, Lanxiang
    Wei, Xihua
    Huang, Kui
    Meng, Zhihao
    Hu, Yanling
    Lv, Jiannan
    FRONTIERS IN PUBLIC HEALTH, 2023, 11
  • [2] An Application of Machine Learning to Etiological Diagnosis of Secondary Hypertension: Retrospective Study Using Electronic Medical Records
    Diao, Xiaolin
    Huo, Yanni
    Yan, Zhanzheng
    Wang, Haibin
    Yuan, Jing
    Wang, Yuxin
    Cai, Jun
    Zhao, Wei
    JMIR MEDICAL INFORMATICS, 2021, 9 (01)
  • [3] Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry
    Gupta, Sunil
    Truyen Tran
    Luo, Wei
    Dinh Phung
    Kennedy, Richard Lee
    Broad, Adam
    Campbell, David
    Kipp, David
    Singh, Madhu
    Khasraw, Mustafa
    Matheson, Leigh
    Ashley, David M.
    Venkatesh, Svetha
    BMJ OPEN, 2014, 4 (03):
  • [4] Relationship between chest radiographic patterns of tuberculosis and CD4+ cell count among HIV-infected patients at Katsina, Nigeria: A retrospective study
    Tahir, Abdulrahman
    Sani, Tijjani
    Gwalabe, Sabiu Abdu
    HIV & AIDS REVIEW, 2016, 15 (04): : 177 - 179
  • [5] Machine Learning Model for Readmission Prediction of Patients With Heart Failure Based on Electronic Health Records: Protocol for a Quasi-Experimental Study for Impact Assessment
    Nair, Monika
    Lundgren, Lina E.
    Soliman, Amira
    Dryselius, Petra
    Fogelberg, Ebba
    Petersson, Marcus
    Hamed, Omar
    Triantafyllou, Miltiadis
    Nygren, Jens
    JMIR RESEARCH PROTOCOLS, 2024, 13
  • [6] Prediction of 3-year risk of diabetic kidney disease using machine learning based on electronic medical records
    Zheyi Dong
    Qian Wang
    Yujing Ke
    Weiguang Zhang
    Quan Hong
    Chao Liu
    Xiaomin Liu
    Jian Yang
    Yue Xi
    Jinlong Shi
    Li Zhang
    Ying Zheng
    Qiang Lv
    Yong Wang
    Jie Wu
    Xuefeng Sun
    Guangyan Cai
    Shen Qiao
    Chengliang Yin
    Shibin Su
    Xiangmei Chen
    Journal of Translational Medicine, 20
  • [7] Prediction of 3-year risk of diabetic kidney disease using machine learning based on electronic medical records
    Dong, Zheyi
    Wang, Qian
    Ke, Yujing
    Zhang, Weiguang
    Hong, Quan
    Liu, Chao
    Liu, Xiaomin
    Yang, Jian
    Xi, Yue
    Shi, Jinlong
    Zhang, Li
    Zheng, Ying
    Lv, Qiang
    Wang, Yong
    Wu, Jie
    Sun, Xuefeng
    Cai, Guangyan
    Qiao, Shen
    Yin, Chengliang
    Su, Shibin
    Chen, Xiangmei
    JOURNAL OF TRANSLATIONAL MEDICINE, 2022, 20 (01)
  • [8] Prediction Model for 30-Day Mortality after Non-Cardiac Surgery Using Machine-Learning Techniques Based on Preoperative Evaluation of Electronic Medical Records
    Choi, Byungjin
    Oh, Ah Ran
    Lee, Seung-Hwa
    Lee, Dong Yun
    Lee, Jong-Hwan
    Yang, Kwangmo
    Kim, Ha Yeon
    Park, Rae Woong
    Park, Jungchan
    JOURNAL OF CLINICAL MEDICINE, 2022, 11 (21)
  • [9] A machine learning-based prediction model for postoperative delirium in cardiac valve surgery using electronic health records
    Li, Qiuying
    Li, Jiaxin
    Chen, Jiansong
    Zhao, Xu
    Zhuang, Jian
    Zhong, Guoping
    Song, Yamin
    Lei, Liming
    BMC CARDIOVASCULAR DISORDERS, 2024, 24 (01):
  • [10] Event Prediction Model Considering Time and Input Error Using Electronic Medical Records in the Intensive Care Unit: Retrospective Study
    Sung, MinDong
    Hahn, Sangchul
    Han, Chang Hoon
    Lee, Jung Mo
    Lee, Jayoung
    Yoo, Jinkyu
    Heo, Jay
    Kim, Young Sam
    Chung, Kyung Soo
    JMIR MEDICAL INFORMATICS, 2021, 9 (11)