Interpretable Disease Progression Prediction Based on Reinforcement Reasoning Over a Knowledge Graph

被引:3
作者
Sun, Zhoujian [1 ,2 ]
Dong, Wei [3 ]
Shi, Jinlong [4 ]
Huang, Zhengxing [5 ]
机构
[1] Zhejiang Lab, Inst Appl Math & Machine Intelligence, Hangzhou 311121, Peoples R China
[2] Zhejiang Univ, Coll Biomed Engn & Instrument Sci, Hangzhou 310027, Peoples R China
[3] Chinese Peoples Liberat Army Gen Hosp, Dept Crit Care Med, Cardiol Div, Beijing 100853, Peoples R China
[4] Chinese Peoples Liberat Army Gen Hosp, Med Innovat Res Div, Key Lab Biomed Engn & Translat Med, Minist Ind & Informat Technol, Beijing 100853, Peoples R China
[5] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2024年 / 54卷 / 03期
关键词
Interpretable disease progression prediction; knowledge graph (KG); medical decision making; reinforcement learning; ELECTRONIC HEALTH RECORDS; CLASSIFICATION;
D O I
10.1109/TSMC.2023.3331847
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: To combine medical knowledge and medical data to interpretably predict the risk of disease. Methods: We formulated the disease progression prediction task as a random walk along a knowledge graph (KG). Specifically, we build a KG to record relationships between diseases and risk factors according to validated medical knowledge. Then, an object walks along the KG. It starts walking at a patient entity, which connects the KG based on the patient's current diseases or risk factors and stops at a disease entity representing the predicted disease. The trajectory generated by the object represents an interpretable disease progression path of the given patient. The dynamics of the object are controlled by a policy-based reinforcement learning module, which is trained by electronic health records (EHRs). Experiments: We utilized three real-world EHR datasets to evaluate the performance of our model. In the disease progression prediction task, our model achieves 0.743, 0.639, and 0.643 in terms of macro area under the curve (AUC) in predicting 53 circulation system diseases in the three datasets, respectively. This performance is comparable to medical research's commonly used machine learning models. In qualitative analysis, our clinical collaborator reviewed the disease progression paths generated by our model and advocated their interpretability and reliability. Conclusion: Experimental results validate the proposed model in interpretably evaluating and optimizing disease progression prediction. Significance: Our work contributes to leveraging the potential of medical knowledge and medical data jointly for interpretable prediction tasks.
引用
收藏
页码:1948 / 1959
页数:12
相关论文
共 46 条
[1]   Discovering the Type 2 Diabetes in Electronic Health Records Using the Sparse Balanced Support Vector Machine [J].
Bernardini, Michele ;
Romeo, Luca ;
Misericordia, Paolo ;
Frontoni, Emanuele .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (01) :235-246
[2]  
Bordes A., 2013, Adv. Neural Inf. Process. Syst., V26
[3]   Predicting Chronic Disease Hospitalizations from Electronic Health Records: An Interpretable Classification Approach [J].
Brisimi, Theodora S. ;
Xu, Tingting ;
Wang, Taiyao ;
Dai, Wuyang ;
Adams, William G. ;
Paschalidis, Ioannis Ch .
PROCEEDINGS OF THE IEEE, 2018, 106 (04) :690-707
[4]   Deep ROC Analysis and AUC as Balanced Average Accuracy, for Improved Classifier Selection, Audit and Explanation [J].
Carrington, Andre M. ;
Manuel, Douglas G. ;
Fieguth, Paul W. ;
Ramsay, Tim ;
Osmani, Venet ;
Wernly, Bernhard ;
Bennett, Carol ;
Hawken, Steven ;
Magwood, Olivia ;
Sheikh, Yusuf ;
McInnes, Matthew ;
Holzinger, Andreas .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) :329-341
[5]  
Che Zhengping, 2016, AMIA Annu Symp Proc, V2016, P371
[6]   Jointly Non-Sampling Learning for Knowledge Graph Enhanced Recommendation [J].
Chen, Chong ;
Zhang, Min ;
Ma, Weizhi ;
Liu, Yiqun ;
Ma, Shaoping .
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, :189-198
[7]  
Chen J, 2018, 2018 INTERNATIONAL CONFERENCE ON MANIPULATION, AUTOMATION AND ROBOTICS AT SMALL SCALES (MARSS)
[8]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[9]   GRAM: Graph-based Attention Model for Healthcare Representation Learning [J].
Choi, Edward ;
Bahadori, Mohammad Taha ;
Song, Le ;
Stewart, Walter F. ;
Sun, Jimeng .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :787-795
[10]  
Choi E, 2016, ADV NEUR IN, V29