AI-Driven Diagnostic Assistance in Medical Inquiry: ReinforcementLearning Algorithm Development and Validation

被引:4
作者
Zou, Xuan [1 ]
He, Weijie [2 ,3 ,4 ]
Huang, Yu [5 ]
Ouyang, Yi [5 ]
Zhang, Zhen [1 ]
Wu, Yu [1 ]
Wu, Yongsheng [1 ]
Feng, Lili [6 ]
Yang, Mengqi [7 ]
Chen, Xuyan [6 ]
Zheng, Yefeng [5 ]
Jiang, Rui [4 ,8 ]
Chen, Ting [2 ,3 ,4 ]
机构
[1] Shenzhen Ctr Dis Control & Prevent, Shenzhen, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Room 3-609, Beijing 100084, Peoples R China
[3] Tsinghua Univ, Inst Artificial Intelligence, Beijing, Peoples R China
[4] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[5] Tencent YouTu Lab, Jarvis Res Ctr, Shenzhen, Peoples R China
[6] Tsinghua Univ, Beijing Tsinghua Changgung Hosp, Sch Clin Med, Beijing, Peoples R China
[7] Tencent Healthcare, Shenzhen, Peoples R China
[8] Tsinghua Univ, Dept Automat, Beijing, Peoples R China
关键词
inquiry and diagnosis; electronic health record; reinforcement learning; natural language processing; artificial intelligence;
D O I
10.2196/54616
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: For medical diagnosis, clinicians typically begin with a patient's chief concerns, followed by questions aboutsymptoms and medical history, physical examinations, and requests for necessary auxiliary examinations to gather comprehensivemedical information. This complex medical investigation process has yet to be modeled by existing artificial intelligence (AI)methodologies. Objective: The aim of this study was to develop an AI-driven medical inquiry assistant for clinical diagnosis that providesinquiry recommendations by simulating clinicians'medical investigating logic via reinforcement learning. Methods: We compiled multicenter, deidentified outpatient electronic health records from 76 hospitals in Shenzhen, China,spanning the period from July to November 2021. These records consisted of both unstructured textual information and structuredlaboratory test results. We first performed feature extraction and standardization using natural language processing techniquesand then used a reinforcement learning actor-critic framework to explore the rational and effective inquiry logic. To align theinquiry process with actual clinical practice, we segmented the inquiry into 4 stages: inquiring about symptoms and medicalhistory, conducting physical examinations, requesting auxiliary examinations, and terminating the inquiry with a diagnosis.External validation was conducted to validate the inquiry logic of the AI model. Results: This study focused on 2 retrospective inquiry-and-diagnosis tasks in the emergency and pediatrics departments. Theemergency departments provided records of 339,020 consultations including mainly children (median age 5.2, IQR 2.6-26.1years) with various types of upper respiratory tract infections (250,638/339,020, 73.93%). The pediatrics department providedrecords of 561,659 consultations, mainly of children (median age 3.8, IQR 2.0-5.7 years) with various types of upper respiratorytract infections (498,408/561,659, 88.73%). When conducting its own inquiries in both scenarios, the AI model demonstratedhigh diagnostic performance, with areas under the receiver operating characteristic curve of 0.955 (95% CI 0.953-0.956) and0.943 (95% CI 0.941-0.944), respectively. When the AI model was used in a simulated collaboration with physicians, it notablyreduced the average number of physicians' inquiries to 46% (6.037/13.26; 95% CI 6.009-6.064) and 43% (6.245/14.364; 95%J Med Internet Res 2024 | vol. 26 | e54616 | p. 1https://www.jmir.org/2024/1/e54616(page number not for citation purposes)Zou et alJOURNAL OF MEDICAL INTERNET RESEARCHXSL center dot FORenderX CI 6.225-6.269) while achieving areas under the receiver operating characteristic curve of 0.972 (95% CI 0.970-0.973) and 0.968(95% CI 0.967-0.969) in the scenarios. External validation revealed a normalized Kendall tau distance of 0.323 (95% CI 0.301-0.346),indicating the inquiry consistency of the AI model with physicians. Conclusions: This retrospective analysis of predominantly respiratory pediatric presentations in emergency and pediatricsdepartments demonstrated that an AI-driven diagnostic assistant had high diagnostic performance both in stand-alone use and insimulated collaboration with clinicians. Its investigation process was found to be consistent with the clinicians'medical investigationlogic. These findings highlight the diagnostic assistant's promise in assisting the decision-making processes of health careprofessionals
引用
收藏
页数:18
相关论文
共 37 条
[1]  
Balogh E.P., 2015, Improving Diagnosis in Health Care
[2]   A MARKOVIAN DECISION PROCESS [J].
BELLMAN, R .
JOURNAL OF MATHEMATICS AND MECHANICS, 1957, 6 (05) :679-684
[3]   Overconfidence as a cause of diagnostic error in medicine [J].
Berner, Eta S. ;
Graber, Mark L. .
AMERICAN JOURNAL OF MEDICINE, 2008, 121 (05) :2-23
[4]   Rationality in medical decision making: a review of the literature on doctors' decision-making biases [J].
Bornstein, BH ;
Emler, AC .
JOURNAL OF EVALUATION IN CLINICAL PRACTICE, 2001, 7 (02) :97-107
[5]  
Chen JY, 2022, AAAI CONF ARTIF INTE, P4432
[6]   A benchmark for automatic medical consultation system: frameworks, tasks and datasets [J].
Chen, Wei ;
Li, Zhiwei ;
Fang, Hongyi ;
Yao, Qianyuan ;
Zhong, Cheng ;
Hao, Jianye ;
Zhang, Qi ;
Huang, Xuanjing ;
Peng, Jiajie ;
Wei, Zhongyu .
BIOINFORMATICS, 2023, 39 (01)
[7]  
Cheng Y, 2022, P 31 INT JOINT C ART, P4266
[8]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9]   Scalable Online Disease Diagnosis via Multi-Model-Fused Actor-Critic Reinforcement Learning [J].
He, Weijie ;
Chen, Ting .
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, :4695-4703
[10]   BSODA: A Bipartite Scalable Framework for Online Disease Diagnosis [J].
He, Weijie ;
Mao, Xiaohao ;
Ma, Chao ;
Huang, Yu ;
Hernandez-Lobato, Jose Miguel ;
Chen, Ting .
PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, :2511-2521