Scalable Online Disease Diagnosis via Multi-Model-Fused Actor-Critic Reinforcement Learning

被引：4

作者：

He, Weijie ^{[1
,2
,3
]}

Chen, Ting ^{[1
,2
,3
]}

机构：

[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China

[2] Tsinghua Univ, Inst Artificial Intelligence, Beijing, Peoples R China

[3] Tsinghua Univ, BNRist, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022 | 2022年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

online disease diagnosis; self-diagnosis; reinforcement learning;

D O I：

10.1145/3534678.3542672

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

For those seeking healthcare advice online, AI based dialogue agents capable of interacting with patients to perform automatic disease diagnosis are a viable option. This application necessitates efficient inquiry of relevant disease symptoms in order to make accurate diagnosis recommendations. This can be formulated as a problem of sequential feature (symptom) selection and classification for which reinforcement learning (RL) approaches have been proposed as a natural solution. They perform well when the feature space is small, that is, the number of symptoms and diagnosable disease categories is limited, but they frequently fail in assignments with a large number of features. To address this challenge, we propose a Multi-Model-Fused Actor-Critic (MMF-AC) RL framework that consists of a generative actor network and a diagnostic critic network. The actor incorporates a Variational AutoEncoder (VAE) to model the uncertainty induced by partial observations of features, thereby facilitating in making appropriate inquiries. In the critic network, a supervised diagnosis model for disease predictions is involved to precisely estimate the state-value function. Furthermore, inspired by the medical concept of differential diagnosis, we combine the generative and diagnosis models to create a novel reward shaping mechanism to address the sparse reward problem in large search spaces. We conduct extensive experiments on both synthetic and real-world datasets for empirical evaluations. The results demonstrate that our approach outperforms state-of-the-art methods in terms of diagnostic accuracy and interaction efficiency while also being more effectively scalable to large search spaces. Besides, our method is adaptable to both categorical and continuous features, making it ideal for online applications.

引用

页码：4695 / 4703

页数：9

共 44 条

[1]

AHEAD Research Inc, 2017, SYMCAT SYMPT BAS COM

[2]

[Anonymous], 2013, INT C MACH LEARN

[3] A MARKOVIAN DECISION PROCESS [J].

BELLMAN, R .

JOURNAL OF MATHEMATICS AND MECHANICS, 1957, 6 (05) :679-684

[4]

Cao Y., 2014, ARXIV14107827

[5] Search Engines vs. Symptom Checkers: A Comparison of their Effectiveness for Online Health Advice [J].

Cross, Sebastian ;

Mourad, Ahmed ;

Zuccon, Guido ;

Koopman, Bevan .

PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, :206-216

[6]

Fox Susannah., 2013, INFORM TRIAGE

[7]

Haarnoja Tuomas, 2018, INT C MACH LEARN, V80

[8] BSODA: A Bipartite Scalable Framework for Online Disease Diagnosis [J].

He, Weijie ;

Mao, Xiaohao ;

Ma, Chao ;

Huang, Yu ;

Hernandez-Lobato, Jose Miguel ;

Chen, Ting .

PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, :2511-2521

[9]

Janisch J, 2019, AAAI CONF ARTIF INTE, P3959

[10]

Janisch Jaromir, 2020, MACH LEARN, P1

← 1 2 3 4 5 →