Improving Skin Condition Classification with a Visual Symptom Checker Trained Using Reinforcement Learning

被引：12

作者：

Akrout, Mohamed ^{[1
,2
]}

Farahmand, Amir-Massoud ^{[2
,3
]}

Jarmain, Tory ^{[1
]}

Abid, Latif ^{[1
]}

机构：

[1] Triage, 1,Adelaide St E,Suite 3001, Toronto, ON M5C 1J4, Canada

[2] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada

[3] Vector Inst, Toronto, ON, Canada

来源：

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT IV | 2019年 / 11767卷

关键词：

Skin condition classification; Question answering model; Reinforcement Learning; Deep Q-Learning;

D O I：

10.1007/978-3-030-32251-9_60

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a visual symptom checker that combines a pre-trained Convolutional Neural Network (CNN) with a Reinforcement Learning (RL) agent as a Question Answering (QA) model. This method increases the classification confidence and accuracy of the visual symptom checker, and decreases the average number of questions asked to narrow down the differential diagnosis. A Deep Q-Network (DQN)-based RL agent learns how to ask the patient about the presence of symptoms in order to maximize the probability of correctly identifying the underlying condition. The RL agent uses the visual information provided by CNN in addition to the answers to the asked questions to guide the QA system. We demonstrate that the RL-based approach increases the accuracy more than 20% compared to the CNN-only approach, which only uses the visual information to predict the condition. Moreover, the increased accuracy is up to 10% compared to the approach that uses the visual information provided by CNN along with a conventional decision tree-based QA system. We finally show that the RL-based approach not only outperforms the decision tree-based approach, but also narrows down the diagnosis faster in terms of the average number of asked questions.

引用

页码：549 / 557

页数：9

共 11 条

[1]

Akrout M., 2018, P 32 C NEUR INF PROC

[2] A reinforcement learning formulation to the complex question answering problem [J].

Chali, Yllias ;

Hasan, Sadid A. ;

Mojahid, Mustapha .

INFORMATION PROCESSING & MANAGEMENT, 2015, 51 (03) :252-272

[3]

Choi Edward, 2016, JMLR Workshop Conf Proc, V56, P301

[4]

Liu QL, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, P201

[5] Human-level control through deep reinforcement learning [J].

Mnih, Volodymyr ;

Kavukcuoglu, Koray ;

Silver, David ;

Rusu, Andrei A. ;

Veness, Joel ;

Bellemare, Marc G. ;

Graves, Alex ;

Riedmiller, Martin ;

Fidjeland, Andreas K. ;

Ostrovski, Georg ;

Petersen, Stig ;

Beattie, Charles ;

Sadik, Amir ;

Antonoglou, Ioannis ;

King, Helen ;

Kumaran, Dharshan ;

Wierstra, Daan ;

Legg, Shane ;

Hassabis, Demis .

NATURE, 2015, 518 (7540) :529-533

[6]

Nogueira R., 2017, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, P574, DOI DOI 10.18653/V1/D17-1061

[7]

Puterman Martin L, 1994, Markov Decision Processes: Discrete Stochastic Dynamic Programming

[8] Learning a Health Knowledge Graph from Electronic Medical Records [J].

Rotmensch, Maya ;

Halpern, Yoni ;

Tlimat, Abdulhakim ;

Horng, Steven ;

Sontag, David .

SCIENTIFIC REPORTS, 2017, 7

[9]

Sutton RS, 2018, ADAPT COMPUT MACH LE, P1

[10] Rethinking the Inception Architecture for Computer Vision [J].

Szegedy, Christian ;

Vanhoucke, Vincent ;

Ioffe, Sergey ;

Shlens, Jon ;

Wojna, Zbigniew .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2818-2826

← 1 2 →