A Comparison of Artificial Intelligence and Human Doctors for the Purpose of Triage and Diagnosis

被引:67
作者
Baker, Adam [1 ]
Perov, Yura [1 ]
Middleton, Katherine [1 ]
Baxter, Janie [1 ]
Mullarkey, Daniel [1 ]
Sangar, Davinder [1 ]
Butt, Mobasher [1 ]
DoRosario, Arnold [2 ]
Johri, Saurabh [1 ]
机构
[1] Babylon Hlth, London, England
[2] Yale New Haven Hlth, Northeast Med Grp, New Haven, CT USA
来源
FRONTIERS IN ARTIFICIAL INTELLIGENCE | 2020年 / 3卷
关键词
virtual assistant; diagnosis; triage; AI diagnosis; symptom checker; computer-assisted diagnosis; bayesian networks;
D O I
10.3389/frai.2020.543405
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
AI virtual assistants have significant potential to alleviate the pressure on overly burdened healthcare systems by enabling patients to self-assess their symptoms and to seek further care when appropriate. For these systems to make a meaningful contribution to healthcare globally, they must be trusted by patients and healthcare professionals alike, and service the needs of patients in diverse regions and segments of the population. We developed an AI virtual assistant which provides patients with triage and diagnostic information. Crucially, the system is based on a generative model, which allows for relatively straightforward reparameterization to reflect local disease and risk factor burden in diverse regions and population segments. This is an appealing property, particularly when considering the potential of AI systems to improve the provision of healthcare on a global scale in many regions and for both developing and developed countries. We performed a prospective validation study of the accuracy and safety of the AI system and human doctors. Importantly, we assessed the accuracy and safety of both the AI and human doctors independently against identical clinical cases and, unlike previous studies, also accounted for the information gathering process of both agents. Overall, we found that the AI system is able to provide patients with triage and diagnostic information with a level of clinical accuracy and safety comparable to that of human doctors. Through this approach and study, we hope to start building trust in AI-powered systems by directly comparing their performance to human doctors, who do not always agree with each other on the cause of patients' symptoms or the most appropriate triage recommendation.
引用
收藏
页数:9
相关论文
共 29 条
[1]   Symptom checkers versus doctors: A prospective, head-to-head comparison for cough [J].
Berry, Andrew C. ;
Berry, Nicholas A. ;
Wang, Bin ;
Mulekar, Madhuri S. ;
Melvin, Anne ;
Battiola, Richard J. ;
Bulacan, Frederick K. ;
Berry, Bruce B. .
CLINICAL RESPIRATORY JOURNAL, 2020, 14 (04) :413-415
[2]   GP at hand: a digital revolution for health care provision? [J].
Burki, Talha .
LANCET, 2019, 394 (10197) :457-460
[3]   Digital and online symptom checkers and health assessment/triage services for urgent health problems: systematic review [J].
Chambers, Duncan ;
Cantrell, Anna J. ;
Johnson, Maxine ;
Preston, Louise ;
Baxter, Susan K. ;
Booth, Andrew ;
Turner, Janette .
BMJ OPEN, 2019, 9 (08)
[4]   AIS-BN: An adaptive importance sampling algorithm for evidential reasoning in large Bayesian networks [J].
Cheng, J ;
Druzdzel, MJ .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 13 :155-188
[5]  
Douglas Laura, 2017, UNIVERSAL MARGINALIZ
[6]   Safety of patient-facing digital symptom checkers [J].
Fraser, Hamish ;
Coiera, Enrico ;
Wong, David .
LANCET, 2018, 392 (10161) :2263-2264
[7]   The incidence of diagnostic error in medicine [J].
Graber, Mark L. .
BMJ QUALITY & SAFETY, 2013, 22 :ii21-ii27
[8]  
Gu SX, 2015, ADV NEUR IN, V28
[9]   The Application of Medical Artificial Intelligence Technology in Rural Areas of Developing Countries [J].
Guo, Jonathan ;
Li, Bin .
HEALTH EQUITY, 2018, 2 (01) :174-181
[10]  
Heckerman D., 1990, Proc. of the 5th Annual Conf. on Uncertainty in Artificial Intell, P163, DOI DOI 10.1016/B978-0-444-88738-2.50020-8