Evaluation of Four Artificial Intelligence-Assisted Self-Diagnosis Apps on Three Diagnoses: Two-Year Follow-Up Study

被引:28
作者
Cirkovic, Aleksandar [1 ]
机构
[1] Schulgasse 21, D-92637 Germany, Germany
关键词
artificial intelligence; machine learning; mobile apps; medical diagnosis; mHealth;
D O I
10.2196/18097
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Consumer-oriented mobile self-diagnosis apps have been developed using undisclosed algorithms, presumably based on machine learning and other artificial intelligence (AI) technologies. The US Food and Drug Administration now discerns apps with learning AI algorithms from those with stable ones and treats the former as medical devices. To the author's knowledge, no self-diagnosis app testing has been performed in the field of ophthalmology so far. Objective: The objective of this study was to test apps that were previously mentioned in the scientific literature on a set of diagnoses in a deliberate time interval, comparing the results and looking for differences that hint at "nonlocked" learning algorithms. Methods: Four apps from the literature were chosen (Ada, Babylon, Buoy, and Your.MD). A set of three ophthalmology diagnoses (glaucoma, retinal tear, dry eye syndrome) representing three levels of urgency was used to simultaneously test the apps' diagnostic efficiency and treatment recommendations in this specialty. Two years was the chosen time interval between the tests (2018 and 2020). Scores were awarded by one evaluating physician using a defined scheme. Results: Two apps (Ada and Your.MD) received significantly higher scores than the other two. All apps either worsened in their results between 2018 and 2020 or remained unchanged at a low level. The variation in the results over time indicates "nonlocked" learning algorithms using AI technologies. None of the apps provided correct diagnoses and treatment recommendations for all three diagnoses in 2020. Two apps (Babylon and Your.MD) asked significantly fewer questions than the other two (P<.001). Conclusions: "Nonlocked" algorithms are used by self-diagnosis apps. The diagnostic efficiency of the tested apps seems to worsen over time, with some apps being more capable than others. Systematic studies on a wider scale are necessary for health care providers and patients to correctly assess the safety and efficacy of such apps and for correct classification by health care regulating authorities.
引用
收藏
页数:8
相关论文
共 39 条
[1]   The Use of Artificially Intelligent Self-Diagnosing Digital Platforms by the General Public: Scoping Review [J].
Aboueid, Stephanie ;
Liu, Rebecca H. ;
Desta, Binyam Negussie ;
Chaurasia, Ashok ;
Ebrahim, Shanil .
JMIR MEDICAL INFORMATICS, 2019, 7 (02) :4-14
[2]   Dry Eye Syndrome Preferred Practice Pattern® [J].
Akpek, Esen K. ;
Amescua, Guillermo ;
Farid, Marjan ;
Garcia-Ferrer, Francisco J. ;
Lin, Amy ;
Rhee, Michelle K. ;
Varu, Divya M. ;
Musch, David C. ;
Dunn, Steven P. ;
Mah, Francis S. .
OPHTHALMOLOGY, 2019, 126 (01) :P286-P334
[3]  
American Academy of Ophthalmology, 2012, BAS CLIN SCI COURS C
[4]  
[Anonymous], 2020, SENIOR DATA SCI ENG
[5]  
[Anonymous], 2016, SCOOP INDEPENDENT NE
[6]  
[Anonymous], 2017, BUOY HLTH CHATBOT HE
[7]  
[Anonymous], 2020, 2020 2021 BAS CLIN S
[8]  
[Anonymous], REB SELLS MBCHB HONS
[9]  
Buhr S., 2017, BUOY HOPES FIGHT FAK
[10]  
Burgener R., 2006, GOOGLE PATENTS