Accuracy and comprehensibility of chat-based artificial intelligence for patient information on atrial fibrillation and cardiac implantable electronic devices

Cited by: 18
Authors
Hillmann, Henrike A. K. [1]
Angelini, Eleonora [1]
Karfoul, Nizar [1]
Feickert, Sebastian [2, 3, 4]
Mueller-Leisse, Johanna [1]
Duncker, David [1]
Affiliations
[1] Hannover Med Sch, Dept Cardiol & Angiol, Hannover Heart Rhythm Ctr, Carl Neuberg Str 1, D-30625 Hannover, Germany
[2] Vivantes Clin Urban, Dept Cardiol, Dieffenbachstr 1, D-10967 Berlin, Germany
[3] Vivantes Clin Urban, Internal Intens Care Unit, Dieffenbachstr 1, D-10967 Berlin, Germany
[4] Univ Med Ctr Rostock, Dept Cardiol, Ernst Heydemann Str 6, D-18057 Rostock, Germany
Source
EUROPACE | 2023 / Vol. 26 / Issue 01
Keywords
Artificial intelligence; Atrial fibrillation; Cardiac implantable devices; Cardiac electrophysiology; Patient education; Digital health; WEBSITE;
DOI
10.1093/europace/euad369
Chinese Library Classification number
R5 [Internal medicine];
Discipline classification code
1002 ; 100201 ;
Abstract
Aims: Natural language processing chatbots (NLPCs) can be used to gather information on medical content. However, these tools carry a potential risk of misinformation. This study aims to evaluate several aspects of the responses given by different NLPCs to questions about atrial fibrillation (AF) and cardiac implantable electronic devices (CIEDs).
Methods and results: Questions were entered into three different NLPC interfaces. Responses were evaluated with regard to appropriateness, comprehensibility, appearance of confabulation, absence of relevant content, and recommendations given for clinically relevant decisions. Moreover, readability was assessed by calculating word count and Flesch Reading Ease score. 52, 60, and 84% of responses on AF and 16, 72, and 88% on CIEDs were evaluated to be appropriate for all responses given by Google Bard (GB), Bing Chat (BC), and ChatGPT Plus (CGP), respectively. Assessment of comprehensibility showed that 96, 88, and 92% of responses on AF and 92, 88, and 100% on CIEDs were comprehensible for all responses created by GB, BC, and CGP, respectively. Readability varied between the different NLPCs. Relevant aspects were missing in 52% (GB), 60% (BC), and 24% (CGP) for AF, and in 92% (GB), 88% (BC), and 52% (CGP) for CIEDs.
Conclusion: Responses generated by an NLPC are mostly easy to understand, with readability varying between the different NLPCs. The appropriateness of responses is limited and varies between NLPCs, and important aspects are often left unmentioned. Thus, chatbots should be used with caution to gather medical information about cardiac arrhythmias and devices.
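The abstract names word count and the Flesch Reading Ease score as its readability measures but does not say how they were computed. As a minimal sketch, the standard Flesch formula (206.835 - 1.015 × words per sentence - 84.6 × syllables per word) can be written as below. The function names, the regex-based tokenization, and the vowel-group syllable heuristic are assumptions made for illustration, not the authors' tooling, and dedicated readability software will count syllables more accurately.

```python
import re


def count_syllables(word: str) -> int:
    """Rough heuristic (an assumption, not the study's method):
    count groups of consecutive vowels and drop a trailing silent 'e'."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    if word.endswith("e") and not word.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def word_count(text: str) -> int:
    """Number of alphabetic word tokens in the text."""
    return len(re.findall(r"[A-Za-z']+", text))


def flesch_reading_ease(text: str) -> float:
    """Standard Flesch Reading Ease formula; higher scores mean easier text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)
    return 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word


if __name__ == "__main__":
    sample = "A pacemaker is a small device. It helps the heart keep a steady rhythm."
    print(word_count(sample), round(flesch_reading_ease(sample), 1))
```

Scores near 100 indicate very easy text, while scores below about 30 are usually considered very difficult, so a chatbot response could be scored with `flesch_reading_ease(response_text)` and compared across tools.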
Pages: 10
Related papers
24 in total
  • [1] Artificial Hallucinations in ChatGPT: Implications in Scientific Writing
    Alkaissi, Hussam
    McFarlane, Samy I.
    [J]. CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (02)
  • [2] Evaluating Recommendations About Atrial Fibrillation for Patients and Clinicians Obtained From Chat-Based Artificial Intelligence Algorithms
    Azizi, Zahra
    Alipour, Pouria
    Gomez, Sofia
    Broadwin, Cassandra
    Islam, Sumaiya
    Sarraju, Ashish
    Rogers, A. J.
    Sandhu, Alexander T.
    Rodriguez, Fatima
    [J]. CIRCULATION-ARRHYTHMIA AND ELECTROPHYSIOLOGY, 2023, 16 (07) : 415 - 417
  • [3] The power of visuals: taking patient education to the next level
    Barendse, Rogier
    Bruining, Nico
    [J]. EUROPACE, 2023, 25 (02) : 258 - 259
  • [4] The 'afibmatters.org' educational website for patients with atrial fibrillation from the European Heart Rhythm Association
    Duncker, David
    Svennberg, Emma
    Deharo, Jean-Claude
    Costa, Francisco Moscoso
    Kommata, Varvara
    [J]. EUROPACE, 2021, 23 (11) : 1693 - 1697
  • [5] Big hype about ChatGPT in medicine: Is it something for rhythmologists? What must be taken into consideration?
    Haverkamp W.
    Strodthoff N.
    Tennenbaum J.
    Israel C.
    [J]. Herzschrittmachertherapie + Elektrophysiologie, 2023, 34 (3) : 240 - 245
  • [6] 360° Virtual reality to improve patient education and reduce anxiety towards atrial fibrillation ablation
    Hermans, Astrid N. L.
    Betz, Konstanze
    Verhaert, Dominique V. M.
    den Uijl, Dennis W.
    Clerx, Kristof
    Debie, Luuk
    Lahaije, Marion
    Vernooy, Kevin
    Linz, Dominik
    Weijs, Bob
    [J]. EUROPACE, 2023, 25 (03) : 855 - 862
  • [7] The 'myrhythmdevice.org' educational website for patients with implanted cardiac devices from the European Heart Rhythm Association
    Kommata, Varvara
    Deharo, Jean-Claude
    Drossart, Inga
    Foldager, Dan
    Svennberg, Emma
    Vernooy, Kevin
    Verstrael, Axel
    Duncker, David
    [J]. EUROPACE, 2022, 24 (11) : 1713 - 1715
  • [8] Marchandot B., 2023, Eur Hear J Open, V3, po
  • [9] Evaluation of an Artificial Intelligence Chatbot for Delivery of IR Patient Education Material: A Comparison with Societal Website Content
    McCarthy, Colin J.
    Berkowitz, Seth
    Ramalingam, Vijay
    Ahmed, Muneeb
    [J]. JOURNAL OF VASCULAR AND INTERVENTIONAL RADIOLOGY, 2023, 34 (10) : 1760 - +
  • [10] The imperative for regulatory oversight of large language models (or generative AI) in healthcare
    Mesko, Bertalan
    Topol, Eric J. J.
    [J]. NPJ DIGITAL MEDICINE, 2023, 6 (01)