Guidelines and standard frameworks for artificial intelligence in medicine: a systematic review

被引:2
作者
Shiferaw, Kirubel Biruk [1 ]
Roloff, Moritz [1 ]
Balaur, Irina [2 ]
Welter, Danielle [3 ]
Waltemath, Dagmar [1 ]
Zeleke, Atinkut Alamirrew [1 ]
机构
[1] Univ Med Greifswald, Dept Med Informat, Inst Community Med, Walther Rathenau Str 48, D-17475 Greifswald, Germany
[2] Univ Luxembourg, Luxembourg Ctr Syst Biomed, L-4367 Belvaux, Luxembourg
[3] Luxembourg Natl Data Serv, L-4362 Esch Sur Alzette, Luxembourg
关键词
digital medicine; artificial intelligence; machine learning; guidelines; quality; framework; AGREE II; medicine; standard; systematic review; medical informatics; IMPLEMENTATION;
D O I
10.1093/jamiaopen/ooae155
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objectives The continuous integration of artificial intelligence (AI) into clinical settings requires the development of up-to-date and robust guidelines and standard frameworks that consider the evolving challenges of AI implementation in medicine. This review evaluates the quality of these guideline and summarizes ethical frameworks, best practices, and recommendations.Materials and Methods The Appraisal of Guidelines, Research, and Evaluation II tool was used to assess the quality of guidelines based on 6 domains: scope and purpose, stakeholder involvement, rigor of development, clarity of presentation, applicability, and editorial independence. The protocol of this review including the eligibility criteria, the search strategy data extraction sheet and methods, was published prior to the actual review with International Registered Report Identifier of DERR1-10.2196/47105.Results The initial search resulted in 4975 studies from 2 databases and 7 studies from manual search. Eleven articles were selected for data extraction based on the eligibility criteria. We found that while guidelines generally excel in scope, purpose, and editorial independence, there is significant variability in applicability and the rigor of guideline development. Well-established initiatives such as TRIPOD+AI, DECIDE-AI, SPIRIT-AI, and CONSORT-AI have shown high quality, particularly in terms of stakeholder involvement. However, applicability remains a prominent challenge among the guidelines. The result also showed that the reproducibility, ethical, and environmental aspects of AI in medicine still need attention from both medical and AI communities.Discussion Our work highlights the need for working toward the development of integrated and comprehensive reporting guidelines that adhere to the principles of Findability, Accessibility, Interoperability and Reusability. This alignment is essential for fostering a cultural shift toward transparency and open science, which are pivotal milestone for sustainable digital health research.Conclusion This review evaluates the current reporting guidelines, discussing their advantages as well as challenges and limitations. As artificial intelligence (AI) continues to play an increasingly central role in health care, its safe and effective integration requires high-quality reporting guidelines that address the specific challenges of implementing AI in clinical settings. This systematic review evaluates the quality of existing AI reporting guidelines in medicine, focusing on their ethical considerations, best practices, and practical recommendations. Using the Appraisal of Guidelines, Research, and Evaluation II tool, we assessed the quality of the guidelines based on 6 key domains: scope, stakeholder involvement, development rigor, clarity, applicability, and editorial independence. While most guidelines performed well in areas such as scope and stakeholder involvement, significant variability was observed in their applicability and development rigor, reflecting the ongoing challenge of translating AI research into real-world health care settings. Additionally, the review highlighted the need for greater attention to reproducibility, ethics, and environmental impacts in medical AI research. Furthermore, the study underscores the importance of aligning guidelines with the Findable, Accessible, Interoperable, and Reusable principles to foster transparency, open science, and collaboration, which are crucial for advancing sustainable digital health research.
引用
收藏
页数:10
相关论文
共 58 条
  • [1] Baker M, 2016, NATURE, V533, P452, DOI 10.1038/533452a
  • [2] Introducing the FAIR Principles for research software
    Barker, Michelle
    Hong, Neil P. Chue
    Katz, Daniel S.
    Lamprecht, Anna-Lena
    Martinez-Ortiz, Carlos
    Psomopoulos, Fotis
    Harrow, Jennifer
    Castro, Leyla Jael
    Gruenpeter, Morane
    Martinez, Paula Andrea
    Honeyman, Tom
    [J]. SCIENTIFIC DATA, 2022, 9 (01)
  • [3] Reproducibility of prediction models in health services research
    Belbasis, Lazaros
    Panagiotou, Orestis A.
    [J]. BMC RESEARCH NOTES, 2022, 15 (01)
  • [4] Bossuyt PM, 2015, BMJ-BRIT MED J, V351, DOI [10.1136/bmj.h5527, 10.1148/radiol.2015151516, 10.1373/clinchem.2015.246280]
  • [5] AGREE II: advancing guideline development, reporting and evaluation in health care
    Brouwers, Melissa C.
    Kho, Michelle E.
    Browman, George P.
    Burgers, Jako S.
    Cluzeau, Francoise
    Feder, Gene
    Fervers, Beatrice
    Graham, Ian D.
    Grimshaw, Jeremy
    Hanna, Steven E.
    Littlejohns, Peter
    Makarski, Julie
    Zitzelsberger, Louise
    [J]. CANADIAN MEDICAL ASSOCIATION JOURNAL, 2010, 182 (18) : E839 - E842
  • [6] Celi LA, 2019, PLOS ONE, V14, DOI 10.1371/journal.pone.0210232
  • [7] TRIPOD plus AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods
    Collins, Gary S.
    Moons, Karel G. M.
    Dhiman, Paula
    Riley, Richard
    Beam, Andrew L.
    Van Calster, Ben
    Ghassemi, Marzyeh
    Liu, Xiaoxuan
    Reitsma, Johannes B.
    van Smeden, Maarten
    Boulesteix, Anne-Laure
    Camaradou, Jennifer Catherine
    Celi, Leo Anthony
    Denaxas, Spiros
    Denniston, Alastair K.
    Glocker, Ben
    Golub, Robert M.
    Harvey, Hugh
    Heinze, Georg
    Hoffman, Michael M.
    Kengne, Andre Pascal
    Lam, Emily
    Lee, Naomi
    Loder, Elizabeth W.
    Maier-Hein, Lena
    Mateen, Bilal A.
    McCradden, Melissa
    Oakden-Rayner, Lauren
    Ordish, Johan
    Parnell, Richard
    Rose, Sherri
    Singh, Karandeep
    Wynants, Laure
    Logullo, Patricia
    [J]. BMJ-BRITISH MEDICAL JOURNAL, 2024, 385
  • [8] Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) The TRIPOD Statement
    Collins, Gary S.
    Reitsma, Johannes B.
    Altman, Douglas G.
    Moons, Karel G. M.
    [J]. CIRCULATION, 2015, 131 (02) : 211 - 219
  • [9] Guidelines for Artificial Intelligence in Medicine: Literature Review and Content Analysis of Frameworks
    Crossnohere, Norah L.
    Elsaid, Mohamed
    Paskett, Jonathan
    Bose-Brill, Seuli
    Bridges, John F. P.
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (08)
  • [10] Guidelines and quality criteria for artificial intelligence-based prediction models in healthcare: a scoping review
    de Hond, Anne A. H.
    Leeuwenberg, Artuur M.
    Hooft, Lotty
    Kant, Ilse M. J.
    Nijman, Steven W. J.
    van Os, Hendrikus J. A.
    Aardoom, Jiska J.
    Debray, Thomas P. A.
    Schuit, Ewoud
    van Smeden, Maarten
    Reitsma, Johannes B.
    Steyerberg, Ewout W.
    Chavannes, Niels H.
    Moons, Karel G. M.
    [J]. NPJ DIGITAL MEDICINE, 2022, 5 (01)