Streamlining medical software development with CARE lifecycle and CARE agent: an AI-driven technology readiness level assessment tool

被引:0
作者
Hart, Steven N. [1 ]
Day, Patrick L. [1 ]
Garcia, Christopher A. [1 ]
机构
[1] Mayo Clin, Dept Lab Med & Pathol, 200 1st St SW, Rochester, MN 55901 USA
关键词
CARE lifecycle; CARE agent; Technology readiness level (TRL); AI-driven assessment; Self-service tooling; Retrieval-augmented generation (RAG); Local deployment; Data privacy; Stakeholder personas; Automation in technology evaluation; ARTIFICIAL-INTELLIGENCE; CHALLENGES;
D O I
10.1186/s12911-025-03099-0
中图分类号
R-058 [];
学科分类号
摘要
BackgroundDeveloping medical software requires navigating complex regulatory, ethical, and operational challenges. A comprehensive framework that supports both technical maturity and clinical safety is essential for effective artificial intelligence and machine learning system deployment. This paper introduces the Clinical Artificial Intelligence Readiness Evaluator Lifecycle and the Clinical Artificial Intelligence Readiness Evaluator Agent-a framework and AI-driven tool designed to streamline technology readiness level assessments in medical software development.MethodsWe developed the framework using an iterative process grounded in collaborative stakeholder analysis. Key institutional stakeholders-including clinical informatics experts, data engineers, ethicists, and operational leaders-were engaged to identify and prioritize the regulatory, ethical, and technical requirements unique to clinical AI/ML development. This approach, combined with a thorough review of existing methodologies, informed the creation of a lifecycle model that guides technology maturation from initial concept to full deployment. The AI-driven tool was implemented using a retrieval-augmented generation strategy and evaluated through a synthetic use case (the Diabetes Outcome Predictor). Evaluation metrics included the proportion of correctly addressed assessment questions and the overall time required for automated review, with human adjudication validating the tool's performance.ResultsThe findings indicate that the proposed framework effectively captures the complexities of clinical AI development. In the synthetic use case, the AI-driven tool identified that 32.8% of the assessment questions remained unanswered, while human adjudication confirmed discrepancies in 19.4% of these instances. These outcomes suggest that, when fully refined, the automated assessment process can reduce the need for extensive multi-stakeholder involvement, accelerate project timelines, and enhance resource efficiency.ConclusionsThe Clinical Artificial Intelligence Readiness Evaluator Lifecycle and Agent offer a robust and methodologically sound approach for evaluating the maturity of medical AI systems. By integrating stakeholder-driven insights with an AI-based assessment process, this framework lays the groundwork for more streamlined, secure, and effective clinical AI development. Future work will focus on optimizing retrieval strategies and expanding validation across diverse clinical applications.
引用
收藏
页数:10
相关论文
共 25 条
[1]  
[Anonymous], 2024, LangChain, V2022
[2]   Big Data and Machine Learning in Health Care [J].
Beam, Andrew L. ;
Kohane, Isaac S. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2018, 319 (13) :1317-1318
[3]   Harnessing the potential of large language models in medical education: promise and pitfalls [J].
Benitez, Trista M. ;
Xu, Yueyuan ;
Boudreau, J. Donald ;
Kow, Alfred Wei Chieh ;
Bello, Fernando ;
Phuoc, Le Van ;
Wang, Xiaofei ;
Sun, Xiaodong ;
Leung, Gilberto Ka-Kit ;
Lan, Yanyan ;
Wang, Yaxing ;
Cheng, Davy ;
Tham, Yih-Chung ;
Wong, Tien Yin ;
Chung, Kevin C. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (03) :776-783
[4]   The now and future of ChatGPT and GPT in psychiatry [J].
Cheng, Szu-Wei ;
Chang, Chung-Wen ;
Chang, Wan-Jung ;
Wang, Hao-Wei ;
Liang, Chih-Sung ;
Kishimoto, Taishiro ;
Chang, Jane Pei-Chen ;
Kuo, John S. ;
Su, Kuan-Pin .
PSYCHIATRY AND CLINICAL NEUROSCIENCES, 2023, 77 (11) :592-596
[5]   Artificial intelligence for decision making in the era of Big Data - evolution, challenges and research agenda [J].
Duan, Yanqing ;
Edwards, John S. ;
Dwivedi, Yogesh K. .
INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2019, 48 :63-71
[6]  
Health CfD. & Radiological, 2024, Blog: A lifecycle management approach toward delivering safe, effective AI-enabled health care
[7]   Development and preliminary testing of Health Equity Across the AI Lifecycle (HEAAL): A framework for healthcare delivery organizations to mitigate the risk of AI solutions worsening health inequities [J].
Kim, Jee Young ;
Hasan, Alifia ;
Kellogg, Katherine C. ;
Ratliff, William ;
Murray, Sara G. ;
Suresh, Harini ;
Valladares, Alexandra ;
Shaw, Keo ;
Tobey, Danny ;
Vidal, David E. ;
Lifson, Mark A. ;
Patel, Manesh ;
Raji, Inioluwa Deborah ;
Gao, Michael ;
Knechtle, William ;
Tang, Linda ;
Balu, Suresh ;
Sendak, Mark P. .
PLOS DIGITAL HEALTH, 2024, 3 (05)
[8]   Technology readiness levels for machine learning systems [J].
Lavin, Alexander ;
Gilligan-Lee, Ciaran M. ;
Visnjic, Alessya ;
Ganju, Siddha ;
Newman, Dava ;
Ganguly, Sujoy ;
Lange, Danny ;
Baydin, Atilim Gunes ;
Sharma, Amit ;
Gibson, Adam ;
Zheng, Stephan ;
Xing, Eric P. ;
Mattmann, Chris ;
Parr, James ;
Gal, Yarin .
NATURE COMMUNICATIONS, 2022, 13 (01)
[9]   Technology readiness assessments: A retrospective [J].
Mankins, John C. .
ACTA ASTRONAUTICA, 2009, 65 (9-10) :1216-1223
[10]   The imperative for regulatory oversight of large language models (or generative AI) in healthcare [J].
Mesko, Bertalan ;
Topol, Eric J. J. .
NPJ DIGITAL MEDICINE, 2023, 6 (01)