25 entries in total
- [1] Testing and Evaluation of Health Care Applications of Large Language Models: A Systematic Review [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2025, 333 (04) : 319 - 328
- [2] Belz A, 2021, Arxiv, DOI 10.48550/arXiv.2103.07929
- [3] ChatGPT: standard reporting guidelines for responsible use [J]. NATURE, 2023, 618 (7964) : 238 - 238
- [4] Protocol for the development of the Chatbot Assessment Reporting Tool (CHART) for clinical advice [J]. BMJ OPEN, 2024, 14 (05):
- [5] Gallifant J, 2024, PREPRINT, DOI 10.1101/2024.07.24.24310930
- [6] Gilson A, 2023, JMIR MED EDUC, V9, e45312, DOI 10.2196/45312
- [7] Gundersen OE, 2018, AAAI CONF ARTIF INTE, P1644
- [8] Reproducibility standards for machine learning in the life sciences [J]. NATURE METHODS, 2021, 18 (10) : 1132 - 1135
- [10] Hutson M, 2018, SCIENCE, V359, P725, DOI 10.1126/science.359.6377.725