Data set terminology of deep learning in medicine: a historical review and recommendation

Cited by: 6
Authors
Walston, Shannon L. [1]
Seki, Hiroshi [1]
Takita, Hirotaka [1]
Mitsuyama, Yasuhito [1]
Sato, Shingo [2]
Hagiwara, Akifumi [3]
Ito, Rintaro [4]
Hanaoka, Shouhei [5]
Miki, Yukio [1]
Ueda, Daiju [1, 6, 7]
Affiliations
[1] Osaka Metropolitan Univ, Grad Sch Med, Dept Diagnost & Intervent Radiol, Osaka, Japan
[2] Thomas Jefferson Univ, Sidney Kimmel Canc Ctr, Philadelphia, PA USA
[3] Juntendo Univ, Sch Med, Dept Radiol, Tokyo, Japan
[4] Nagoya Univ, Dept Radiol, Nagoya, Japan
[5] Univ Tokyo Hosp, Dept Radiol, Tokyo, Japan
[6] Osaka Metropolitan Univ, Grad Sch Med, Dept Artificial Intelligence, Osaka, Japan
[7] Osaka Metropolitan Univ, Ctr Hlth Sci Innovat, Osaka, Japan
Keywords
Terminology; Artificial intelligence; Deep learning; Data partition; Data splitting; ARTIFICIAL-INTELLIGENCE; VALIDATION; MODEL; PROGNOSIS; TOOL;
DOI
10.1007/s11604-024-01608-1
Chinese Library Classification (CLC) number
R8 [Special medicine]; R445 [Diagnostic imaging];
Discipline classification code
1002; 100207; 1009;
Abstract
Medicine and deep learning-based artificial intelligence (AI) engineering represent two distinct fields each with decades of published history. The current rapid convergence of deep learning and medicine has led to significant advancements, yet it has also introduced ambiguity regarding data set terms common to both fields, potentially leading to miscommunication and methodological discrepancies. This narrative review aims to give historical context for these terms, accentuate the importance of clarity when these terms are used in medical deep learning contexts, and offer solutions to mitigate misunderstandings by readers from either field. Through an examination of historical documents, including articles, writing guidelines, and textbooks, this review traces the divergent evolution of terms for data sets and their impact. Initially, the discordant interpretations of the word 'validation' in medical and AI contexts are explored. We then show that in the medical field as well, terms traditionally used in the deep learning domain are becoming more common, with the data for creating models referred to as the 'training set', the data for tuning of parameters referred to as the 'validation (or tuning) set', and the data for the evaluation of models as the 'test set'. Additionally, the test sets used for model evaluation are classified into internal (random splitting, cross-validation, and leave-one-out) sets and external (temporal and geographic) sets. This review then identifies often misunderstood terms and proposes pragmatic solutions to mitigate terminological confusion in the field of deep learning in medicine. We support the accurate and standardized description of these data sets and the explicit definition of data set splitting terminologies in each publication. These are crucial methods for demonstrating the robustness and generalizability of deep learning applications in medicine. This review aspires to enhance the precision of communication, thereby fostering more effective and transparent research methodologies in this interdisciplinary field.
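The data set terms named in the abstract can be illustrated with a minimal Python sketch. It is not taken from the article: the `Record` fields, site labels, years, split fractions, and all function names are hypothetical and chosen only for illustration. It shows an internal random split into training, validation (tuning), and test sets, k-fold cross-validation (with leave-one-out as the special case where k equals the number of records), and external test sets defined by time or by location.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    patient_id: int
    site: str  # hypothetical hospital label
    year: int  # hypothetical acquisition year


def make_records(n=100, seed=0):
    """Create synthetic records; the values carry no clinical meaning."""
    rng = random.Random(seed)
    sites = ["site_A", "site_B", "site_C"]
    years = [2019, 2020, 2021, 2022]
    return [Record(i, rng.choice(sites), rng.choice(years)) for i in range(n)]


def random_split(records, fractions=(0.7, 0.15, 0.15), seed=0):
    """Internal random split into training, validation (tuning), and test sets."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n_train = int(fractions[0] * len(shuffled))
    n_val = int(fractions[1] * len(shuffled))
    return {
        "training": shuffled[:n_train],
        "validation": shuffled[n_train:n_train + n_val],
        "test": shuffled[n_train + n_val:],  # remainder after the first two parts
    }


def k_fold(records, k=5):
    """Yield (development, held_out) pairs for k-fold cross-validation.

    With k == len(records) this becomes leave-one-out.
    """
    items = list(records)
    for fold in range(k):
        held_out = items[fold::k]
        development = [r for i, r in enumerate(items) if i % k != fold]
        yield development, held_out


def temporal_split(records, cutoff_year):
    """External temporal test set: records acquired in cutoff_year or later."""
    development = [r for r in records if r.year < cutoff_year]
    external = [r for r in records if r.year >= cutoff_year]
    return development, external


def geographic_split(records, external_site):
    """External geographic test set: all records from one held-out site."""
    development = [r for r in records if r.site != external_site]
    external = [r for r in records if r.site == external_site]
    return development, external


records = make_records()
internal = random_split(records)
print({name: len(part) for name, part in internal.items()})
```

In practice the split would usually be made per patient rather than per image, so that data from one patient never appears on both sides of a split; this sketch sidesteps that by giving each patient a single record.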
Pages: 1100-1109
Number of pages: 10