Data set terminology of deep learning in medicine: a historical review and recommendation

Cited by: 6
Authors
Walston, Shannon L. [1]
Seki, Hiroshi [1]
Takita, Hirotaka [1]
Mitsuyama, Yasuhito [1]
Sato, Shingo [2]
Hagiwara, Akifumi [3]
Ito, Rintaro [4]
Hanaoka, Shouhei [5]
Miki, Yukio [1]
Ueda, Daiju [1, 6, 7]
Affiliations
[1] Osaka Metropolitan Univ, Grad Sch Med, Dept Diagnost & Intervent Radiol, Osaka, Japan
[2] Thomas Jefferson Univ, Sidney Kimmel Canc Ctr, Philadelphia, PA USA
[3] Juntendo Univ, Sch Med, Dept Radiol, Tokyo, Japan
[4] Nagoya Univ, Dept Radiol, Nagoya, Japan
[5] Univ Tokyo Hosp, Dept Radiol, Tokyo, Japan
[6] Osaka Metropolitan Univ, Grad Sch Med, Dept Artificial Intelligence, Osaka, Japan
[7] Osaka Metropolitan Univ, Ctr Hlth Sci Innovat, Osaka, Japan
Keywords
Terminology; Artificial intelligence; Deep learning; Data partition; Data splitting; ARTIFICIAL-INTELLIGENCE; VALIDATION; MODEL; PROGNOSIS; TOOL
DOI
10.1007/s11604-024-01608-1
Chinese Library Classification (CLC) number
R8 [Special medicine]; R445 [Diagnostic imaging]
Discipline classification code
1002; 100207; 1009
Abstract
Medicine and deep learning-based artificial intelligence (AI) engineering represent two distinct fields each with decades of published history. The current rapid convergence of deep learning and medicine has led to significant advancements, yet it has also introduced ambiguity regarding data set terms common to both fields, potentially leading to miscommunication and methodological discrepancies. This narrative review aims to give historical context for these terms, accentuate the importance of clarity when these terms are used in medical deep learning contexts, and offer solutions to mitigate misunderstandings by readers from either field. Through an examination of historical documents, including articles, writing guidelines, and textbooks, this review traces the divergent evolution of terms for data sets and their impact. Initially, the discordant interpretations of the word 'validation' in medical and AI contexts are explored. We then show that in the medical field as well, terms traditionally used in the deep learning domain are becoming more common, with the data for creating models referred to as the 'training set', the data for tuning of parameters referred to as the 'validation (or tuning) set', and the data for the evaluation of models as the 'test set'. Additionally, the test sets used for model evaluation are classified into internal (random splitting, cross-validation, and leave-one-out) sets and external (temporal and geographic) sets. This review then identifies often misunderstood terms and proposes pragmatic solutions to mitigate terminological confusion in the field of deep learning in medicine. We support the accurate and standardized description of these data sets and the explicit definition of data set splitting terminologies in each publication. These are crucial methods for demonstrating the robustness and generalizability of deep learning applications in medicine. 
This review aspires to enhance the precision of communication, thereby fostering more effective and transparent research methodologies in this interdisciplinary field.
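The data set terms named in the abstract can be illustrated with a minimal Python sketch. This is not code from the review: the function names `split_internal` and `split_temporal`, the record fields `patient_id` and `year`, the split fractions, and the synthetic records are all hypothetical, chosen only to show how a training set, a validation (tuning) set, an internal test set obtained by random splitting, and a temporal external test set relate to one another.

```python
import random


def split_internal(records, val_frac=0.15, test_frac=0.15, seed=0):
    """Randomly partition records into training, validation (tuning),
    and internal test sets. Fractions are illustrative assumptions."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_frac)
    n_val = int(len(shuffled) * val_frac)
    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test


def split_temporal(records, cutoff_year):
    """Hold out records from cutoff_year onward as a temporal external
    test set; earlier records form the development data."""
    development = [r for r in records if r["year"] < cutoff_year]
    external_test = [r for r in records if r["year"] >= cutoff_year]
    return development, external_test


# Hypothetical records: one per patient, spread evenly over 2018-2022.
records = [{"patient_id": i, "year": 2018 + i % 5} for i in range(100)]

# Temporal external test set first, then a random internal split of the rest.
development, external_test = split_temporal(records, cutoff_year=2022)
train, val, test = split_internal(development)

print(len(train), len(val), len(test), len(external_test))
```

In this sketch the model would be fitted on `train`, tuned on `val`, and evaluated on `test` (internal) and `external_test` (temporal external). The sketch assumes one record per patient; with several records per patient, the split would need to be made at the patient level so that no patient appears in more than one set.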
Pages: 1100-1109
Page count: 10
Related papers
50 in total
  • [11] Application of Deep Learning on Single-cell RNA Sequencing Data Analysis: A Review
    Brendel, Matthew
    Su, Chang
    Bai, Zilong
    Zhang, Hao
    Elemento, Olivier
    Wang, Fei
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2022, 20 (05) : 814 - 835
  • [12] Towards computational solutions for precision medicine based big data healthcare system using deep learning models: A review
    Thirunavukarasu, Ramkumar
    Doss, C. George Priya
    Gnanasambandan, R.
    Gopikrishnan, Mohanraj
    Palanisamy, Venketesh
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 149
  • [13] A survey on deep learning in medicine: Why, how and when?
    Piccialli, Francesco
    Di Somma, Vittorio
    Giampaolo, Fabio
    Cuomo, Salvatore
    Fortino, Giancarlo
    INFORMATION FUSION, 2021, 66 : 111 - 137
  • [14] Deep Learning and Neurology: A Systematic Review
    Valliani, Aly Al-Amyn
    Ranti, Daniel
    Oermann, Eric Karl
    NEUROLOGY AND THERAPY, 2019, 8 (02) : 351 - 365
  • [15] An Introductory Review of Deep Learning for Prediction Models With Big Data
    Emmert-Streib, Frank
    Yang, Zhen
    Feng, Han
    Tripathi, Shailesh
    Dehmer, Matthias
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2020, 3
  • [16] Deep Learning for Diabetes: A Systematic Review
    Zhu, Taiyu
    Li, Kezhi
    Herrero, Pau
    Georgiou, Pantelis
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (07) : 2744 - 2757
  • [17] A Comprehensive Review on Deep Learning-Based Data Fusion
    Hussain, Mazhar
    O'Nils, Mattias
    Lundgren, Jan
    Mousavirad, Seyed Jalaleddin
    IEEE ACCESS, 2024, 12 : 180093 - 180124
  • [18] Big data and deep learning in preventive and rehabilitation medicine
    Jaeger, M.
    Mayer, C.
    Hefter, H.
    Siebler, M.
    Kecskemethy, A.
    ORTHOPADE, 2018, 47 (10) : 826 - 833
  • [19] Review of Deep Learning-Based Personalized Learning Recommendation
    Zhong, Ling
    Wei, Yantao
    Yao, Huang
    Deng, Wei
    Wang, Zhifeng
    Tong, Mingwen
    2020 11TH INTERNATIONAL CONFERENCE ON E-EDUCATION, E-BUSINESS, E-MANAGEMENT, AND E-LEARNING (IC4E 2020), 2020 : 145 - 149
  • [20] Rise of Deep Learning for Genomic, Proteomic, and Metabolomic Data Integration in Precision Medicine
    Grapov, Dmitry
    Fahrmann, Johannes
    Wanichthanarak, Kwanjeera
    Khoomrung, Sakda
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2018, 22 (10) : 630 - 636