Clinical data mining: challenges, opportunities, and recommendations for translational applications

被引:2
作者
Qiao, Huimin [1 ]
Chen, Yijing [2 ]
Qian, Changshun [3 ]
Guo, You [1 ,2 ,3 ,4 ]
机构
[1] Gannan Med Univ, Affiliated Hosp 1, Med Big Data & Bioinformat Res Ctr, Ganzhou, Peoples R China
[2] Gannan Med Univ, Sch Publ Hlth & Hlth Management, Ganzhou, Peoples R China
[3] Jiangxi Univ Sci & Technol, Sch Informat Engn, Ganzhou, Peoples R China
[4] Ganzhou Key Lab Med Big Data, Ganzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Clinical data mining; Transformative application; Heterogeneity; Analytic workflow; Predictive model; FALSE DISCOVERY RATE; MENDELIAN RANDOMIZATION; STATISTICAL CONSIDERATIONS; SUBGROUP ANALYSIS; LUNG-CANCER; BIG DATA; TRIALS; SCORE; RISK; ASSOCIATION;
D O I
10.1186/s12967-024-05005-0
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Clinical data mining of predictive models offers significant advantages for re-evaluating and leveraging large amounts of complex clinical real-world data and experimental comparison data for tasks such as risk stratification, diagnosis, classification, and survival prediction. However, its translational application is still limited. One challenge is that the proposed clinical requirements and data mining are not synchronized. Additionally, the exotic predictions of data mining are difficult to apply directly in local medical institutions. Hence, it is necessary to incisively review the translational application of clinical data mining, providing an analytical workflow for developing and validating prediction models to ensure the scientific validity of analytic workflows in response to clinical questions. This review systematically revisits the purpose, process, and principles of clinical data mining and discusses the key causes contributing to the detachment from practice and the misuse of model verification in developing predictive models for research. Based on this, we propose a niche-targeting framework of four principles: Clinical Contextual, Subgroup-Oriented, Confounder- and False Positive-Controlled (CSCF), to provide guidance for clinical data mining prior to the model's development in clinical settings. Eventually, it is hoped that this review can help guide future research and develop personalized predictive models to achieve the goal of discovering subgroups with varied remedial benefits or risks and ensuring that precision medicine can deliver its full potential.
引用
收藏
页数:17
相关论文
共 175 条
  • [31] Role of Artificial Intelligence in Patient Safety Outcomes: Systematic Literature Review
    Choudhury, Avishek
    Asan, Onur
    [J]. JMIR MEDICAL INFORMATICS, 2020, 8 (07)
  • [32] Problems in the deployment of machine-learned models in health care
    Cohen, Joseph Paul
    Cao, Tianshi
    Viviano, Joseph D.
    Huang, Chin-Wei
    Fralick, Michael
    Ghassemi, Marzyeh
    Mamdani, Muhammad
    Greiner, Russell
    Bengio, Yoshua
    [J]. CANADIAN MEDICAL ASSOCIATION JOURNAL, 2021, 193 (35) : E1391 - E1394
  • [33] Reporting of artificial intelligence prediction models
    Collins, Gary S.
    Moons, Karel G. M.
    [J]. LANCET, 2019, 393 (10181) : 1577 - 1579
  • [34] Artificial intelligence in cancer research and precision medicine: Applications, limitations and priorities to drive transformation in the delivery of equitable and unbiased care
    Corti, Chiara
    Cobanaj, Marisa
    Dee, Edward C.
    Criscitiello, Carmen
    Tolaney, Sara M.
    Celi, Leo A.
    Curigliano, Giuseppe
    [J]. CANCER TREATMENT REVIEWS, 2023, 112
  • [35] Do observational studies using propensity score methods agree with randomized trials? A systematic comparison of studies on acute coronary syndromes
    Dahabreh, Issa J.
    Sheldrick, Radley C.
    Paulus, Jessica K.
    Chung, Mei
    Varvarigou, Vasileia
    Jafri, Haseeb
    Rassen, Jeremy A.
    Trikalinos, Thomas A.
    Kitsios, Georgios D.
    [J]. EUROPEAN HEART JOURNAL, 2012, 33 (15) : 1893 - 1901
  • [36] Perspectives on validation of clinical predictive algorithms
    de Hond, Anne A. H.
    Shah, Vaibhavi B. B.
    Kant, Ilse M. J.
    Van Calster, Ben
    Steyerberg, Ewout W.
    Hernandez-Boussard, Tina
    [J]. NPJ DIGITAL MEDICINE, 2023, 6 (01)
  • [37] Investigating Subgroup Effects in Randomized Clinical Trials
    Dehbi, Hakim-Moulay
    Hackshaw, Allan
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2017, 35 (02) : 253 - +
  • [38] Mendelian randomization as an instrumental variable approach to causal inference
    Didelez, Vanessa
    Sheehan, Nuala
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2007, 16 (04) : 309 - 330
  • [39] Reproducibility, relevance and reliability as barriers to efficient and credible biomedical technology translation
    Dirnagl, Ulrich
    Duda, Georg N.
    Grainger, David W.
    Reinke, Petra
    Roubenoff, Ronenn
    [J]. ADVANCED DRUG DELIVERY REVIEWS, 2022, 182
  • [40] General guidance on exploratory and confirmatory subgroup analysis in late-stage clinical trials
    Dmitrienko, Alex
    Muysers, Christoph
    Fritsch, Arno
    Lipkovich, Ilya
    [J]. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2016, 26 (01) : 71 - 98