Clinical data mining: challenges, opportunities, and recommendations for translational applications

被引:2
作者
Qiao, Huimin [1 ]
Chen, Yijing [2 ]
Qian, Changshun [3 ]
Guo, You [1 ,2 ,3 ,4 ]
机构
[1] Gannan Med Univ, Affiliated Hosp 1, Med Big Data & Bioinformat Res Ctr, Ganzhou, Peoples R China
[2] Gannan Med Univ, Sch Publ Hlth & Hlth Management, Ganzhou, Peoples R China
[3] Jiangxi Univ Sci & Technol, Sch Informat Engn, Ganzhou, Peoples R China
[4] Ganzhou Key Lab Med Big Data, Ganzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Clinical data mining; Transformative application; Heterogeneity; Analytic workflow; Predictive model; FALSE DISCOVERY RATE; MENDELIAN RANDOMIZATION; STATISTICAL CONSIDERATIONS; SUBGROUP ANALYSIS; LUNG-CANCER; BIG DATA; TRIALS; SCORE; RISK; ASSOCIATION;
D O I
10.1186/s12967-024-05005-0
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Clinical data mining of predictive models offers significant advantages for re-evaluating and leveraging large amounts of complex clinical real-world data and experimental comparison data for tasks such as risk stratification, diagnosis, classification, and survival prediction. However, its translational application is still limited. One challenge is that the proposed clinical requirements and data mining are not synchronized. Additionally, the exotic predictions of data mining are difficult to apply directly in local medical institutions. Hence, it is necessary to incisively review the translational application of clinical data mining, providing an analytical workflow for developing and validating prediction models to ensure the scientific validity of analytic workflows in response to clinical questions. This review systematically revisits the purpose, process, and principles of clinical data mining and discusses the key causes contributing to the detachment from practice and the misuse of model verification in developing predictive models for research. Based on this, we propose a niche-targeting framework of four principles: Clinical Contextual, Subgroup-Oriented, Confounder- and False Positive-Controlled (CSCF), to provide guidance for clinical data mining prior to the model's development in clinical settings. Eventually, it is hoped that this review can help guide future research and develop personalized predictive models to achieve the goal of discovering subgroups with varied remedial benefits or risks and ensuring that precision medicine can deliver its full potential.
引用
收藏
页数:17
相关论文
共 175 条
  • [1] Tutorial on statistical considerations on subgroup analysis in confirmatory clinical trials
    Alosh, Mohamed
    Huque, Mohammad F.
    Bretz, Frank
    D'Agostino, Ralph B., Sr.
    [J]. STATISTICS IN MEDICINE, 2017, 36 (08) : 1334 - 1360
  • [2] [Anonymous], 2019, Correction to Lancet Oncol, V20, P262
  • [3] Evaluation of machine learning solutions in medicine
    Antoniou, Tony
    Mamdani, Muhammad
    [J]. CANADIAN MEDICAL ASSOCIATION JOURNAL, 2021, 193 (36) : E1425 - E1429
  • [4] A qualitative signature for early diagnosis of hepatocellular carcinoma based on relative expression orderings
    Ao, Lu
    Zhang, Zimei
    Guan, Qingzhou
    Guo, Yating
    Guo, You
    Zhang, Jiahui
    Lv, Xingwei
    Huang, Haiyan
    Zhang, Huarong
    Wang, Xianlong
    Guo, Zheng
    [J]. LIVER INTERNATIONAL, 2018, 38 (10) : 1812 - 1819
  • [5] Methods in Comparative Effectiveness Research
    Armstrong, Katrina
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2012, 30 (34) : 4208 - 4214
  • [6] When to use the Bonferroni correction
    Armstrong, Richard A.
    [J]. OPHTHALMIC AND PHYSIOLOGICAL OPTICS, 2014, 34 (05) : 502 - 508
  • [7] FDAApproval Summary: Olaparib Monotherapy or in Combination with Bevacizumab for the Maintenance Treatment of Patients with Advanced Ovarian Cancer
    Arora, Shaily
    Balasubramaniam, Sanjeeve
    Zhang, Hui
    Berman, Tara
    Narayan, Preeti
    Suzman, Daniel
    Bloomquist, Erik
    Tang, Shenghui
    Gong, Yutao
    Sridhara, Rajeshwari
    Turcu, Francisca Reyes
    Chatterjee, Deb
    Saritas-Yildirim, Banu
    Ghosh, Soma
    Philip, Reena
    Pathak, Anand
    Gao, Jennifer J.
    Amiri-Kordestani, Laleh
    Pazdur, Richard
    Beaver, Julia A.
    [J]. ONCOLOGIST, 2021, 26 (01) : E164 - E172
  • [8] Association of Longer Leukocyte Telomere Length With Cardiac Size, Function, and Heart Failure
    Aung, Nay
    Wang, Qingning
    van Duijvenboden, Stefan
    Burns, Richard
    Stoma, Svetlana
    Raisi-Estabragh, Zahra
    Ahmet, Selda
    Allara, Elias
    Wood, Angela
    Di Angelantonio, Emanuele
    Danesh, John
    Munroe, Patricia B.
    Young, Alistair
    Harvey, Nicholas C.
    Codd, Veryan
    Nelson, Christopher P.
    Petersen, Steffen E.
    Samani, Nilesh J.
    [J]. JAMA CARDIOLOGY, 2023, 8 (09) : 808 - 815
  • [9] Instrumental variable methods for causal inference
    Baiocchi, Michael
    Cheng, Jing
    Small, Dylan S.
    [J]. STATISTICS IN MEDICINE, 2014, 33 (13) : 2297 - 2340
  • [10] External validation of a convolutional neural network artificial intelligence tool to predict malignancy in pulmonary nodules
    Baldwin, David R.
    Gustafson, Jennifer
    Pickup, Lyndsey
    Arteta, Carlos
    Novotny, Petr
    Declerck, Jerome
    Kadir, Timor
    Figueiras, Catarina
    Sterba, Albert
    Exell, Alan
    Potesil, Vaclav
    Holland, Paul
    Spence, Hazel
    Clubley, Alison
    O'Dowd, Emma
    Clark, Matthew
    Ashford-Turner, Victoria
    Callister, Matthew E. J.
    Gleeson, Fergus, V
    [J]. THORAX, 2020, 75 (04) : 306 - 312