Detecting irregularities in randomized controlled trials using machine learning

被引:0
作者
Nelson, Walter [1 ]
Petch, Jeremy [1 ,2 ,3 ]
Ranisau, Jonathan
Zhao, Robin
Balasubramanian, Kumar
Bangdiwala, Shrikant, I [2 ]
机构
[1] Hamilton Hlth Sci, Ctr Data Sci & Digital Hlth, Hamilton, ON, Canada
[2] Populat Hlth Res Inst, 20 Copeland Ave, Hamilton L8L 2X2, ON, Canada
[3] McMaster Univ, Dept Med, Hamilton, ON, Canada
关键词
Central statistical monitoring; machine learning; artificial intelligence; outlier detection; quality assurance; data quality; randomized controlled trials; DABIGATRAN;
D O I
10.1177/17407745241297947
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Background: Over the course of a clinical trial, irregularities may arise in the data. Trialists implement human-intensive, expensive central statistical monitoring procedures to identify and correct these irregularities before the results of the trial are analyzed and disseminated. Machine learning algorithms have shown promise for identifying center-level irregularities in multi-center clinical trials with minimal human intervention. We aimed to characterize the form-level data irregularities in several historical clinical trials and evaluate the ability of a machine learning-based outlier detection algorithm to identify them.Methods: Data irregularities previously identified by humans in historical clinical trials were ascertained by comparing preliminary snapshots of the trial databases to the final, locked databases. We measured the ability of a machine learning based outlier detection algorithm to identify form-level irregularities using concordance (area under the receiver operator characteristic), positive predictive value (precision), and sensitivity (recall).Results: We examined preliminary snapshots of seven historical clinical trials which randomized a total of 77,001 participants. We extracted a total of 1,267,484 completed entries from 358 case report forms containing irregularities from all snapshots across all trials, containing a total of 24,850 form-wide irregularities (median per-form form-level irregularity rate: 1.81%). Our proposed machine learning algorithm detects form-level irregularities with a median concordance of 0.74 (interquartile range = 0.57-0.89), slightly exceeding the performance of a previously proposed machine learning approach with a median area under the receiver operator characteristic of 0.73 (interquartile range = 0.54-0.88).Conclusion: Data irregularities in historical clinical trials were ascertained by comparing preliminary snapshots of the trial database to the final database. These irregularities can be categorized according to their scope. Irregularities can be successfully detected by a machine learning algorithm as early or earlier than a human can, without human intervention. Such an approach may complement existing techniques for central statistical monitoring in large multi-center randomized controlled trials and possibly improve the efficiency of costly data verification processes.
引用
收藏
页码:178 / 187
页数:10
相关论文
共 50 条
  • [31] Detecting IoT Attacks using Multi-Layer Data Through Machine Learning
    Alam, Hina
    Yaqub, Muhammad Shaharyar
    Nadir, Ibrahim
    2022 SECOND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND HIGH PERFORMANCE COMPUTING (DCHPC), 2022, : 52 - 59
  • [32] Research on financial irregularities identification: a machine learning perspective
    Chai K.-C.
    Zhu J.
    Yang Y.
    Lan H.-R.
    Ou Y.-L.
    Li Q.
    International Journal of Internet Manufacturing and Services, 2022, 8 (04): : 383 - 399
  • [33] Detecting Crown Rot Disease in Wheat in Controlled Environment Conditions Using Digital Color Imaging and Machine Learning
    Xie, Yiting
    Plett, Darren
    Liu, Huajian
    AGRIENGINEERING, 2022, 4 (01): : 141 - 155
  • [34] Detecting insurance fraud using supervised and unsupervised machine learning
    Debener, Joern
    Heinke, Volker
    Kriebel, Johannes
    JOURNAL OF RISK AND INSURANCE, 2023, 90 (03) : 743 - 768
  • [35] Machine Learning for Detecting Blood Transfusion Needs Using Biosignals
    Ko H.
    Park C.
    Kang W.S.
    Nam Y.
    Yoon D.
    Lee J.
    Computer Systems Science and Engineering, 2023, 46 (02): : 2369 - 2381
  • [36] Detecting and Preventing ROP Attacks using Machine Learning on ARM
    Welearegai, Gebrehiwet B.
    Hu, Chenpo
    Hammer, Christian
    2023 IEEE 47TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC, 2023, : 667 - 677
  • [37] Detecting BGP Routing Anomalies Using Machine Learning: A Review
    Muosa, Ali Hassan
    Ali, A. H.
    FORTHCOMING NETWORKS AND SUSTAINABILITY IN THE AIOT ERA, VOL 1, FONES-AIOT 2024, 2024, 1035 : 145 - 164
  • [38] Detecting Architectural Integrity Violation Patterns Using Machine Learning
    Zakurdaeva, Alla
    Weiss, Michael
    Muegge, Steven
    PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 1480 - 1487
  • [39] Detecting Arsenic Contamination Using Satellite Imagery and Machine Learning
    Agrawal, Ayush
    Petersen, Mark R.
    TOXICS, 2021, 9 (12)
  • [40] Detecting Object Open Angle and Direction Using Machine Learning
    Zhang, Yiwen
    Ren, Yongxiong
    Xie, Guodong
    Wang, Zhi
    Zhang, Hao
    Xu, Tianxu
    Xu, Haoyuan
    Huang, Hao
    Bao, Changjing
    Pan, Zhongqi
    Yue, Yang
    IEEE ACCESS, 2020, 8 : 12300 - 12306