Assessing machine learning approaches for predicting failures of investigational drug candidates during clinical trials

被引:8
|
作者
John, Lijo [1 ,2 ,3 ]
Mahanta, Hridoy Jyoti [1 ,3 ]
Soujanya, Y. [2 ,3 ]
Sastry, G. Narahari [1 ,2 ,3 ]
机构
[1] CSIR North East Inst Sci & Technol, Adv Computat & Data Sci Div, Jorhat 785006, Assam, India
[2] CSIR Indian Inst Chem Technol, Polymers & Funct Mat Div, Hyderabad 500007, India
[3] Acad Sci & Innovat Res AcSIR, Ghaziabad 201002, Uttar Pradesh, India
关键词
Machine learning; Clinical trials; Molecular descriptors; Fingerprints; Feature selection; FEATURE-SELECTION; SUCCESS; ALERTS; SYSTEM;
D O I
10.1016/j.compbiomed.2022.106494
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
One of the major challenges in drug development is having acceptable levels of efficacy and safety throughout all the phases of clinical trials followed by the successful launch in the market. While there are many factors such as molecular properties, toxicity parameters, mechanism of action at the target site, etc. that regulates the thera-peutic action of a compound, a holistic approach directed towards data-driven studies will invariably strengthen the predictive toxicological sciences. Our quest for the current study is to find out various reasons as to why an investigational candidate would fail in the clinical trials after multiple iterations of refinement and optimization. We have compiled a dataset that comprises of approved and withdrawn drugs as well as toxic compounds and essentially have used time-split based approach to generate the training and validation set. Five highly robust and scalable machine learning binary classifiers were used to develop the predictive models that were trained with features like molecular descriptors and fingerprints and then validated rigorously to achieve acceptable performance in terms of a set of performance metrics. The mean AUC scores for all the five classifiers with the hold-out test set were obtained in the range of 0.66-0.71. The models were further used to predict the probability score for the clinical candidate dataset. The top compounds predicted to be toxic were analyzed to estimate different dimensions of toxicity. Apparently, through this study, we propose that with the appropriate use of feature extraction and machine learning methods, one can estimate the likelihood of success or failure of investigational drugs candidates thereby opening an avenue for future trends in computational toxicological studies. The models developed in the study can be accessed at https://github.com/gnsastry/predicting_clinical_t rials.git.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Machine Learning for Clinical Trials in the Era of COVID-19
    Zame, William R.
    Bica, Ioana
    Shen, Cong
    Curth, Alicia
    Lee, Hyun-Suk
    Bailey, Stuart
    Weatherall, James
    Wright, David
    Bretz, Frank
    van der Schaar, Mihaela
    STATISTICS IN BIOPHARMACEUTICAL RESEARCH, 2020, 12 (04): : 506 - 517
  • [42] Predicting glioma grades: integrating clinical and molecular data with machine learning and explainable AI
    Tuysuzoglu, Goksu
    Tokmak, Ozge Kart
    INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2024, 12 (04)
  • [43] Predicting Clinical Outcomes in COVID-19 and Pneumonia Patients: A Machine Learning Approach
    Cai, Kaida
    Wang, Zhengyan
    Yang, Xiaofang
    Fu, Wenzhi
    Zhao, Xin
    VIRUSES-BASEL, 2024, 16 (10):
  • [44] Predicting Employability of Candidates: Comparative Study of Different Machine Learning Models
    Hitharth, K. B. Sai
    Dhanya, N. M.
    PROCEEDINGS OF EMERGING TRENDS AND TECHNOLOGIES ON INTELLIGENT SYSTEMS (ETTIS 2021), 2022, 1371 : 179 - 190
  • [45] Comparing Deep Learning and Classical Machine Learning Approaches for Predicting Inpatient Violence Incidents from Clinical Text
    Menger, Vincent
    Scheepers, Floor
    Spruit, Marco
    APPLIED SCIENCES-BASEL, 2018, 8 (06):
  • [46] Leveraging Bioinformatics and Machine Learning for Identifying Prognostic Biomarkers and Predicting Clinical Outcomes in Lung Adenocarcinoma
    Cai, Kaida
    Fu, Wenzhi
    Liu, Hanwen
    Yang, Xiaofang
    Wang, Zhengyan
    Zhao, Xin
    GENES, 2024, 15 (12)
  • [47] Clinical trials: Management of investigational products during the COVID-19 pandemic
    Flet, Laurent
    Tching-Sin, Martine
    Chanat, Adeline
    Gendre, Pauline
    Tibi, Annick
    AMERICAN JOURNAL OF HEALTH-SYSTEM PHARMACY, 2020, 77 (16) : 1277 - 1278
  • [48] Regulatory Considerations on the use of Machine Learning based tools in Clinical Trials
    Massella, Maurizio
    Dri, Diego Alejandro
    Gramaglia, Donatella
    HEALTH AND TECHNOLOGY, 2022, 12 (06) : 1085 - 1096
  • [49] Machine Learning Approaches for Clinical Psychology and Psychiatry
    Dwyer, Dominic B.
    Falkai, Peter
    Koutsouleris, Nikolaos
    ANNUAL REVIEW OF CLINICAL PSYCHOLOGY, VOL 14, 2018, 14 : 91 - 118
  • [50] How to Solve Clinical Challenges in Mood Disorders; Machine Learning Approaches Using Electrophysiological Markers
    Song, Young Wook
    Lee, Ho Sung
    Kim, Sungkean
    Kim, Kibum
    Kim, Bin-Na
    Kim, Ji Sun
    CLINICAL PSYCHOPHARMACOLOGY AND NEUROSCIENCE, 2024, 22 (03) : 416 - 430