TranScreen: Transfer Learning on Graph-Based Anti-Cancer Virtual Screening Model

被引:6
作者
Salem, Milad [1 ]
Khormali, Aminollah [1 ]
Arshadi, Arash Keshavarzi [2 ]
Webb, Julia [2 ]
Yuan, Jiann-Shiun [1 ]
机构
[1] Univ Cent Florida, Dept Elect & Comp Engn, Orlando, FL 32816 USA
[2] Univ Cent Florida, Burnett Sch Biomed Sci, Orlando, FL 32816 USA
关键词
cancer; drug discovery; machine learning; transfer learning; virtual screening; DRUG DISCOVERY; CANCER; P53;
D O I
10.3390/bdcc4030016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning's automatic feature extraction has proven its superior performance over traditional fingerprint-based features in the implementation of virtual screening models. However, these models face multiple challenges in the field of early drug discovery, such as over-training and generalization to unseen data, due to the inherently unbalanced and small datasets. In this work, the TranScreen pipeline is proposed, which utilizes transfer learning and a collection of weight initializations to overcome these challenges. An amount of 182 graph convolutional neural networks are trained on molecular source datasets and the learned knowledge is transferred to the target task for fine-tuning. The target task of p53-based bioactivity prediction, an important factor for anti-cancer discovery, is chosen to showcase the capability of the pipeline. Having trained a collection of source models, three different approaches are implemented to compare and rank them for a given task before fine-tuning. The results show improvement in performance of the model in multiple cases, with the best model increasing the area under receiver operating curve ROC-AUC from 0.75 to 0.91 and the recall from 0.25 to 1. This improvement is vital for practical virtual screening via lowering the false negatives and demonstrates the potential of transfer learning. The code and pre-trained models are made accessible online.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 58 条
  • [1] Low Data Drug Discovery with One-Shot Learning
    Altae-Tran, Han
    Ramsundar, Bharath
    Pappu, Aneesh S.
    Pande, Vijay
    [J]. ACS CENTRAL SCIENCE, 2017, 3 (04) : 283 - 293
  • [2] Bray Freddie, 2018, CA Cancer J Clin, V68, P394, DOI [10.3322/caac.21609, 10.3322/caac.21492]
  • [3] DeepMalaria: Artificial Intelligence Driven Discovery of Potent Antiplasmodials
    Arshadi, Arash Keshavarzi
    Salem, Milad
    Collins, Jennifer
    Yuan, Jiann Shiun
    Chakrabarti, Debopam
    [J]. FRONTIERS IN PHARMACOLOGY, 2020, 10
  • [4] Stress signals utilize multiple pathways to stabilize p53
    Ashcroft, M
    Taya, Y
    Vousden, KH
    [J]. MOLECULAR AND CELLULAR BIOLOGY, 2000, 20 (09) : 3224 - 3233
  • [5] Why are there hotspot mutations in the TP53 gene in human cancers?
    Baugh, Evan H.
    Ke, Hua
    Levine, Arnold J.
    Bonneau, Richard A.
    Chan, Chang S.
    [J]. CELL DEATH AND DIFFERENTIATION, 2018, 25 (01) : 154 - 160
  • [6] Ben-David Shai, 2006, PROC ADV NEURAL INF
  • [7] Bjerrum EJ., 2017, ARXIV PREPRINT ARXIV
  • [8] Boumi S., 2020, ARXIV200310874
  • [9] Opinion - Migrating cancer stem cells - an integrated concept of malignant tumour progression
    Brabletz, T
    Jung, A
    Spaderna, S
    Hlubek, F
    Kirchner, T
    [J]. NATURE REVIEWS CANCER, 2005, 5 (09) : 744 - 749
  • [10] High throughput screening in drug discovery
    Carnero A.
    [J]. Clinical and Translational Oncology, 2006, 8 (7) : 482 - 490