On the robustness of generalization of drug-drug interaction models

被引:3
作者
Kpanou, Rogia [1 ,2 ]
Osseni, Mazid Abiodoun [1 ]
Tossou, Prudencio [1 ,2 ]
Laviolette, Francois [1 ]
Corbeil, Jacques [3 ]
机构
[1] Univ Laval, Comp Sci & Software Engn, 1065 Av Med, Quebec City, PQ, Canada
[2] InVivo AI, Mila 180 Corp Lab L,6650,01 Rue St Urba, Montreal, PQ H2S 3G9, Canada
[3] Univ Laval, Dept Mol Med, 1065 Av Med, Quebec City, PQ, Canada
关键词
Drug-drug interaction; Side effects; Deep learning; Robustness; Generalizability; LANGUAGE; NETWORK; SYSTEM;
D O I
10.1186/s12859-021-04398-9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Deep learning methods are a proven commodity in many fields and endeavors. One of these endeavors is predicting the presence of adverse drug-drug interactions (DDIs). The models generated can predict, with reasonable accuracy, the phenotypes arising from the drug interactions using their molecular structures. Nevertheless, this task requires improvement to be truly useful. Given the complexity of the predictive task, an extensive benchmarking on structure-based models for DDIs prediction was performed to evaluate their drawbacks and advantages. Results We rigorously tested various structure-based models that predict drug interactions using different splitting strategies to simulate different real-world scenarios. In addition to the effects of different training and testing setups on the robustness and generalizability of the models, we then explore the contribution of traditional approaches such as multitask learning and data augmentation. Conclusion Structure-based models tend to generalize poorly to unseen drugs despite their ability to identify new DDIs among drugs seen during training accurately. Indeed, they efficiently propagate information between known drugs and could be valuable for discovering new DDIs in a database. However, these models will most probably fail when exposed to unknown drugs. While multitask learning does not help in our case to solve the problem, the use of data augmentation does at least mitigate it. Therefore, researchers must be cautious of the bias of the random evaluation scheme, especially if their goal is to discover new DDIs.
引用
收藏
页数:21
相关论文
共 59 条
  • [51] SMILES, A CHEMICAL LANGUAGE AND INFORMATION-SYSTEM .1. INTRODUCTION TO METHODOLOGY AND ENCODING RULES
    WEININGER, D
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1988, 28 (01): : 31 - 36
  • [52] Xu K.X, 2018, Precambrian Geochronology and Geochemistry of the Tectonic Conjunction of the Qinling‐Qilian Orogen: Evidence from the Longshan Complex
  • [53] SumGNN: multi-typed drug interaction prediction via efficient knowledge graph summarization
    Yu, Yue
    Huang, Kexin
    Zhang, Chao
    Glass, Lucas M.
    Sun, Jimeng
    Xiao, Cao
    [J]. BIOINFORMATICS, 2021, 37 (18) : 2988 - 2995
  • [54] Label Propagation Prediction of Drug-Drug Interactions Based on Clinical Side Effects
    Zhang, Ping
    Wang, Fei
    Hu, Jianying
    Sorrentino, Robert
    [J]. SCIENTIFIC REPORTS, 2015, 5
  • [55] Predicting drug-drug interactions using multi-modal deep auto-encoders based network embedding and positive-unlabeled learning
    Zhang, Yang
    Qiu, Yang
    Cui, Yuxin
    Liu, Shichao
    Zhang, Wen
    [J]. METHODS, 2020, 179 : 37 - 46
  • [56] Prediction of Drug Combinations by Integrating Molecular and Pharmacological Data
    Zhao, Xing-Ming
    Iskar, Murat
    Zeller, Georg
    Kuhn, Michael
    van Noort, Vera
    Bork, Peer
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (12)
  • [57] An attention-based effective neural model for drug-drug interactions extraction
    Zheng, Wei
    Lin, Hongfei
    Luo, Ling
    Zhao, Zhehuan
    Li, Zhengguang
    Zhang, Yijia
    Yang, Zhihao
    Wang, Jian
    [J]. BMC BIOINFORMATICS, 2017, 18
  • [58] Modeling polypharmacy side effects with graph convolutional networks
    Zitnik, Marinka
    Agrawal, Monica
    Leskovec, Jure
    [J]. BIOINFORMATICS, 2018, 34 (13) : 457 - 466
  • [59] Zitnik M, 2016, BIOCOMPUT-PAC SYM, P81