Reducing false positive rate of docking-based virtual screening by active learning

Cited by: 7
Authors
Wang, Lei [1]
Shi, Shao-Hua [2]
Li, Hui [1]
Zeng, Xiang-Xiang [1, 3]
Liu, Su-You
Liu, Zhao-Qian [1]
Deng, Ya-Feng [4]
Lu, Ai-Ping [5]
Hou, Ting-Jun [6]
Cao, Dong-Sheng [1]
Affiliations
[1] Cent South Univ, Xiangya Sch Pharmaceut Sci, Changsha, Peoples R China
[2] Hong Kong Baptist Univ, Sch Chinese Med, Hong Kong, Peoples R China
[3] Hunan Univ, Dept Comp Sci, Changsha, Peoples R China
[4] CarbonSilicon AI Technol, Hangzhou, Peoples R China
[5] Hong Kong Baptist Univ, Inst Adv Translat Med Bone & Joint Dis, Sch Chinese Med, Hong Kong, Peoples R China
[6] Zhejiang Univ, Coll Pharmaceut Sci, Hangzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
molecular docking; machine learning-based scoring function (MLSF); active learning; virtual screening (VS); false positive; scoring functions
DOI
10.1093/bib/bbac626
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Machine learning-based scoring functions (MLSFs) have become an attractive alternative to classical scoring functions because of their potentially superior screening performance. However, the negative data used to construct MLSFs are rarely reported in the literature, and the putative inactive molecules recorded in existing databases are usually clearly biased relative to active molecules. Here we propose an easy-to-use method named AMLSF that combines MLSFs with active learning based on negative-molecule selection strategies; it iteratively improves the quality of the inactive set and thereby reduces the false positive rate of virtual screening. We chose energy auxiliary terms learning as the MLSF and validated the method on eight targets in the diverse subset of DUD-E. For each target, we screened the IterBioScreen database with AMLSF and compared the results with those of four control models. The number of active molecules among the top 1000 molecules identified by AMLSF was significantly higher than the number identified by the control models. In addition, free energy calculations for the top 10 molecules selected by AMLSF, the null model and the DUD-E-based control models also indicated that AMLSF identifies more active molecules and reduces the false positive rate.
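The abstract does not spell out the selection rule, so the following is only a minimal sketch of one plausible reading of the iterative loop, not the authors' implementation. It assumes that library compounds the current model scores highest are presumed inactives (hard negatives) and are added to the inactive set before retraining. The nearest-centroid scorer is a toy stand-in for the MLSF (it is not energy auxiliary terms learning), and the function names, the 2-D feature vectors, and the `rounds` and `batch` parameters are all hypothetical.

```python
import random


def centroid(rows):
    """Component-wise mean of a list of equal-length feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def score(x, pos_c, neg_c):
    """Toy stand-in for an MLSF: higher means closer to actives than inactives."""
    d_pos = sum((a - b) ** 2 for a, b in zip(x, pos_c))
    d_neg = sum((a - b) ** 2 for a, b in zip(x, neg_c))
    return d_neg - d_pos


def refine_negatives(actives, negatives, pool, rounds=3, batch=5):
    """Iteratively move the top-scored pool compounds into the inactive set.

    ASSUMPTION: pool compounds are presumed inactive, so the highest-scored
    ones are treated as likely false positives (hard negatives).
    """
    negatives = list(negatives)
    pool = list(pool)
    for _ in range(rounds):
        # "Retrain" the toy model on the current active and inactive sets.
        pos_c, neg_c = centroid(actives), centroid(negatives)
        # Screen the pool and take the compounds the model ranks highest.
        pool.sort(key=lambda x: score(x, pos_c, neg_c), reverse=True)
        hard, pool = pool[:batch], pool[batch:]
        negatives.extend(hard)
    return negatives, pool


# Synthetic 2-D "descriptors"; real inputs would be docking-derived features.
random.seed(0)
actives = [[random.gauss(2, 0.5), random.gauss(2, 0.5)] for _ in range(20)]
decoys = [[random.gauss(-2, 0.5), random.gauss(-2, 0.5)] for _ in range(20)]
library = [[random.gauss(0, 1.5), random.gauss(0, 1.5)] for _ in range(200)]

refined, remaining = refine_negatives(actives, decoys, library, rounds=3, batch=5)
print(len(refined), len(remaining))  # prints: 35 185
```

Each round tightens the decision boundary around the actives, because the added negatives are exactly the compounds the previous model would have ranked as hits. In practice the stopping criterion and batch size would need to be tuned per target.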
Pages: 11