Reducing false positive rate of docking-based virtual screening by active learning

Cited by: 11
Authors
Wang, Lei [1]
Shi, Shao-Hua [2]
Li, Hui [1]
Zeng, Xiang-Xiang [1, 3]
Liu, Su-You
Liu, Zhao-Qian [1]
Deng, Ya-Feng [4]
Lu, Ai-Ping [5]
Hou, Ting-Jun [6]
Cao, Dong-Sheng [1]
Affiliations
[1] Cent South Univ, Xiangya Sch Pharmaceut Sci, Changsha, Peoples R China
[2] Hong Kong Baptist Univ, Sch Chinese Med, Hong Kong, Peoples R China
[3] Hunan Univ, Dept Comp Sci, Changsha, Peoples R China
[4] CarbonSilicon AI Technol, Hangzhou, Peoples R China
[5] Hong Kong Baptist Univ, Inst Adv Translat Med Bone & Joint Dis, Sch Chinese Med, Hong Kong, Peoples R China
[6] Zhejiang Univ, Coll Pharmaceut Sci, Hangzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
molecular docking; machine learning-based scoring function (MLSF); active learning; virtual screening (VS); false positive; scoring functions
DOI
10.1093/bib/bbac626
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Machine learning-based scoring functions (MLSFs) have become a favorable alternative to classical scoring functions because of their potentially superior screening performance. However, the negative data used to construct MLSFs are rarely reported in the literature, and the putative inactive molecules recorded in existing databases are usually clearly biased relative to the active molecules. Here we propose an easy-to-use method named AMLSF that combines an MLSF with active learning based on negative molecular selection strategies; it iteratively improves the quality of the inactive sets and thus reduces the false positive rate of virtual screening. We chose energy auxiliary terms learning as the MLSF and validated our method on eight targets in the diverse subset of DUD-E. For each target, we screened the IterBioScreen database with AMLSF and compared the screening results with those of four control models. The results show that the number of active molecules among the top 1000 molecules identified by AMLSF was significantly higher than the numbers identified by the control models. In addition, free energy calculations for the top 10 molecules screened out by AMLSF, the null model and the control models based on DUD-E also showed that AMLSF identifies more active molecules and reduces the false positive rate.
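The iterative refinement of the inactive set described in the abstract can be illustrated with a toy loop. This is only a minimal sketch under stated assumptions, not the authors' AMLSF implementation: the nearest-centroid scorer stands in for the MLSF (the paper uses energy auxiliary terms learning), the rule "treat the top-scoring unlabeled molecules as new putative inactives" is one plausible negative selection strategy assumed here for illustration, and the names `refine_inactives`, `rounds` and `batch` are hypothetical.

```python
def centroid(rows):
    """Column-wise mean of a list of equal-length feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def score(x, c_act, c_inact):
    """Toy stand-in for an MLSF: higher means 'looks more active'.

    Computed as squared distance to the inactive centroid minus
    squared distance to the active centroid.
    """
    d_act = sum((a - b) ** 2 for a, b in zip(x, c_act))
    d_inact = sum((a - b) ** 2 for a, b in zip(x, c_inact))
    return d_inact - d_act


def refine_inactives(actives, inactives, pool, rounds=3, batch=5):
    """Iteratively enlarge the inactive set with hard negatives.

    ASSUMPTION (illustrative only): most molecules in a screening
    library are inactive, so unlabeled pool molecules that the current
    model scores highest are treated as likely false positives and
    added to the inactive set before the model is refit.
    """
    inactives = list(inactives)
    pool = list(pool)
    for _ in range(rounds):
        if not pool:
            break
        # "Refit" the toy model on the current active/inactive sets.
        c_act, c_inact = centroid(actives), centroid(inactives)
        # Rank the unlabeled pool from most to least active-looking.
        pool.sort(key=lambda x: score(x, c_act, c_inact), reverse=True)
        # Move the top-ranked molecules into the inactive set.
        picked, pool = pool[:batch], pool[batch:]
        inactives.extend(picked)
    return inactives, pool


if __name__ == "__main__":
    actives = [[1.0, 1.0], [1.2, 0.9]]
    inactives = [[-1.0, -1.0], [-0.8, -1.1]]
    pool = [[0.9, 1.0], [0.0, 0.0], [-1.0, -0.9], [1.1, 1.1]]
    refined, remaining = refine_inactives(actives, inactives, pool, rounds=1, batch=2)
    print(len(refined), len(remaining))
```

Each round moves the decision boundary toward the actives, because the newly added inactives are the ones the previous model found hardest to reject. In a real setting the feature vectors would come from docking poses or energy terms, and the selection rule would follow whatever strategy the paper actually specifies.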
Pages: 11