Reducing false positive rate of docking-based virtual screening by active learning

被引:7
作者
Wang, Lei [1 ]
Shi, Shao-Hua [2 ]
Li, Hui [1 ]
Zeng, Xiang-Xiang [1 ,3 ]
Liu, Su-You
Liu, Zhao-Qian [1 ]
Deng, Ya-Feng [4 ]
Lu, Ai-Ping [5 ]
Hou, Ting-Jun [6 ]
Cao, Dong-Sheng [1 ]
机构
[1] Cent South Univ, Xiangya Sch Pharmaceut Sci, Changsha, Peoples R China
[2] Hong Kong Baptist Univ, Sch Chinese Med, Hong Kong, Peoples R China
[3] Hunan Univ, Dept Comp Sci, Changsha, Peoples R China
[4] CarbonSilicon AI Technol, Hangzhou, Peoples R China
[5] Hong Kong Baptist Univ, Inst Adv Translat Med Bone & Joint Dis, Sch Chinese Med, Hong Kong, Peoples R China
[6] Zhejiang Univ, Coll Pharmaceut Sci, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
molecular docking; machine learning-based scoring function (MLSF); active learning; virtual screening (VS); false positive; SCORING FUNCTIONS;
D O I
10.1093/bib/bbac626
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Machine learning-based scoring functions (MLSFs) have become a very favorable alternative to classical scoring functions because of their potential superior screening performance. However, the information of negative data used to construct MLSFs was rarely reported in the literature, and meanwhile the putative inactive molecules recorded in existing databases usually have obvious bias from active molecules. Here we proposed an easy-to-use method named AMLSF that combines active learning using negative molecular selection strategies with MLSF, which can iteratively improve the quality of inactive sets and thus reduce the false positive rate of virtual screening. We chose energy auxiliary terms learning as the MLSF and validated our method on eight targets in the diverse subset of DUD-E. For each target, we screened the IterBioScreen database by AMLSF and compared the screening results with those of the four control models. The results illustrate that the number of active molecules in the top 1000 molecules identified by AMLSF was significantly higher than those identified by the control models. In addition, the free energy calculation results for the top 10 molecules screened out by the AMLSF, null model and control models based on DUD-E also proved that more active molecules can be identified, and the false positive rate can be reduced by AMLSF.
引用
收藏
页数:11
相关论文
共 50 条
[31]   Discovery of novel antagonists targeting the DNA binding domain of androgen receptor by integrated docking-based virtual screening and bioassays [J].
Pang, Jin-ping ;
Shen, Chao ;
Zhou, Wen-fang ;
Wang, Yun-xia ;
Shan, Lu-hu ;
Chai, Xin ;
Shao, Ying ;
Hu, Xue-ping ;
Zhu, Feng ;
Zhu, Dan-yan ;
Xiao, Li ;
Xu, Lei ;
Xu, Xiao-hong ;
Li, Dan ;
Hou, Ting-jun .
ACTA PHARMACOLOGICA SINICA, 2022, 43 (01) :229-239
[32]   Identification of Pim-1 Kinase Inhibitors by Pharmacophore Model, Molecular Docking-based Virtual Screening, and Biological Evaluation [J].
Huang, Jing ;
Yuan, Ye ;
Zhu, Xiaoxiao ;
Li, Guodong ;
Xu, Ya ;
Chen, Wenlin ;
Zhu, Ying .
CURRENT COMPUTER-AIDED DRUG DESIGN, 2022, 18 (03) :240-246
[33]   Docking-Based Virtual Screening Enables Prioritizing Protein Kinase Inhibitors With In Vitro Phenotypic Activity Against Schistosoma mansoni [J].
Moreira, Bernardo Pereira ;
Batista, Izabella Cristina Andrade ;
Tavares, Naiara Clemente ;
Armstrong, Tom ;
Gava, Sandra Grossi ;
Torres, Gabriella Parreiras ;
Mourao, Marina Moraes ;
Falcone, Franco H. .
FRONTIERS IN CELLULAR AND INFECTION MICROBIOLOGY, 2022, 12
[34]   Novel Acetylcholinesterase Inhibitors Identified from ZINC Database Using Docking-Based Virtual Screening for Alzheimer's Disease. [J].
Abdusalam, Ashraf Ahmed Ali ;
Vikneswaran, Murugaiyah .
CHEMISTRYSELECT, 2020, 5 (12) :3593-3599
[35]   Novel hits for acetylcholinesterase inhibition derived by docking-based screening on ZINC database [J].
Doytchinova, Irini ;
Atanasova, Mariyana ;
Valkova, Iva ;
Stavrakov, Georgi ;
Philipova, Irena ;
Zhivkova, Zvetanka ;
Zheleva-Dimitrova, Dimitrina ;
Konstantinov, Spiro ;
Dimitrov, Ivan .
JOURNAL OF ENZYME INHIBITION AND MEDICINAL CHEMISTRY, 2018, 33 (01) :768-776
[36]   Molecular Docking-Based Virtual Screening of FDA-Approved Drugs Using Trypanothione Reductase Identified New Trypanocidal Agents [J].
Gomez-Escobedo, Rogelio ;
Mendez-alvarez, Domingo ;
Vazquez, Citlali ;
Saavedra, Emma ;
Vazquez, Karina ;
Alcantara-Farfan, Veronica ;
Cordero-Martinez, Joaquin ;
Gonzalez-Gonzalez, Alonzo ;
Rivera, Gildardo ;
Nogueda-Torres, Benjamin .
MOLECULES, 2024, 29 (16)
[37]   Docking-based comparative intermolecular contacts analysis and in silico screening reveal new potent acetylcholinesterase inhibitors [J].
Habash, Maha ;
Abuhamdah, Sawsan ;
Younis, Khaled ;
Taha, Mutasem O. .
MEDICINAL CHEMISTRY RESEARCH, 2017, 26 (11) :2768-2784
[38]   REDUCE FALSE-POSITIVE RATE BY ACTIVE LEARNING FOR AUTOMATIC POLYP DETECTION IN COLONOSCOPY VIDEOS [J].
Guo, Zhe ;
Zhang, Ruiyao ;
Li, Qin ;
Liu, Xinkai ;
Nemoto, Daiki ;
Togashi, Kazutomo ;
Niroshana, Isuru S. M. ;
Shi, Yuchen ;
Zhu, Xin .
2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, :1655-1658
[39]   Advancing Ligand Docking through Deep Learning: Challenges and Prospects in Virtual Screening [J].
Zhang, Xujun ;
Shen, Chao ;
Zhang, Haotian ;
Kang, Yu ;
Hsieh, Chang-Yu ;
Hou, Tingjun .
ACCOUNTS OF CHEMICAL RESEARCH, 2024, 57 (10) :1500-1509
[40]   The rational search for PDE10A inhibitors from Sophora flavescens roots using pharmacophore- and docking-based virtual screening [J].
Fan, Han-Tian ;
Guo, Jun-Fang ;
Zhang, Yu-Xin ;
Gu, Yu-Xi ;
Ning, Zhong-Qi ;
Qiao, Yan-Jiang ;
Wang, Xing .
MOLECULAR MEDICINE REPORTS, 2018, 17 (01) :388-393