Deep learning-based classification model for GPR151 activator activity prediction

被引:2
作者
Xu, Huangchao [1 ,2 ]
Zhang, Baohua [1 ]
Liu, Qian [1 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Dongsheng Sourth St 2, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, 1 Yanqihu East Rd, Beijing 101408, Peoples R China
关键词
Activity prediction; Deep learning; Feature extractor; NEUROPATHIC PAIN; DOCKING; DESCRIPTORS;
D O I
10.1186/s12859-023-05369-y
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundGPR151 is a kind of protein belonging to G protein-coupled receptor family that is closely associated with a variety of physiological and pathological processes.The potential use of GPR151 as a therapeutic target for the management of metabolic disorders has been demonstrated in several studies, highlighting the demand to explore its activators further. Activity prediction serves as a vital preliminary step in drug discovery, which is both costly and time-consuming. Thus, the development of reliable activity classification model has become an essential way in the process of drug discovery, aiming to enhance the efficiency of virtual screening.ResultsWe propose a learning-based method based on feature extractor and deep neural network to predict the activity of GPR151 activators. We first introduce a new molecular feature extraction algorithm which utilizes the idea of bag-of-words model in natural language to densify the sparse fingerprint vector. Mol2vec method is also used to extract diverse features. Then, we construct three classical feature selection algorithms and three types of deep learning model to enhance the representational capacity of molecules and predict activity label by five different classifiers. We conduct experiments using our own dataset of GPR151 activators. The results demonstrate high classification accuracy and stability, with the optimal model Mol2vec-CNN significantly improving performance across multiple classifiers. The svm classifier achieves the best accuracy of 0.92 and F1 score of 0.76 which indicates promising applications for our method in the field of activity prediction.ConclusionThe results suggest that the experimental design of this study is appropriate and well-conceived. The deep learning-based feature extraction algorithm established in this study outperforms traditional feature selection algorithm for activity prediction. The model developed can be effectively utilized in the pre-screening stage of drug virtual screening.
引用
收藏
页数:18
相关论文
共 39 条
  • [1] Gromacs: High performance molecular simulations through multi-level parallelism from laptops to supercomputers
    Abraham, Mark James
    Murtola, Teemu
    Schulz, Roland
    Páll, Szilárd
    Smith, Jeremy C.
    Hess, Berk
    Lindah, Erik
    [J]. SoftwareX, 2015, 1-2 : 19 - 25
  • [2] Review of deep learning: concepts, CNN architectures, challenges, applications, future directions
    Alzubaidi, Laith
    Zhang, Jinglan
    Humaidi, Amjad J.
    Al-Dujaili, Ayad
    Duan, Ye
    Al-Shamma, Omran
    Santamaria, J.
    Fadhel, Mohammed A.
    Al-Amidie, Muthana
    Farhan, Laith
    [J]. JOURNAL OF BIG DATA, 2021, 8 (01)
  • [3] The habenular G-protein-coupled receptor 151 regulates synaptic plasticity and nicotine intake
    Antolin-Fontes, Beatriz
    Li, Kun
    Ables, Jessica L.
    Riad, Michael H.
    Gorlich, Andreas
    Williams, Maya
    Wang, Cuidong
    Lipford, Sylvia M.
    Dao, Maria
    Liu, Jianxi
    Molina, Henrik
    Heintz, Nathaniel
    Kenny, Paul J.
    Ibanez-Tallon, Ines
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (10) : 5502 - 5509
  • [4] Goh GB, 2017, Arxiv, DOI arXiv:1706.06689
  • [5] Efficiency of Homology Modeling Assisted Molecular Docking in G-protein Coupled Receptors
    Bhunia, Shome S.
    Saxena, Anil K.
    [J]. CURRENT TOPICS IN MEDICINAL CHEMISTRY, 2021, 21 (04) : 269 - 294
  • [6] G protein-coupled receptor 151 regulates glucose metabolism and hepatic gluconeogenesis
    Bielczyk-Maczynska, Ewa
    Zhao, Meng
    Zushin, Peter-James H.
    Schnurr, Theresia M.
    Kim, Hyun-Jung
    Li, Jiehan
    Nallagatla, Pratima
    Sangwung, Panjamaporn
    Park, Chong Y.
    Cornn, Cameron
    Stahl, Andreas
    Svensson, Katrin J.
    Knowles, Joshua W.
    [J]. NATURE COMMUNICATIONS, 2022, 13 (01)
  • [7] The information content of 2D and 3D structural descriptors relevant to ligand-receptor binding
    Brown, RD
    Martin, YC
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (01): : 1 - 9
  • [8] Unsupervised data base clustering based on Daylight's fingerprint and Tanimoto similarity: A fast and automated way to cluster small and large data sets
    Butina, D
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (04): : 747 - 750
  • [9] Machine learning for molecular and materials science
    Butler, Keith T.
    Davies, Daniel W.
    Cartwright, Hugh
    Isayev, Olexandr
    Walsh, Aron
    [J]. NATURE, 2018, 559 (7715) : 547 - 555
  • [10] FP-GNN: a versatile deep learning architecture for enhanced molecular property prediction
    Cai, Hanxuan
    Zhang, Huimin
    Zhao, Duancheng
    Wu, Jingxing
    Wang, Ling
    [J]. BRIEFINGS IN BIOINFORMATICS, 2022, 23 (06)