Prediction and Optimization of Nav1.7 Sodium Channel Inhibitors Based on Machine Learning and Simulated Annealing

被引:17
作者
Kong, Weikaixin [3 ]
Tu, Xinyu [3 ]
Huang, Weiran [3 ]
Yang, Yang [5 ]
Xie, Zhengwei [1 ,2 ]
Huang, Zhuo [3 ,4 ]
机构
[1] Peking Univ, Int Canc Inst, Sch Basic Med Sci, Hlth Sci Ctr, Beijing 100191, Peoples R China
[2] Peking Univ, Dept Pharmacol, Sch Basic Med Sci, Hlth Sci Ctr, Beijing 100191, Peoples R China
[3] Peking Univ, Dept Mol & Cellular Pharmacol, Sch Pharmaceut Sci, Hlth Sci Ctr, Beijing 100191, Peoples R China
[4] Peking Univ, Dept Mol & Cellular Pharmacol, State Key Lab Nat & Biomimet Drugs, Sch Pharmaceut Sci,Hlth Sci Ctr, Beijing 100191, Peoples R China
[5] Purdue Univ, Coll Pharm, Dept Med Chem & Mol Pharmacol, W Lafayette, IN 47907 USA
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
PAIN; CLASSIFICATION; REGRESSION; MUTATIONS; SCN9A; ALGORITHM; DISCOVERY; MODELS; TOOL;
D O I
10.1021/acs.jcim.9b01180
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Although the Na(v)1.7 sodium channel is a promising drug target for pain, traditional screening strategies for discovery of Na(v)1.7 inhibitors are very painstaking and time-consuming. Herein, we aimed to build machine learning models for screening and design of potent and effective Na(v)1.7 sodium channel inhibitors. We customized the imbalanced data set from ChEMBL and BindingDB to train and filter the best classification model. Then, the whole-cell voltage-clamp was employed to validate the inhibitors. We assembled a molecular group optimization method by combining the Grammar Variational Autoencoder, classification model, and simulated annealing. We found that the RF-CDK model (random forest + CDK fingerprint) performs best in the imbalanced data set. Of the three compounds that may have inhibitory effects, nortriptyline has been experimentally verified. In the molecule optimization process, 40 molecules located in the applicability domain of RF-CDK were used as a starting point, among which 34 molecules evolved to molecules with greater molecular scores (MS). The molecule with the highest MS was derived from CHEMBL232524S. The model and method we developed for Na(v)1.7 inhibitors are also applicable to other targets.
引用
收藏
页码:2739 / 2753
页数:15
相关论文
共 56 条
  • [1] Computational Derivation of Structural Alerts from Large Toxicology Data Sets
    Ahlberg, Ernst
    Carlsson, Lars
    Boyer, Scott
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (10) : 2945 - 2952
  • [2] [Anonymous], 2014, RCDK INTERFACE CDK L
  • [3] [Anonymous], 2017, PMLR
  • [4] Modeling Kinase Inhibition Using Highly Confident Data Sets
    Avram, Sorin
    Bora, Alina
    Halip, Liliana
    Curpan, Ramona
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2018, 58 (05) : 957 - 967
  • [5] Bickerton GR, 2012, NAT CHEM, V4, P90, DOI [10.1038/NCHEM.1243, 10.1038/nchem.1243]
  • [6] Changes in the expression of tetrodotoxin-sensitive sodium channels within dorsal root ganglia neurons in inflammatory pain
    Black, JA
    Liu, SJ
    Tanaka, M
    Cummins, TR
    Waxman, SG
    [J]. PAIN, 2004, 108 (03) : 237 - 247
  • [7] Complex Support Vector Machines for Regression and Quaternary Classification
    Bouboulis, Pantelis
    Theodoridis, Sergios
    Mavroforakis, Charalampos
    Evaggelatou-Dalla, Leoni
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (06) : 1260 - 1274
  • [8] Survey of chronic pain in Europe: Prevalence, impact on daily life, and treatment
    Breivik, H
    Collett, B
    Ventafridda, V
    Cohen, R
    Gallacher, D
    [J]. EUROPEAN JOURNAL OF PAIN, 2006, 10 (04) : 287 - 333
  • [9] Unsupervised data base clustering based on Daylight's fingerprint and Tanimoto similarity: A fast and automated way to cluster small and large data sets
    Butina, D
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (04): : 747 - 750
  • [10] admetSAR: A Comprehensive Source and Free Tool for Assessment of Chemical ADMET Properties
    Cheng, Feixiong
    Li, Weihua
    Zhou, Yadi
    Shen, Jie
    Wu, Zengrui
    Liu, Guixia
    Lee, Philip W.
    Tang, Yun
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2012, 52 (11) : 3099 - 3105