In Silico HCT116 Human Colon Cancer Cell-Based Models En Route to the Discovery of Lead-Like Anticancer Drugs

被引:26
作者
Cruz, Sara [1 ]
Gomes, Sofia E. [2 ]
Borralho, Pedro M. [2 ]
Rodrigues, Cecilia M. P. [2 ]
Gaudencio, Susana P. [1 ,3 ,4 ]
Pereira, Florbela [1 ]
机构
[1] Univ Nova Lisboa, Fac Sci & Technol, Dept Chem, LAQV REQUIMTE, P-2829516 Caparica, Portugal
[2] Univ Lisbon, Fac Pharm, Res Inst Med iMed ULisboa, P-1964003 Lisbon, Portugal
[3] Univ Nova Lisboa, Fac Sci & Technol, Dept Chem, UCIBIO REQUIMTE, P-2829516 Caparica, Portugal
[4] Univ Nova Lisboa, Fac Sci & Technol, Dept Life Sci, P-2829516 Caparica, Portugal
关键词
anticancer activity; HCT116 cell line; quantitative structure-activity relationship (QSAR); machine learning (ML); molecular descriptors; NMR descriptors; marine natural products (MNPs); marine-derived actinobacteria; RANDOM FOREST; QSAR; DERIVATIVES; ANTITUMOR; MARINE; DESIGN; POTENT; IDENTIFICATION; DEREPLICATION; PRINCIPLES;
D O I
10.3390/biom8030056
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
To discover new inhibitors against the human colon carcinoma HCT116 cell line, two quantitative structure-activity relationship (QSAR) studies using molecular and nuclear magnetic resonance (NMR) descriptors were developed through exploration of machine learning techniques and using the value of half maximal inhibitory concentration (IC50). In the first approach, A, regression models were developed using a total of 7339 molecules that were extracted from the ChEMBL and ZINC databases and recent literature. The performance of the regression models was successfully evaluated by internal and external validations, the best model achieved R-2 of 0.75 and 0.73 and root mean square error (RMSE) of 0.66 and 0.69 for the training and test sets, respectively. With the inherent time-consuming efforts of working with natural products (NPs), we conceived a new NP drug hit discovery strategy that consists in frontloading samples with 1D NMR descriptors to predict compounds with anticancer activity prior to bioactivity screening for NPs discovery, approach B. The NMR QSAR classification models were built using 1D NMR data (H-1 and C-13) as descriptors, from 50 crude extracts, 55 fractions and five pure compounds obtained from actinobacteria isolated from marine sediments collected off the Madeira Archipelago. The overall predictability accuracies of the best model exceeded 63% for both training and test sets.
引用
收藏
页数:22
相关论文
共 52 条
  • [51] PaDEL-Descriptor: An Open Source Software to Calculate Molecular Descriptors and Fingerprints
    Yap, Chun Wei
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2011, 32 (07) : 1466 - 1474
  • [52] Synthesis, Molecular Structure, Metabolic Stability and QSAR Studies of a Novel Series of Anticancer N-Acylbenzenesulfonamides
    Zolnowska, Beata
    Slawinski, Jaroslaw
    Belka, Mariusz
    Baczek, Tomasz
    Kawiak, Anna
    Chojnacki, Jaroslaw
    Pogorzelska, Aneta
    Szafranski, Krzysztof
    [J]. MOLECULES, 2015, 20 (10) : 19101 - 19129