Enhanced QSAR Model Performance by Integrating Structural and Gene Expression Information

Cited by: 5
Authors
Chen, Qian [1]
Wu, Leihong [1]
Liu, Wei [1]
Xing, Li [1]
Fan, Xiaohui [1]
Affiliation
[1] Zhejiang Univ, Coll Pharmaceut Sci, Pharmaceut Informat Inst, Hangzhou 310058, Zhejiang, Peoples R China
Source
MOLECULES | 2013 / Vol. 18 / Issue 09
Funding
National Science Foundation (United States);
Keywords
quantitative structure-activity relationships (QSAR); SAR paradox; molecular modeling; gene expression; integrative analysis; RISK-ASSESSMENT; METALLOTHIONEIN; SELECTION; CANCER; PREDICTION; TOXICITY; CARCINOGENESIS; CLASSIFICATION; MECHANISMS; PARADIGM;
DOI
10.3390/molecules180910789
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
Despite decades of intensive research and a number of demonstrable successes, quantitative structure-activity relationship (QSAR) models still fail to yield predictions with reasonable accuracy in some circumstances, especially when the QSAR paradox occurs. In this study, to avoid the QSAR paradox, we propose a novel integrated approach that improves model performance by using both structural and biological information about compounds. As a proof of concept, the integrated models were built on a toxicological dataset to predict the non-genotoxic carcinogenicity of compounds, using not only conventional molecular descriptors but also expression profiles of significant genes selected from microarray data. On the test set, the prediction accuracy of the QSAR model increased from 0.57 to 0.67 when expression data for just one selected signature gene were incorporated. This successful integration of biological information into a classic QSAR model provides new insight and a methodology for building predictive models, especially when the QSAR paradox occurs.
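The integration idea in the abstract (appending the expression value of one signature gene to each compound's molecular descriptors before training a classifier) can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not name the classifier, descriptors, or gene used, so the nearest-centroid classifier, the helper names, and every number below are hypothetical toy values, not data or results from the paper.

```python
from statistics import mean

def fit_centroids(X, y):
    """Compute one mean feature vector (centroid) per class label."""
    centroids = {}
    for label in sorted(set(y)):
        rows = [x for x, lab in zip(X, y) if lab == label]
        centroids[label] = [mean(col) for col in zip(*rows)]
    return centroids

def predict(centroids, x):
    """Assign x to the class whose centroid is closest (squared Euclidean)."""
    def sqdist(a, b):
        return sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    return min(centroids, key=lambda lab: sqdist(centroids[lab], x))

def accuracy(centroids, X, y):
    """Fraction of samples whose predicted label matches the true label."""
    hits = sum(predict(centroids, x) == lab for x, lab in zip(X, y))
    return hits / len(y)

def integrate(descriptors, expression):
    """Append one gene-expression value to each compound's descriptor vector."""
    return [d + [e] for d, e in zip(descriptors, expression)]

# Hypothetical toy data (NOT from the paper): two structural descriptors per
# compound, chosen so that structurally similar compounds carry different
# labels, plus one made-up signature-gene expression value per compound.
train_desc = [[1.0, 2.0], [1.2, 2.2], [1.1, 2.0], [1.3, 2.2]]
train_expr = [3.0, 2.8, 0.5, 0.7]
train_y = [1, 1, 0, 0]  # 1 = carcinogen, 0 = non-carcinogen (toy labels)

test_desc = [[1.25, 2.1], [1.05, 2.1], [1.0, 2.0], [1.3, 2.2]]
test_expr = [2.7, 0.4, 3.1, 0.6]
test_y = [1, 0, 1, 0]

# Structure-only model: trained on molecular descriptors alone.
struct_model = fit_centroids(train_desc, train_y)
acc_struct = accuracy(struct_model, test_desc, test_y)

# Integrated model: descriptors plus the single expression feature.
integ_model = fit_centroids(integrate(train_desc, train_expr), train_y)
acc_integrated = accuracy(integ_model, integrate(test_desc, test_expr), test_y)

print(f"structure only: {acc_struct:.2f}, integrated: {acc_integrated:.2f}")
```

On this toy set the added feature separates compounds that the descriptors alone cannot, which is the situation the abstract attributes to the QSAR paradox. In practice the expression feature would likely need scaling relative to the descriptors, and gene selection would have to be done on training data only.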
Pages: 10789-10801
Page count: 13