Enhanced QSAR Model Performance by Integrating Structural and Gene Expression Information

被引:5
作者
Chen, Qian [1 ]
Wu, Leihong [1 ]
Liu, Wei [1 ]
Xing, Li [1 ]
Fan, Xiaohui [1 ]
机构
[1] Zhejiang Univ, Coll Pharmaceut Sci, Pharmaceut Informat Inst, Hangzhou 310058, Zhejiang, Peoples R China
基金
美国国家科学基金会;
关键词
quantitative structure-activity relationships (QSAR); SAR paradox; molecular modeling; gene expression; integrative analysis; RISK-ASSESSMENT; METALLOTHIONEIN; SELECTION; CANCER; PREDICTION; TOXICITY; CARCINOGENESIS; CLASSIFICATION; MECHANISMS; PARADIGM;
D O I
10.3390/molecules180910789
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Despite decades of intensive research and a number of demonstrable successes, quantitative structure-activity relationship (QSAR) models still fail to yield predictions with reasonable accuracy in some circumstances, especially when the QSAR paradox occurs. In this study, to avoid the QSAR paradox, we proposed a novel integrated approach to improve the model performance through using both structural and biological information from compounds. As a proof-of-concept, the integrated models were built on a toxicological dataset to predict non-genotoxic carcinogenicity of compounds, using not only the conventional molecular descriptors but also expression profiles of significant genes selected from microarray data. For test set data, our results demonstrated that the prediction accuracy of QSAR model was dramatically increased from 0.57 to 0.67 with incorporation of expression data of just one selected signature gene. Our successful integration of biological information into classic QSAR model provided a new insight and methodology for building predictive models especially when QSAR paradox occurred.
引用
收藏
页码:10789 / 10801
页数:13
相关论文
共 50 条
[11]   Neuronal apoptosis revealed by genomic analysis: Integrating gene expression profiles with functional information [J].
Cavallaro, Sebastiano .
NEUROINFORMATICS, 2007, 5 (02) :115-126
[12]   Mining cancer gene expression databases for latent information on intronic microRNAs [J].
Monterisi, Simona ;
D'Ario, Giovanni ;
Dama, Elisa ;
Rotmensz, Nicole ;
Confalonieri, Stefano ;
Tordonato, Chiara ;
Troglio, Flavia ;
Bertalot, Giovanni ;
Maisonneuve, Patrick ;
Viale, Giuseppe ;
Nicassio, Francesco ;
Vecchi, Manuela ;
Di Fiore, Pier Paolo ;
Bianchi, Fabrizio .
MOLECULAR ONCOLOGY, 2015, 9 (02) :473-487
[13]   A route-based pathway analysis framework integrating mutation information and gene expression data [J].
Zhao, Yue ;
Hoang, Tham H. ;
Joshi, Pujan ;
Hong, Seung-Hyun ;
Giardina, Charles ;
Shin, Dong-Guk .
METHODS, 2017, 124 :3-12
[14]   Difference in driver gene expression patterns between perihilar and peripheral intrahepatic cholangiocarcinoma in an experimental mouse model [J].
Adachi, Toshiyuki ;
Adachi, Tomohiko ;
Nakagaki, Takehiro ;
Ono, Shinichiro ;
Hidaka, Masaaki ;
Ito, Shinichiro ;
Kanetaka, Kengo ;
Takatsuki, Mitsuhisa ;
Nishida, Noriyuki ;
Eguchi, Susumu .
JOURNAL OF HEPATO-BILIARY-PANCREATIC SCIENCES, 2020, 27 (08) :477-486
[15]   Gene Expression Data Classification using Support Vector Machine and Mutual Information-based Gene Selection [J].
Vanitha, Devi Arockia C. ;
Devaraj, D. ;
Venkatesulu, M. .
GRAPH ALGORITHMS, HIGH PERFORMANCE IMPLEMENTATIONS AND ITS APPLICATIONS (ICGHIA 2014), 2015, 47 :13-21
[16]   Information-incorporated Gaussian graphical model for gene expression data [J].
Yi, Huangdi ;
Zhang, Qingzhao ;
Lin, Cunjie ;
Ma, Shuangge .
BIOMETRICS, 2022, 78 (02) :512-523
[17]   Developing Enhanced Blood-Brain Barrier Permeability Models: Integrating External Bio-Assay Data in QSAR Modeling [J].
Wang, Wenyi ;
Kim, Marlene T. ;
Sedykh, Alexander ;
Zhu, Hao .
PHARMACEUTICAL RESEARCH, 2015, 32 (09) :3055-3065
[18]   Integrating gene expression profiling and clinical data [J].
Paoli, Silvano ;
Jurman, Giuseppe ;
Albanese, Davide ;
Merler, Stefano ;
Furlanello, Cesare .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2008, 47 (01) :58-69
[19]   A semi-parametric statistical model for integrating gene expression profiles across different platforms [J].
Lyu, Yafei ;
Li, Qunhua .
BMC BIOINFORMATICS, 2016, 17
[20]   A semi-parametric statistical model for integrating gene expression profiles across different platforms [J].
Yafei Lyu ;
Qunhua Li .
BMC Bioinformatics, 17