Enhanced QSAR Model Performance by Integrating Structural and Gene Expression Information

被引:5
作者
Chen, Qian [1 ]
Wu, Leihong [1 ]
Liu, Wei [1 ]
Xing, Li [1 ]
Fan, Xiaohui [1 ]
机构
[1] Zhejiang Univ, Coll Pharmaceut Sci, Pharmaceut Informat Inst, Hangzhou 310058, Zhejiang, Peoples R China
来源
MOLECULES | 2013年 / 18卷 / 09期
基金
美国国家科学基金会;
关键词
quantitative structure-activity relationships (QSAR); SAR paradox; molecular modeling; gene expression; integrative analysis; RISK-ASSESSMENT; METALLOTHIONEIN; SELECTION; CANCER; PREDICTION; TOXICITY; CARCINOGENESIS; CLASSIFICATION; MECHANISMS; PARADIGM;
D O I
10.3390/molecules180910789
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Despite decades of intensive research and a number of demonstrable successes, quantitative structure-activity relationship (QSAR) models still fail to yield predictions with reasonable accuracy in some circumstances, especially when the QSAR paradox occurs. In this study, to avoid the QSAR paradox, we proposed a novel integrated approach to improve the model performance through using both structural and biological information from compounds. As a proof-of-concept, the integrated models were built on a toxicological dataset to predict non-genotoxic carcinogenicity of compounds, using not only the conventional molecular descriptors but also expression profiles of significant genes selected from microarray data. For test set data, our results demonstrated that the prediction accuracy of QSAR model was dramatically increased from 0.57 to 0.67 with incorporation of expression data of just one selected signature gene. Our successful integration of biological information into classic QSAR model provided a new insight and methodology for building predictive models especially when QSAR paradox occurred.
引用
收藏
页码:10789 / 10801
页数:13
相关论文
共 50 条
  • [1] Integrating Gene Expression and Phenotypic Information to Analyze Alzheimer's Disease
    Ray, Monika
    Zhang, Weixiong
    JOURNAL OF ALZHEIMERS DISEASE, 2009, 16 (01) : 73 - 84
  • [2] Integrating Biological Knowledge with Gene Expression Profiles for Survival Prediction of Cancer
    Chen, Xi
    Wang, Lily
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2009, 16 (02) : 265 - 278
  • [3] IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer's disease markers
    Ozelbas, Enes
    Sevimoglu, Tuba
    Kahveci, Tamer
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (04):
  • [4] RNN-CNN Based Cancer Prediction Model for Gene Expression
    Thakur, Tanima
    Batra, Isha
    Malik, Arun
    Ghimire, Deepak
    Kim, Seong-Heum
    Sanwar Hosen, A. S. M.
    IEEE ACCESS, 2023, 11 : 131024 - 131044
  • [5] Integrating Information Gain and Chi-Square for Enhanced Malware Detection Performance
    Rafrastara, Fauzi Adi
    Ghozi, Wildanil
    Sani, Ramadhan Rakhmat
    Handoko, Lekso Budi
    Abdussalam
    Pramudya, Elkaf Rahmawan
    Abdollah, Faizal M.
    JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA, 2025, 24 (01): : 79 - 101
  • [6] Identifying Gene Network Rewiring by Integrating Gene Expression and Gene Network Data
    Xu, Ting
    Ou-Yang, Le
    Hu, Xiaohua
    Zhang, Xiao-Fei
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (06) : 2079 - 2085
  • [7] Biologically inspired survival analysis based on integrating gene expression as mediator with genomic variants
    Youssef, Ibrahim
    Clarke, Robert
    Shih, Ie-Ming
    Wang, Yue
    Yu, Guoqiang
    COMPUTERS IN BIOLOGY AND MEDICINE, 2016, 77 : 231 - 239
  • [8] Integrating Biological Context into the Analysis of Gene Expression Data
    Perscheid, Cindy
    Uflacker, Matthias
    DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2019, 801 : 339 - 343
  • [9] Lessons from a decade of integrating cancer copy number alterations with gene expression profiles
    Huang, Norman
    Shah, Parantu K.
    Li, Cheng
    BRIEFINGS IN BIOINFORMATICS, 2012, 13 (03) : 305 - 316
  • [10] Neuronal Apoptosis Revealed by Genomic Analysis: Integrating Gene Expression Profiles with Functional Information
    Sebastiano Cavallaro
    Neuroinformatics, 2007, 5 : 115 - 126