A Computational-Based Method for Predicting Drug-Target Interactions by Using Stacked Autoencoder Deep Neural Network

被引:150
作者
Wang, Lei [1 ,2 ]
You, Zhu-Hong [3 ]
Chen, Xing [4 ]
Xia, Shi-Xiong [1 ]
Liu, Feng [5 ]
Yan, Xin [6 ]
Zhou, Yong [1 ]
Song, Ke-Jian [7 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou, Peoples R China
[2] Zaozhuang Univ, Coll Informat Sci & Engn, Zaozhuang, Peoples R China
[3] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, China Beijing South Rd,40-1, Urumqi 830011, Peoples R China
[4] China Univ Min & Technol, Sch Informat & Control Engn, 1 Univ Rd, Xuzhou 221116, Peoples R China
[5] China Natl Coal Assoc, Beijing, Peoples R China
[6] Zaozhuang Univ, Sch Foreign Languages, Zaozhuang, Peoples R China
[7] JiangXi Univ Sci & Technol, Sch Informat Engn, Ganzhou, Peoples R China
关键词
deep learning; drug-target interactions; position-specific scoring matrix; stacked autoencoder; PROTEIN-PROTEIN INTERACTIONS; DATABASE; MATRIX;
D O I
10.1089/cmb.2017.0135
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Identifying the interaction between drugs and target proteins is an important area of drug research, which provides a broad prospect for low-risk and faster drug development. However, due to the limitations of traditional experiments when revealing drug-protein interactions (DTIs), the screening of targets not only takes a lot of time and money but also has high false-positive and false-negative rates. Therefore, it is imperative to develop effective automatic computational methods to accurately predict DTIs in the postgenome era. In this article, we propose a new computational method for predicting DTIs from drug molecular structure and protein sequence by using the stacked autoencoder of deep learning, which can adequately extract the raw data information. The proposed method has the advantage that it can automatically mine the hidden information from protein sequences and generate highly representative features through iterations of multiple layers. The feature descriptors are then constructed by combining the molecular substructure fingerprint information, and fed into the rotation forest for accurate prediction. The experimental results of fivefold cross-validation indicate that the proposed method achieves superior performance on gold standard data sets (enzymes,ion channels,GPCRs[G-protein-coupled receptors], and nuclear receptors) with accuracy of 0.9414, 0.9116, 0.8669, and 0.8056, respectively. We further comprehensively explore the performance of the proposed method by comparing it with other feature extraction algorithms, state-of-the-art classifiers, and other excellent methods on the same data set. The excellent comparison results demonstrate that the proposed method is highly competitive when predicting drug-target interactions.
引用
收藏
页码:361 / 373
页数:13
相关论文
共 45 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
[Anonymous], 2008, ICML 08, DOI 10.1145/1390156.1390294
[3]  
[Anonymous], 2013, PLOS ONE
[4]  
[Anonymous], 2011, Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP '11
[5]  
Bengio Y., 2006, ADV NEURAL INFORM PR, V19
[6]   Exploring nonlinear relationships in chemical data using kernel-based methods [J].
Cao, Dong-Sheng ;
Liang, Yi-Zeng ;
Xu, Qing-Song ;
Hu, Qian-Nan ;
Zhang, Liang-Xiao ;
Fu, Guang-Hui .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2011, 107 (01) :106-115
[7]   Prediction of aqueous solubility of druglike organic compounds using partial least squares, back-propagation network and support vector machine [J].
Cao, Dong-Sheng ;
Xu, Qing-Song ;
Liang, Yi-Zeng ;
Chen, Xian ;
Li, Hong-Dong .
JOURNAL OF CHEMOMETRICS, 2010, 24 (9-10) :584-595
[8]   A Semi-Supervised Method for Drug-Target Interaction Prediction with Consistency in Networks [J].
Chen, Hailin ;
Zhang, Zuping .
PLOS ONE, 2013, 8 (05)
[9]   NLLSS: Predicting Synergistic Drug Combinations Based on Semi-supervised Learning [J].
Chen, Xing ;
Ren, Biao ;
Chen, Ming ;
Wang, Quanxin ;
Zhang, Lixin ;
Yan, Guiying .
PLOS COMPUTATIONAL BIOLOGY, 2016, 12 (07)
[10]   Drug-target interaction prediction: databases, web servers and computational models [J].
Chen, Xing ;
Yan, Chenggang Clarence ;
Zhang, Xiaotian ;
Zhang, Xu ;
Dai, Feng ;
Yin, Jian ;
Zhang, Yongdong .
BRIEFINGS IN BIOINFORMATICS, 2016, 17 (04) :696-712