Biomedical Data and Deep Learning Computational Models for Predicting Compound-Protein Relations

被引:22
作者
Zhao, Qichang [1 ,2 ]
Yang, Mengyun [1 ,2 ,3 ]
Cheng, Zhongjian [1 ,2 ]
Li, Yaohang [4 ]
Wang, Jianxin [1 ,2 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
[2] Cent South Univ, Hunan Prov Key Lab Bioinformat, Changsha 410083, Peoples R China
[3] Shaoyang Univ, Sch Sci, Shaoyang 422000, Peoples R China
[4] Old Dominion Univ, Dept Comp Sci, Norfolk, VA 23529 USA
基金
中国国家自然科学基金;
关键词
Drugs; Proteins; Compounds; Deep learning; Databases; Three-dimensional displays; Task analysis; Virtual screening; compound-protein relation prediction; deep learning; TARGET INTERACTION PREDICTION; NEURAL-NETWORK; AFFINITY PREDICTION; DRUG DISCOVERY; SIMILARITY; RESOURCE; DIMENSIONALITY; DATABASES; LANGUAGE; GENE;
D O I
10.1109/TCBB.2021.3069040
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The identification of compound-protein relations (CPRs), which includes compound-protein interactions (CPIs) and compound-protein affinities (CPAs), is critical to drug development. A common method for compound-protein relation identification is the use of in vitro screening experiments. However, the number of compounds and proteins is massive, and in vitro screening experiments are labor-intensive, expensive, and time-consuming with high failure rates. Researchers have developed a computational field called virtual screening (VS) to aid experimental drug development. These methods utilize experimentally validated biological interaction information to generate datasets and use the physicochemical and structural properties of compounds and target proteins as input information to train computational prediction models. At present, deep learning has been widely used in computer vision and natural language processing and has experienced epoch-making progress. At the same time, deep learning has also been used in the field of biomedicine widely, and the prediction of CPRs based on deep learning has developed rapidly and has achieved good results. The purpose of this study is to investigate and discuss the latest applications of deep learning techniques in CPR prediction. First, we describe the datasets and feature engineering (i.e., compound and protein representations and descriptors) commonly used in CPR prediction methods. Then, we review and classify recent deep learning approaches in CPR prediction. Next, a comprehensive comparison is performed to demonstrate the prediction performance of representative methods on classical datasets. Finally, we discuss the current state of the field, including the existing challenges and our proposed future directions. We believe that this investigation will provide sufficient references and insight for researchers to understand and develop new deep learning methods to enhance CPR predictions.
引用
收藏
页码:2092 / 2110
页数:19
相关论文
共 158 条
[1]   DeepCDA: deep cross-domain compound-protein affinity prediction through LSTM and convolutional neural networks [J].
Abbasi, Karim ;
Razzaghi, Parvin ;
Poso, Antti ;
Amanlou, Massoud ;
Ghasemi, Jahan B. ;
Masoudi-Nejad, Ali .
BIOINFORMATICS, 2020, 36 (17) :4633-4642
[2]  
Abdel-Basset M., 2020, IEEE ACCESS, V8
[3]   Multi-view self-attention for interpretable drug-target interaction prediction [J].
Agyemang, Brighter ;
Wu, Wei-Ping ;
Kpiebaareh, Michael Yelpengne ;
Lei, Zhihua ;
Nanor, Ebenezer ;
Chen, Lei .
JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 110
[4]  
Agyemang B, 2019, I COMP CONF WAVELET, P266, DOI [10.1109/ICCWAMTIP47768.2019.9067510, 10.1109/iccwamtip47768.2019.9067510]
[5]   Recent improvements to Binding MOAD: a resource for protein-ligand binding affinities and structures [J].
Ahmed, Aqeel ;
Smith, Richard D. ;
Clark, Jordan J. ;
Dunbar, James B., Jr. ;
Carlson, Heather A. .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D465-D469
[6]  
[Anonymous], 2015, INT C LEARNING REPRE
[7]  
[Anonymous], 2009, MOL DESCRIPTORS CHEM
[8]  
[Anonymous], 2015, Nucleic Acids Research, V43, pD204
[9]   Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics [J].
Asgari, Ehsaneddin ;
Mofrad, Mohammad R. K. .
PLOS ONE, 2015, 10 (11)
[10]   Drug repositioning: Identifying and developing new uses for existing drugs [J].
Ashburn, TT ;
Thor, KB .
NATURE REVIEWS DRUG DISCOVERY, 2004, 3 (08) :673-683