Deep Multi-Label Joint Learning for RNA and DNA-Binding Proteins Prediction

Cited by: 5
Authors
Du, Xiuquan [1]
Hu, Jiajia [2]
Affiliations
[1] Anhui Univ, Key Lab Intelligent Comp & Signal Proc, Minist Educ, Hefei 230601, Anhui, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Anhui, Peoples R China
Keywords
DNA-binding proteins; RNA-binding proteins; multi-label learning; WEB SERVER; RULES
DOI
10.1109/TCBB.2022.3150280
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Recognizing DNA-binding proteins (DBPs) and RNA-binding proteins (RBPs) is important for understanding cell function, yet it remains a challenging task. In previous studies, these two classes of proteins have usually been treated separately because their binding domains differ. In addition, because DBPs and RBPs are highly similar, a DBP predictor may predict RBPs as DBPs, and vice versa, which leads to a high cross-prediction rate. In this study, we propose a deep multi-label joint learning framework that leverages the relationship between multiple labels and binding proteins. First, a multi-label variant network is designed to explore multi-scale hidden context information. Then, a multi-label Long Short-Term Memory network (multiLSTM) is used to mine the potential relationships between labels. Finally, the calibrated hidden features from the variant network are used for joint learning at different levels so that multiLSTM can better explore the correlation between them. Extensive experiments compare the proposed method with existing methods. We also offer further insight into the relevant bioanalysis of the proteins obtained from our model and summarize the binding proteins that are significantly related to a disease. Our method is freely available at http://39.108.90.186/dmlj.
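The multi-label framing and the cross-prediction problem described in the abstract can be illustrated with a small sketch. This is not the authors' code: the two-bit label encoding `(is_DBP, is_RBP)`, the function name `cross_prediction_rates`, the definition of the rate (the fraction of proteins that bind only one nucleic acid type but are also flagged for the other), and the toy data are all assumptions made for illustration.

```python
from typing import List, Tuple

# Assumed encoding: each protein carries two binary labels (is_DBP, is_RBP).
Label = Tuple[int, int]


def cross_prediction_rates(
    y_true: List[Label], y_pred: List[Label]
) -> Tuple[float, float]:
    """Return (RBP-predicted-as-DBP rate, DBP-predicted-as-RBP rate).

    Assumed definition: among proteins whose true label is RBP-only,
    the fraction whose predicted DBP bit is 1, and the mirror case
    for DBP-only proteins. Empty groups yield a rate of 0.0.
    """
    rbp_only = [p for t, p in zip(y_true, y_pred) if t == (0, 1)]
    dbp_only = [p for t, p in zip(y_true, y_pred) if t == (1, 0)]
    rbp_as_dbp = sum(p[0] for p in rbp_only) / len(rbp_only) if rbp_only else 0.0
    dbp_as_rbp = sum(p[1] for p in dbp_only) / len(dbp_only) if dbp_only else 0.0
    return rbp_as_dbp, dbp_as_rbp


# Hypothetical toy data: two DBP-only, two RBP-only, one dual-binding,
# and one non-binding protein.
y_true = [(1, 0), (1, 0), (0, 1), (0, 1), (1, 1), (0, 0)]
y_pred = [(1, 0), (1, 1), (1, 0), (0, 1), (1, 1), (0, 0)]
rates = cross_prediction_rates(y_true, y_pred)
```

Representing both labels on one protein lets a single model output dual-binding and non-binding cases directly, which two separate single-label predictors cannot express.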
Pages: 307-320
Page count: 14