PMiSLocMF: predicting miRNA subcellular localizations by incorporating multi-source features of miRNAs

被引:13
作者
Chen, Lei [1 ]
Gu, Jiahui [1 ]
Zhou, Bo [2 ]
机构
[1] Shanghai Maritime Univ, Coll Informat Engn, 1550 Haigang Ave, Shanghai 201306, Peoples R China
[2] Shanghai Univ Med & Hlth Sci, Sch Basic Med Sci, 279 Zhouzhu Rd, Shanghai 201318, Peoples R China
关键词
subcellular localization; miRNA; graph attention auto-encoder; node2vec; miRNA-drug association; miRNA-disease association; miRNA-mRNA association; MULTI-LABEL CLASSIFIER; RNA; MICRORNA; RESOURCE;
D O I
10.1093/bib/bbae386
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The microRNAs (miRNAs) play crucial roles in several biological processes. It is essential for a deeper insight into their functions and mechanisms by detecting their subcellular localizations. The traditional methods for determining miRNAs subcellular localizations are expensive. The computational methods are alternative ways to quickly predict miRNAs subcellular localizations. Although several computational methods have been proposed in this regard, the incomplete representations of miRNAs in these methods left the room for improvement. In this study, a novel computational method for predicting miRNA subcellular localizations, named PMiSLocMF, was developed. As lots of miRNAs have multiple subcellular localizations, this method was a multi-label classifier. Several properties of miRNA, such as miRNA sequences, miRNA functional similarity, miRNA-disease, miRNA-drug, and miRNA-mRNA associations were adopted for generating informative miRNA features. To this end, powerful algorithms [node2vec and graph attention auto-encoder (GATE)] and one newly designed scheme were adopted to process above properties, producing five feature types. All features were poured into self-attention and fully connected layers to make predictions. The cross-validation results indicated the high performance of PMiSLocMF with accuracy higher than 0.83, average area under the receiver operating characteristic curve (AUC) and area under the precision-recall curve (AUPR) exceeding 0.90 and 0.77, respectively. Such performance was better than all previous methods based on the same dataset. Further tests proved that using all feature types can improve the performance of PMiSLocMF, and GATE and self-attention layer can help enhance the performance. Finally, we deeply analyzed the influence of miRNA associations with diseases, drugs, and mRNAs on PMiSLocMF. The dataset and codes are available at https://github.com/Gu20201017/PMiSLocMF.
引用
收藏
页数:16
相关论文
共 65 条
[1]   Principles of miRNA-mRNA interactions: beyond sequence complementarity [J].
Afonso-Grunz, Fabian ;
Mueller, Soeren .
CELLULAR AND MOLECULAR LIFE SCIENCES, 2015, 72 (16) :3127-3141
[2]  
Asim MN., 2021, IEEE IJCNN
[3]   MirLocPredictor: A ConvNet-Based Multi-Label MicroRNA Subcellular Localization Predictor by Incorporating k-Mer Positional Information [J].
Asim, Muhammad Nabeel ;
Malik, Muhammad Imran ;
Zehe, Christoph ;
Trygg, Johan ;
Dengel, Andreas ;
Ahmed, Sheraz .
GENES, 2020, 11 (12) :1-23
[4]   Human Argonaute 2 Is Tethered to Ribosomal RNA through MicroRNA Interactions [J].
Atwood, Blake L. ;
Woolnough, Jessica L. ;
Lefevre, Gaelle M. ;
Ribeiro, Mariana Saint Just ;
Felsenfeld, Gary ;
Giles, Keith E. .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2016, 291 (34) :17919-17928
[5]   DAmiRLocGNet: miRNA subcellular localization prediction by combining miRNA-disease associations and graph convolutional networks [J].
Bai, Tao ;
Yan, Ke ;
Liu, Bin .
BRIEFINGS IN BIOINFORMATICS, 2023, 24 (04)
[6]  
Cai Yimei, 2009, Genomics Proteomics & Bioinformatics, V7, P147, DOI 10.1016/S1672-0229(08)60044-3
[7]   MicroRNAs: Key Regulators in the Central Nervous System and Their Implication in Neurological Diseases [J].
Cao, Dan-Dan ;
Li, Lu ;
Chan, Wai-Yee .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2016, 17 (06)
[8]   Prediction of Drug Pathway-based Disease Classes using Multiple Properties of Drugs [J].
Chen, Lei ;
Li, Linyang .
CURRENT BIOINFORMATICS, 2024, 19 (09) :859-872
[9]   Improved multi-label classifiers for predicting protein subcellular localization [J].
Chen, Lei ;
Qu, Ruyun ;
Liu, Xintong .
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (01) :214-236
[10]   PredictEFC: a fast and efficient multi-label classifier for predicting enzyme family classes [J].
Chen, Lei ;
Zhang, Chenyu ;
Xu, Jing .
BMC BIOINFORMATICS, 2024, 25 (01)