PRIMITI: A computational approach for accurate prediction of miRNA-target mRNA interaction

被引:2
作者
Uthayopas, Korawich [1 ,2 ]
de Sa, Alex G. C. [1 ,2 ,3 ]
Alavi, Azadeh [4 ]
Pires, Douglas E. V. [1 ,2 ,5 ]
Ascher, David B. [1 ,3 ]
机构
[1] Univ Queensland, Australian Ctr Ecogenom, Sch Chem & Mol Biosci, Brisbane, Qld 4072, Australia
[2] Baker Heart & Diabet Inst, Computat Biol & Clin Informat, Melbourne, Vic 3004, Australia
[3] Univ Melbourne, Baker Dept Cardiometab Hlth, Parkville, Vic 3010, Australia
[4] RMIT Univ, Sch Computat Technol, Melbourne, Vic 3000, Australia
[5] Univ Melbourne, Sch Comp & Informat Syst, Parkville, Vic 3052, Australia
来源
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2024年 / 23卷
基金
英国医学研究理事会;
关键词
MicroRNAs; MiRNA-target interaction prediction; MiRNA-mediated repression; Machine learning; EXtreme Gradient Boosting (XGBoost); MICRORNA BINDING-SITES; IDENTIFICATION;
D O I
10.1016/j.csbj.2024.06.030
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Current medical research has been demonstrating the roles of miRNAs in a variety of cellular mechanisms, lending credence to the association between miRNA dysregulation and multiple diseases. Understanding the mechanisms of miRNA is critical for developing effective diagnostic and therapeutic strategies. miRNA-mRNA interactions emerge as the most important mechanism to be understood despite their experimental validation constraints. Accordingly, several computational models have been developed to predict miRNA-mRNA interactions, albeit presenting limited predictive capabilities, poor characterisation of miRNA-mRNA interactions, and low usability. To address these drawbacks, we developed PRIMITI, a PRedictive model for the Identification of novel miRNA-Target mRNA Interactions. PRIMITI is a novel machine learning model that utilises CLIP-seq and expression data to characterise functional target sites in 3'-untranslated regions (3'-UTRs) and predict miRNAtarget mRNA repression activity. The model was trained using a reliable negative sample selection approach and the robust extreme gradient boosting (XGBoost) model, which was coupled with newly introduced features, including sequence and genetic variation information. PRIMITI achieved an area under the receiver operating characteristic (ROC) curve (AUC) up to 0.96 for a prediction of functional miRNA-target site binding and 0.96 for a prediction of miRNA-target mRNA repression activity on cross-validation and an independent blind test. Additionally, the model outperformed state-of-the-art methods in recovering miRNA-target repressions in an unseen microarray dataset and in a collection of validated miRNA-mRNA interactions, highlighting its utility for preliminary screening. PRIMITI is available on a reliable, scalable, and user-friendly web server at https://biosig. lab.uq.edu.au/primiti.
引用
收藏
页码:3030 / 3039
页数:10
相关论文
共 54 条
[1]   Predicting effective microRNA target sites in mammalian mRNAs [J].
Agarwal, Vikram ;
Bell, George W. ;
Nam, Jin-Wu ;
Bartel, David P. .
ELIFE, 2015, 4
[2]   Informative RNA base embedding for RNA structural alignment and clustering by deep representation learning [J].
Akiyama, Manato ;
Sakakibara, Yasubumi .
NAR GENOMICS AND BIOINFORMATICS, 2022, 4 (01)
[3]   An estimate of the total number of true human miRNAs [J].
Alles, Julia ;
Fehlmann, Tobias ;
Fischer, Ulrike ;
Backes, Christina ;
Galata, Valentina ;
Minet, Marie ;
Hart, Martin ;
Abu-Halima, Masood ;
Graesser, Friedrich A. ;
Lenhof, Hans-Peter ;
Keller, Andreas ;
Meese, Eckart .
NUCLEIC ACIDS RESEARCH, 2019, 47 (07) :3353-3364
[4]  
[Anonymous], 1994, 11 INT C MACHINE LEA, DOI 10.1016/B978-1-55860-335-6.50023-4
[5]   MP-VHPPI: Meta predictor for viral host protein-protein interaction prediction in multiple hosts and viruses [J].
Asim, Muhammad Nabeel ;
Fazeel, Ahtisham ;
Ibrahim, Muhammad Ali ;
Dengel, Andreas ;
Ahmed, Sheraz .
FRONTIERS IN MEDICINE, 2022, 9
[6]   Transcriptome-wide Discovery of microRNA Binding Sites in Human Brain [J].
Boudreau, Ryan L. ;
Jiang, Peng ;
Gilmore, Brian L. ;
Spengler, Ryan M. ;
Tirabassi, Rebecca ;
Nelson, Jay A. ;
Ross, Christopher A. ;
Xing, Yi ;
Davidson, Beverly L. .
NEURON, 2014, 81 (02) :294-305
[7]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[8]   miRDB: an online database for prediction of functional microRNA targets [J].
Chen, Yuhao ;
Wang, Xiaowei .
NUCLEIC ACIDS RESEARCH, 2020, 48 (D1) :D127-D131
[9]   iLearn: an integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data [J].
Chen, Zhen ;
Zhao, Pei ;
Li, Fuyi ;
Marquez-Lago, Tatiana T. ;
Leier, Andre ;
Revote, Jerico ;
Zhu, Yan ;
Powell, David R. ;
Akutsu, Tatsuya ;
Webb, Geoffrey, I ;
Chou, Kuo-Chen ;
Smith, A. Ian ;
Daly, Roger J. ;
Li, Jian ;
Song, Jiangning .
BRIEFINGS IN BIOINFORMATICS, 2020, 21 (03) :1047-1057
[10]   iFeature: a Python']Python package and web server for features extraction and selection from protein and peptide sequences [J].
Chen, Zhen ;
Zhao, Pei ;
Li, Fuyi ;
Leier, Andre ;
Marquez-Lago, Tatiana T. ;
Wang, Yanan ;
Webb, Geoffrey I. ;
Smith, A. Ian ;
Daly, Roger J. ;
Chou, Kuo-Chen ;
Song, Jiangning .
BIOINFORMATICS, 2018, 34 (14) :2499-2502