Protein-specific prediction of mRNA binding using RNA sequences, binding motifs and predicted secondary structures

Cited by: 41
Authors
Livi, Carmen M. [1, 2]
Blanzieri, Enrico [1]
Affiliations
[1] Univ Trent, Dept Comp Sci & Informat Engn, Trento, Italy
[2] Univ Pompeu Fabra, Barcelona, Spain
Source
BMC Bioinformatics | 2014 / Vol. 15
Keywords
RNA-protein interaction; Support vector machine; Sites; Identification; CLIP; Identify
DOI
10.1186/1471-2105-15-123
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: RNA-binding proteins interact with specific RNA molecules to regulate important cellular processes. Identifying the RNA interaction partners is therefore necessary to understand the precise functions of such proteins. Protein-RNA interactions are typically characterized using in vivo and in vitro experiments, but these may not detect all binding partners. Computational methods that capture the protein-dependent nature of such binding interactions could therefore help to predict potential binding partners in silico.
Results: We have developed three methods to predict whether an RNA can interact with a particular RNA-binding protein, using support vector machines and different features based on the sequence (the Oli method), the motif score (the OliMo method) and the secondary structure (the OliMoSS method). We applied these approaches to different experimentally derived datasets and compared the predictions with RNAcontext and RPISeq. Oli outperformed OliMoSS and RPISeq, confirming our protein-specific predictions and suggesting that tetranucleotide frequencies are appropriate discriminative features. Oli and RNAcontext were the most competitive methods in terms of the area under the curve. In a precision-recall curve analysis, Oli achieved higher precision values. On a second experimental dataset that includes real negative binding information, Oli outperformed RNAcontext with a precision of 0.73 vs. 0.59.
Conclusions: Our experiments showed that features based on primary sequence information are sufficiently discriminating to predict specific RNA-protein interactions. Sequence motifs and secondary structure information were not necessary to improve these predictions. Finally, we confirmed that protein-specific experimental data on RNA-protein interactions are valuable sources of information that can be used for the efficient training of models for in silico predictions. The scripts are available upon request from the corresponding author.
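The Results name tetranucleotide frequencies as the discriminative sequence features behind the Oli method. As a rough illustration only, the sketch below turns one RNA sequence into a 256-dimensional vector of 4-mer relative frequencies. The function name, the handling of T and of ambiguous bases, and the normalization by total window count are assumptions made for this example; the authors' scripts are not reproduced here and may differ.

```python
from collections import Counter
from itertools import product

ALPHABET = "ACGU"
K = 4  # tetranucleotides, as named in the abstract

# All 256 possible tetranucleotides in a fixed order, so every
# sequence maps to a vector with the same layout.
KMERS = ["".join(p) for p in product(ALPHABET, repeat=K)]


def tetranucleotide_frequencies(sequence: str) -> list[float]:
    """Return the relative frequency of each 4-mer in an RNA sequence.

    Assumption (not from the paper): T is read as U, and windows that
    contain any character outside A/C/G/U are ignored.
    """
    seq = sequence.upper().replace("T", "U")
    # Count every overlapping window of length K.
    counts = Counter(seq[i:i + K] for i in range(len(seq) - K + 1))
    # Normalize by the number of valid windows only.
    total = sum(counts[kmer] for kmer in KMERS)
    if total == 0:
        return [0.0] * len(KMERS)
    return [counts[kmer] / total for kmer in KMERS]


# Hypothetical toy sequence, not taken from any dataset in the paper.
features = tetranucleotide_frequencies("AUGGCUAUGGCU")
```

Vectors of this kind, one per RNA and labeled as bound or unbound for a given protein, could then be passed to a support vector machine implementation of one's choice; which kernel and parameters the authors used is not stated in the abstract.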
Pages: 11