Feature-Based and String-Based Models for Predicting RNA-Protein Interaction

Cited by: 16
Authors
Adjeroh, Donald [1]
Allaga, Maen [1]
Tan, Jun [1]
Lin, Jie [2]
Jiang, Yue [2]
Abbasi, Ahmed [3]
Zhou, Xiaobo [4, 5]
Affiliations
[1] West Virginia Univ, Lane Dept Comp Sci & Elect Engn, Morgantown, WV 26508 USA
[2] Fujian Normal Univ, Fac Software, Fuzhou 350108, Fujian, Peoples R China
[3] Univ Virginia, McIntire Sch Commerce, Charlottesville, VA 22904 USA
[4] Univ Texas Hlth Sci Ctr Houston UTHlth, McGovern Med Sch, Houston, TX 77030 USA
[5] Univ Texas Hlth Sci Ctr Houston UTHlth, Sch Biomed Informat, Houston, TX 77030 USA
Funding
National Science Foundation (USA)
Keywords
RNA Protein Interaction; RPI; k-mers; suffix trees; richness; protein structure; RNA structure;
DOI
10.3390/molecules23030697
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification codes
071010; 081704
Abstract
In this work, we study two approaches for the problem of RNA-Protein Interaction (RPI). In the first approach, we use a feature-based technique by combining extracted features from both sequences and secondary structures. The feature-based approach enhanced the prediction accuracy as it included much more available information about the RNA-protein pairs. In the second approach, we apply search algorithms and data structures to extract effective string patterns for prediction of RPI, using both sequence information (protein and RNA sequences), and structure information (protein and RNA secondary structures). This led to different string-based models for predicting interacting RNA-protein pairs. We show results that demonstrate the effectiveness of the proposed approaches, including comparative results against leading state-of-the-art methods.
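As a rough illustration of what a sequence-derived feature can look like, the sketch below computes normalized k-mer frequencies for an RNA sequence and a protein sequence and concatenates them into one vector per pair. This is not the authors' pipeline: the function names, the alphabets, and the choice of k are assumptions made for the example, and the paper's structure-based features are not modeled here.

```python
from collections import Counter
from itertools import product

RNA_ALPHABET = "ACGU"                       # assumed RNA alphabet
PROTEIN_ALPHABET = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids


def kmer_frequencies(seq, alphabet, k):
    """Normalized k-mer frequencies, ordered as itertools.product(alphabet, repeat=k)."""
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    # Count every length-k window of the sequence.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Only windows made entirely of alphabet symbols contribute to the total.
    total = sum(counts[km] for km in kmers)
    return [counts[km] / total if total else 0.0 for km in kmers]


def pair_features(rna, protein, k_rna=2, k_protein=1):
    """Concatenate RNA and protein k-mer frequencies (hypothetical helper)."""
    return (kmer_frequencies(rna, RNA_ALPHABET, k_rna)
            + kmer_frequencies(protein, PROTEIN_ALPHABET, k_protein))


features = pair_features("ACGUAC", "MKVLAA")
```

A vector built this way could be passed to any standard classifier; a fixed k-mer ordering keeps the feature positions comparable across pairs.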
Pages: 17