Feature-Based and String-Based Models for Predicting RNA-Protein Interaction

被引:16
作者
Adjeroh, Donald [1 ]
Allaga, Maen [1 ]
Tan, Jun [1 ]
Lin, Jie [2 ]
Jiang, Yue [2 ]
Abbasi, Ahmed [3 ]
Zhou, Xiaobo [4 ,5 ]
机构
[1] West Virginia Univ, Lane Dept Comp Sci & Elect Engn, Morgantown, WV 26508 USA
[2] Fujian Normal Univ, Fac Software, Fuzhou 350108, Fujian, Peoples R China
[3] Univ Virginia, McIntire Sch Commerce, Charlottesville, VA 22904 USA
[4] Univ Texas Hlth Sci Ctr Houston UTHlth, McGovern Med Sch, Houston, TX 77030 USA
[5] Univ Texas Hlth Sci Ctr Houston UTHlth, Sch Biomed Informat, Houston, TX 77030 USA
来源
MOLECULES | 2018年 / 23卷 / 03期
基金
美国国家科学基金会;
关键词
RNA Protein Interaction; RPI; k-mers; suffix trees; richness; protein structure; RNA structure;
D O I
10.3390/molecules23030697
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In this work, we study two approaches for the problem of RNA-Protein Interaction (RPI). In the first approach, we use a feature-based technique by combining extracted features from both sequences and secondary structures. The feature-based approach enhanced the prediction accuracy as it included much more available information about the RNA-protein pairs. In the second approach, we apply search algorithms and data structures to extract effective string patterns for prediction of RPI, using both sequence information (protein and RNA sequences), and structure information (protein and RNA secondary structures). This led to different string-based models for predicting interacting RNA-protein pairs. We show results that demonstrate the effectiveness of the proposed approaches, including comparative results against leading state-of-the-art methods.
引用
收藏
页数:17
相关论文
共 36 条
  • [1] Adjeroh D., 2008, The Burrows-Wheeler Transform: Data Compression, Suffix Arrays, and Pattern Matching
  • [2] Suffix-Sorting via Shannon-Fano-Elias Codes
    Adjeroh, Donald
    Nan, Fei
    [J]. ALGORITHMS, 2010, 3 (02) : 145 - 167
  • [3] Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning
    Alipanahi, Babak
    Delong, Andrew
    Weirauch, Matthew T.
    Frey, Brendan J.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (08) : 831 - +
  • [4] [Anonymous], NUCL ACIDS RES
  • [5] [Anonymous], DATA MINING PRACTICA
  • [6] [Anonymous], BIOPHYS J
  • [7] [Anonymous], 2014, SUPPORT VECTOR MACHI, DOI DOI 10.1007/978-3-319-02300-7
  • [8] [Anonymous], 1997, ACM SIGACT NEWS
  • [9] Efficient pattern matching for RNA secondary structures
    Beal, Richard
    Adjeroh, Donald
    [J]. THEORETICAL COMPUTER SCIENCE, 2015, 592 : 59 - 71
  • [10] Breiman L., 2001, Machine Learning, V45, P5