Sequence-based heuristics for faster annotation of non-coding RNA families

Cited by: 60
Authors
Weinberg, Z [1]
Ruzzo, WL
Affiliations
[1] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
[2] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
DOI
10.1093/bioinformatics/bti743
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Motivation: Non-coding RNAs (ncRNAs) are functional RNA molecules that do not code for proteins. Covariance Models (CMs) are a useful statistical tool to find new members of an ncRNA gene family in a large genome database, using both sequence and, importantly, RNA secondary structure information. Unfortunately, CM searches are extremely slow. Previously, we created rigorous filters, which provably sacrifice none of a CM's accuracy, while making searches significantly faster for virtually all ncRNA families. However, these rigorous filters make searches slower than heuristics could be.
Results: In this paper we introduce profile HMM-based heuristic filters. We show that their accuracy is usually superior to heuristics based on BLAST. Moreover, we compared our heuristics with those used in tRNAscan-SE, whose heuristics incorporate a significant amount of work specific to tRNAs, whereas our heuristics are generic to any ncRNA. Performance was roughly comparable, so we expect that our heuristics provide a high-quality solution that, unlike family-specific solutions, can scale to hundreds of ncRNA families.
Pages: 35 - 39
Number of pages: 5