BlastR-fast and accurate database searches for non-coding RNAs

Cited by: 23
Authors
Bussotti, Giovanni [1, 2]
Raineri, Emanuele [1, 2, 3]
Erb, Ionas [1, 2]
Zytnicki, Matthias [1, 2, 4]
Wilm, Andreas [5]
Beaudoing, Emmanuel [1, 2, 6]
Bucher, Philipp [7, 8]
Notredame, Cedric [1, 2]
Affiliations
[1] CRG, Bioinformat & Genom Program, Barcelona 08003, Spain
[2] UPF, Barcelona 08003, Spain
[3] CNAG Ctr Nacl Anal Genom, E-08028 Barcelona, Spain
[4] URGI INRA Versailles, Dept Plant Breeding & Genet, F-78026 Versailles, France
[5] Univ Coll Dublin, Conway Inst Biomol & Biomed Sci, Dublin 4, Ireland
[6] Univ Lausanne, Ctr Integrat Genom, Genom Technol Facil, CH-1015 Lausanne, Switzerland
[7] Ecole Polytech Fed Lausanne, ISREC, Sch Life Sci, CH-1015 Lausanne, Switzerland
[8] SIB, CH-1015 Lausanne, Switzerland
Funding
National Research Foundation, Singapore;
Keywords
SECONDARY STRUCTURE; SEQUENCE ALIGNMENT; NUCLEOTIDE; IDENTIFICATION; SUBSTITUTION; ALGORITHM; EVOLUTION; HOMOLOGS; ELEMENTS; MODELS;
DOI
10.1093/nar/gkr335
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010; 081704;
Abstract
We present and validate BlastR, a method for efficiently and accurately searching non-coding RNAs. Our approach relies on the comparison of di-nucleotides using BlosumR, a new log-odd substitution matrix. In order to use BlosumR for comparison, we recoded RNA sequences into protein-like sequences. We then showed that BlosumR can be used along with the BlastP algorithm in order to search non-coding RNA sequences. Using Rfam as a gold standard, we benchmarked this approach and show BlastR to be more sensitive than BlastN. We also show that BlastR is both faster and more sensitive than BlastP used with a single nucleotide log-odd substitution matrix. BlastR, when used in combination with WU-BlastP, is about 5% more accurate than WU-BlastN and about 50 times slower. The approach shown here is equally effective when combined with the NCBI-Blast package. The software is an open source freeware available from www.tcoffee.org/blastr.html.
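The recoding step described in the abstract can be illustrated with a minimal sketch. It assumes that each pair of adjacent nucleotides (overlapping di-nucleotides) is mapped to one letter of a 16-letter protein-like alphabet; the abstract does not state whether the di-nucleotides overlap, and the letter assignment below is hypothetical, not the mapping used by BlastR. The function name `recode_rna` is likewise an illustrative choice.

```python
from itertools import product

NUCLEOTIDES = "ACGU"

# Hypothetical alphabet: 16 amino-acid letters, one per di-nucleotide.
# The real BlastR letter assignment is not given in this record.
PROTEIN_LIKE_LETTERS = "ACDEFGHIKLMNPQRS"

# Map AA, AC, AG, AU, CA, ... UU onto the 16 letters, in that order.
DINUCLEOTIDE_TO_LETTER = {
    first + second: letter
    for (first, second), letter in zip(
        product(NUCLEOTIDES, repeat=2), PROTEIN_LIKE_LETTERS
    )
}


def recode_rna(sequence: str) -> str:
    """Recode an RNA (or DNA) sequence into a protein-like string.

    Assumption: overlapping di-nucleotides are used, so a sequence of
    length n yields n - 1 letters. Pairs containing an ambiguous base
    are written as 'X'.
    """
    rna = sequence.upper().replace("T", "U")
    letters = []
    for start in range(len(rna) - 1):
        pair = rna[start:start + 2]
        letters.append(DINUCLEOTIDE_TO_LETTER.get(pair, "X"))
    return "".join(letters)


if __name__ == "__main__":
    print(recode_rna("GGAUCC"))
```

A string recoded this way could then be searched with a protein search program such as BlastP, using a 16-letter substitution matrix in the role that BlosumR plays in the abstract.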
Pages: 6886-6895
Page count: 10