Finding sRNA generative locales from high-throughput sequencing data with NiBLS

被引:10
|
作者
MacLean, Daniel [1 ]
Moulton, Vincent [2 ]
Studholme, David J. [1 ]
机构
[1] John Innes Ctr, Sainsbury Lab, Norwich NR4 7UH, Norfolk, England
[2] Univ E Anglia, Norwich NR4 7TJ, Norfolk, England
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
MICRORNAS; SIRNAS;
D O I
10.1186/1471-2105-11-93
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Next-generation sequencing technologies allow researchers to obtain millions of sequence reads in a single experiment. One important use of the technology is the sequencing of small non-coding regulatory RNAs and the identification of the genomic locales from which they originate. Currently, there is a paucity of methods for finding small RNA generative locales. Results: We describe and implement an algorithm that can determine small RNA generative locales from high-throughput sequencing data. The algorithm creates a network, or graph, of the small RNAs by creating links between them depending on their proximity on the target genome. For each of the sub-networks in the resulting graph the clustering coefficient, a measure of the interconnectedness of the subnetwork, is used to identify the generative locales. We test the algorithm over a wide range of parameters using RFAM sequences as positive controls and demonstrate that the algorithm has good sensitivity and specificity in a range of Arabidopsis and mouse small RNA sequence sets and that the locales it generates are robust to differences in the choice of parameters. Conclusions: NiBLS is a fast, reliable and sensitive method for determining small RNA locales in high-throughput sequence data that is generally applicable to all classes of small RNA.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Finding sRNA generative locales from high-throughput sequencing data with NiBLS
    Daniel MacLean
    Vincent Moulton
    David J Studholme
    BMC Bioinformatics, 11
  • [2] Detecting Alu insertions from high-throughput sequencing data
    David, Matei
    Mustafa, Harun
    Brudno, Michael
    NUCLEIC ACIDS RESEARCH, 2013, 41 (17)
  • [3] Tools for mapping high-throughput sequencing data
    Fonseca, Nuno A.
    Rung, Johan
    Brazma, Alvis
    Marioni, John C.
    BIOINFORMATICS, 2012, 28 (24) : 3169 - 3177
  • [4] Genome reassembly with high-throughput sequencing data
    Parrish, Nathaniel
    Sudakov, Benjamin
    Eskin, Eleazar
    BMC GENOMICS, 2013, 14
  • [5] Genome reassembly with high-throughput sequencing data
    Nathaniel Parrish
    Benjamin Sudakov
    Eleazar Eskin
    BMC Genomics, 14
  • [6] Compression of Structured High-Throughput Sequencing Data
    Campagne, Fabien
    Dorff, Kevin C.
    Chambwe, Nyasha
    Robinson, James T.
    Mesirov, Jill P.
    PLOS ONE, 2013, 8 (11):
  • [7] DisCVR: Rapid viral diagnosis from high-throughput sequencing data
    Maabar, Maha
    Davison, Andrew J.
    Vucak, Matej
    Thorburn, Fiona
    Murcia, Pablo R.
    Gunson, Rory
    Palmarini, Massimo
    Hughes, Joseph
    VIRUS EVOLUTION, 2019, 5 (02)
  • [8] Generate gene expression profile from high-throughput sequencing data
    Hui Liu
    Zhichao Jiang
    Xiangzhong Fang
    Hanjiang Fu
    Xiaofei Zheng
    Lei Cha
    Wuju Li
    Frontiers of Mathematics in China, 2011, 6 : 1131 - 1145
  • [9] Genotype-Frequency Estimation from High-Throughput Sequencing Data
    Maruki, Takahiro
    Lynch, Michael
    GENETICS, 2015, 201 (02) : 473 - +
  • [10] Generate gene expression profile from high-throughput sequencing data
    Liu, Hui
    Jiang, Zhichao
    Fang, Xiangzhong
    Fu, Hanjiang
    Zheng, Xiaofei
    Cha, Lei
    Li, Wuju
    FRONTIERS OF MATHEMATICS IN CHINA, 2011, 6 (06) : 1131 - 1145