long-read-tools.org: an interactive catalogue of analysis methods for long-read sequencing data

被引:25
作者
Amarasinghe, Shanika L. [1 ,2 ]
Ritchie, Matthew E. [1 ,2 ,3 ]
Gouil, Quentin [1 ,2 ]
机构
[1] Walter & Eliza Hall Inst Med Res, Epigenet & Dev Div, 1G Royal Parade, Parkville, Vic 3052, Australia
[2] Univ Melbourne, Dept Med Biol, 1G Royal Parade, Parkville, Vic 3052, Australia
[3] Univ Melbourne, Sch Math & Stat, 813 Swanston St, Parkville, Vic 3010, Australia
来源
GIGASCIENCE | 2021年 / 10卷 / 02期
基金
澳大利亚国家健康与医学研究理事会; 英国医学研究理事会;
关键词
database; long-read sequencing; data analysis; nanopore; PacBio; ALIGNMENT; RNA;
D O I
10.1093/gigascience/giab003
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The data produced by long-read third-generation sequencers have unique characteristics compared to short-read sequencing data, often requiring tailored analysis tools for tasks ranging from quality control to downstream processing. The rapid growth in software that addresses these challenges for different genomics applications is difficult to keep track of, which makes it hard for users to choose the most appropriate tool for their analysis goal and for developers to identify areas of need and existing solutions to benchmark against. Findings: We describe the implementation of long-read-tools.org, an open-source database that organizes the rapidly expanding collection of long-read data analysis tools and allows its exploration through interactive browsing and filtering. The current database release contains 478 tools across 32 categories. Most tools are developed in Python, and the most frequent analysis tasks include base calling, de novo assembly, error correction, quality checking/filtering, and isoform detection, while long-read single-cell data analysis and transcriptomics are areas with the fewest tools available. Conclusion: Continued growth in the application of long-read sequencing in genomics research positions the long-read-tools.org database as an essential resource that allows researchers to keep abreast of both established and emerging software to help guide the selection of the most relevant tool for their analysis needs.
引用
收藏
页数:7
相关论文
共 34 条
  • [1] Amarasinghe SL, 2021, **DATA OBJECT**, DOI [10.5524/100853, DOI 10.5524/100853]
  • [2] Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics
    Ardui, Simon
    Ameur, Adam
    Vermeesch, Joris R.
    Hestand, Matthew S.
    [J]. NUCLEIC ACIDS RESEARCH, 2018, 46 (05) : 2159 - 2168
  • [3] Improving PacBio Long Read Accuracy by Short Read Alignment
    Au, Kin Fai
    Underwood, Jason G.
    Lee, Lawrence
    Wong, Wing Hung
    [J]. PLOS ONE, 2012, 7 (10):
  • [4] SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing
    Bankevich, Anton
    Nurk, Sergey
    Antipov, Dmitry
    Gurevich, Alexey A.
    Dvorkin, Mikhail
    Kulikov, Alexander S.
    Lesin, Valery M.
    Nikolenko, Sergey I.
    Son Pham
    Prjibelski, Andrey D.
    Pyshkin, Alexey V.
    Sirotkin, Alexander V.
    Vyahhi, Nikolay
    Tesler, Glenn
    Alekseyev, Max A.
    Pevzner, Pavel A.
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (05) : 455 - 477
  • [5] Assembling large genomes with single-molecule sequencing and locality-sensitive hashing
    Berlin, Konstantin
    Koren, Sergey
    Chin, Chen-Shan
    Drake, James P.
    Landolin, Jane M.
    Phillippy, Adam M.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (06) : 623 - +
  • [6] Chamberlain S., 2020, CLIENT OPEN CITATION
  • [7] Davis S, SEANDAVI AWESOME SIN
  • [8] STAR: ultrafast universal RNA-seq aligner
    Dobin, Alexander
    Davis, Carrie A.
    Schlesinger, Felix
    Drenkow, Jorg
    Zaleski, Chris
    Jha, Sonali
    Batut, Philippe
    Chaisson, Mark
    Gingeras, Thomas R.
    [J]. BIOINFORMATICS, 2013, 29 (01) : 15 - 21
  • [9] Elliott TA, REPEAT RESOURCES GOO
  • [10] Software review: COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations
    Heibi, Ivan
    Peroni, Silvio
    Shotton, David
    [J]. SCIENTOMETRICS, 2019, 121 (02) : 1213 - 1228