Scywalker: scalable end-to-end data analysis workflow for long-read single-cell transcriptome sequencing

Cited by: 0
Authors
De Rijk, Peter [1, 2]
Watzeels, Tijs [2, 3]
Kuecuekali, Fahri [2, 3]
Van Dongen, Jasper [2, 3]
Faura, Julia [2, 4]
Willems, Patrick [5, 6, 7, 8]
De Deyn, Lara [2, 3]
Duchateau, Lena [2, 3]
Grones, Carolin [5, 6]
Eekhout, Thomas [5, 6, 9]
De Pooter, Tim [1, 2]
Joris, Geert [1, 2]
Rombauts, Stephane [5, 6]
De Rybel, Bert [5, 6]
Rademakers, Rosa [2, 4]
Van Breusegem, Frank [5, 6]
Strazisar, Mojca [1, 2]
Sleegers, Kristel [2, 3]
De Coster, Wouter [2, 4]
Affiliations
[1] VIB, VIB Ctr Mol Neurol, Neur Support Facil, Univ pl 1, B-2610 Antwerp, Belgium
[2] Univ Antwerp, Dept Biomed Sci, Univ pl 1, B-2610 Antwerp, Belgium
[3] VIB Ctr Mol Neurol, Complex Genet Alzheimers Dis Grp, Univ pl 1, B-2610 Antwerp, Belgium
[4] VIB Ctr Mol Neurol, Appl & Translat Neurogenom Grp, Univ pl 1, B-2610 Antwerp, Belgium
[5] Univ Ghent, Dept Plant Biotechnol & Bioinformat, Technol pk 71, B-9052 Zwijnaarde, Belgium
[6] VIB, VIB Ctr Plant Syst Biol, Technol pk 71, B-9052 Zwijnaarde, Belgium
[7] Univ Ghent, Dept Biomol Med, Corneel Heymanslaan 10, B-9000 Ghent, Belgium
[8] VIB Ctr Med Biotechnol VIB, Technol pk Zwijnaarde 75, B-9052 Ghent, Belgium
[9] VIB Single Cell Core VIB, Technol pk Zwijnaarde 71, B-9052 Ghent, Belgium
Keywords
EXPRESSION; REVEALS;
DOI
10.1093/bioinformatics/btae549
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Motivation: Existing nanopore single-cell data analysis tools showed severe limitations in handling current data sizes.
Results: We introduce scywalker, an innovative and scalable package developed to comprehensively analyze long-read sequencing data of full-length single-cell or single-nuclei cDNA. We developed novel scalable methods for cell barcode demultiplexing and for single-cell isoform calling and quantification, and incorporated these in an easily deployable package. Scywalker streamlines the entire analysis process, from sequenced fragments in FASTQ format to demultiplexed pseudobulk isoform counts, into a single command suitable for execution on either a server or a cluster. Scywalker includes data quality control, cell type identification, and an interactive report. Assessment of datasets from the human brain, Arabidopsis leaves, and previously benchmarked data from mixed cell lines demonstrates excellent correlation with short-read analyses at both the cell-barcoding and gene quantification levels. At the isoform level, we show that scywalker facilitates the direct identification of cell-type-specific expression of novel isoforms.
Availability and implementation: Scywalker is available on github.com/derijkp/scywalker under the GNU General Public License (GPL) and at https://zenodo.org/records/13359438/files/scywalker-0.108.0-Linux-x86_64.tar.gz.
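To make the term "pseudobulk isoform counts" concrete, the following is a minimal sketch of how per-cell isoform counts could be summed per cell type. It is not scywalker's implementation: the function name `pseudobulk_counts`, the barcodes, the isoform labels, and the cell-type assignments are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def pseudobulk_counts(cell_isoform_counts, cell_types):
    """Sum per-cell isoform counts into per-cell-type (pseudobulk) totals.

    Hypothetical helper for illustration; not part of scywalker.
    """
    totals = defaultdict(lambda: defaultdict(int))
    for barcode, isoform_counts in cell_isoform_counts.items():
        cell_type = cell_types.get(barcode)
        if cell_type is None:
            continue  # skip barcodes that have no assigned cell type
        for isoform, count in isoform_counts.items():
            totals[cell_type][isoform] += count
    # Convert nested defaultdicts to plain dicts for easier inspection
    return {ct: dict(iso) for ct, iso in totals.items()}


# Toy input (made-up values): barcode -> {isoform: read count}
cell_isoform_counts = {
    "AAAC": {"iso1": 3, "iso2": 1},
    "AAAG": {"iso1": 2},
    "CCCT": {"iso2": 5},
    "GGGA": {"iso1": 7},  # no cell type assigned, so it is ignored
}
# Toy cell-type assignment (made-up): barcode -> cell type
cell_types = {"AAAC": "neuron", "AAAG": "neuron", "CCCT": "astrocyte"}

result = pseudobulk_counts(cell_isoform_counts, cell_types)
print(result)
```

In this toy case the two "neuron" barcodes are merged into one count vector, which is the sense in which single-cell counts become "pseudobulk" counts per cell type.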
Pages: 10