Isoform discovery in long-read sequencing: Tuning computational pipeline

Citations: 0
Authors
Iglesias, Natalia [1]
Labari, Ignacio Garcia [2]
Spetale, Flavio [3]
Ponce, Sergio [4]
Tapia, Elizabeth [3]
Bulacio, Pilar [1]
Affiliations
[1] CIFASIS UNR UTN FRSN, Rosario, Argentina
[2] CIFASIS CONICET UNR, Rosario, Argentina
[3] CIFASIS UNR, Rosario, Argentina
[4] UTN FRSN, San Nicolas, Argentina
Source
2024 IEEE BIENNIAL CONGRESS OF ARGENTINA, ARGENCON 2024 | 2024
Keywords
mRNA isoform; alternative splicing; long-read sequencing
DOI
10.1109/ARGENCON62399.2024.10735861
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Isoform determination is crucial for understanding the functional diversity of proteins. However, obtaining high-quality sequencing data and optimizing bioinformatics tools for upstream analysis can be challenging. In this study, we present the optimization of a selected software pipeline using the Spike-In RNA Variant (SIRV) standard kit, which contains a diverse set of synthetic isoforms that mimic transcriptome complexity. SIRV data serves as a gold standard, enabling parameter tuning of software tools based on these synthetic reads. We applied the optimized pipeline to long-read sequencing data generated from the same SIRV molecular biology kit to enhance isoform detection capabilities. Our results highlight the importance of parameter optimization and demonstrate the advantages of long-read sequencing in resolving complex isoform structures. This approach offers improved accuracy in isoform identification, contributing to a more comprehensive understanding of protein diversity and function.
Pages: 5
Related papers
10 items in total
  • [1] Context-aware transcript quantification from long-read RNA-seq data with Bambu
    Chen, Ying
    Sim, Andre
    Wan, Yuk Kei
    Yeo, Keith
    Lee, Joseph Jing Xian
    Ling, Min Hao
    Love, Michael I.
    Goke, Jonathan
    [J]. NATURE METHODS, 2023, 20 (08) : 1187 - +
  • [2] STAR: ultrafast universal RNA-seq aligner
    Dobin, Alexander
    Davis, Carrie A.
    Schlesinger, Felix
    Drenkow, Jorg
    Zaleski, Chris
    Jha, Sonali
    Batut, Philippe
    Chaisson, Mark
    Gingeras, Thomas R.
    [J]. BIOINFORMATICS, 2013, 29 (01) : 15 - 21
  • [3] Transcriptome assembly from long-read RNA-seq alignments with StringTie2
    Kovaka, Sam
    Zimin, Aleksey V.
    Pertea, Geo M.
    Razaghi, Roham
    Salzberg, Steven L.
    Pertea, Mihaela
    [J]. GENOME BIOLOGY, 2019, 20 (01)
  • [4] Minimap2: pairwise alignment for nucleotide sequences
    Li, Heng
    [J]. BIOINFORMATICS, 2018, 34 (18) : 3094 - 3100
  • [5] SQANTI3: curation of long-read transcriptomes for accurate identification of known and novel isoforms
    Pardo-Palacios, Francisco J.
    Arzalluz-Luque, Angeles
    Kondratova, Liudmyla
    Salguero, Pedro
    Mestre-Tomas, Jorge
    Amorin, Rocio
    Estevan-Morio, Eva
    Liu, Tianyuan
    Nanni, Adalena
    Mcintyre, Lauren
    Tseng, Elizabeth
    Conesa, Ana
    [J]. NATURE METHODS, 2024, 21 (05) : 793 - 797
  • [6] Paul LM., 2016, bioRxiv
  • [7] High throughput single cell long-read sequencing analyses of same-cell genotypes and phenotypes in human tumors
    Shiau, Cheng-Kai
    Lu, Lina
    Kieser, Rachel
    Fukumura, Kazutaka
    Pan, Timothy
    Lin, Hsiao-Yun
    Yang, Jie
    Tong, Eric L.
    Lee, GaHyun
    Yan, Yuanqing
    Huse, Jason T.
    Gao, Ruli
    [J]. NATURE COMMUNICATIONS, 2023, 14 (01)
  • [8] High-throughput targeted long-read single cell sequencing reveals the clonal and transcriptional landscape of lymphocytes
    Singh, Mandeep
    Al-Eryani, Ghamdan
    Carswell, Shaun
    Ferguson, James M.
    Blackburn, James
    Barton, Kirston
    Roden, Daniel
    Luciani, Fabio
    Phan, Tri Giang
    Junankar, Simon
    Jackson, Katherine
    Goodnow, Christopher C.
    Smith, Martin A.
    Swarbrick, Alexander
    [J]. NATURE COMMUNICATIONS, 2019, 10 (01)
  • [9] SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification
    Tardaguila, Manuel
    de la Fuente, Lorena
    Marti, Cristina
    Pereira, Cecile
    Pardo-Palacios, Francisco Jose
    del Risco, Hector
    Ferrell, Marc
    Mellado, Maravillas
    Macchietto, Marissa
    Verheggen, Kenneth
    Edelmann, Mariola
    Ezkurdia, Iakes
    Vazquez, Jesus
    Tress, Michael
    Mortazavi, Ali
    Martens, Lennart
    Rodriguez-Navarro, Susana
    Moreno-Manzano, Victoria
    Conesa, Ana
    [J]. GENOME RESEARCH, 2018, 28 (03) : 396 - 411
  • [10] Comprehensive characterization of single-cell full-length isoforms in human and mouse with long-read sequencing
    Tian, Luyi
    Jabbari, Jafar S.
    Thijssen, Rachel
    Gouil, Quentin
    Amarasinghe, Shanika L.
    Voogd, Oliver
    Kariyawasam, Hasaru
    Du, Mei R. M.
    Schuster, Jakob
    Wang, Changqing
    Su, Shian
    Dong, Xueyi
    Law, Charity W.
    Lucattini, Alexis
    Prawer, Yair David Joseph
    Collar-Fernandez, Coralina
    Chung, Jin D.
    Naim, Timur
    Chan, Audrey
    Ly, Chi Hai
    Lynch, Gordon S.
    Ryall, James G.
    Anttila, Casey J. A.
    Peng, Hongke
    Anderson, Mary Ann
    Flensburg, Christoffer
    Majewski, Ian
    Roberts, Andrew W.
    Huang, David C. S.
    Clark, Michael B.
    Ritchie, Matthew E.
    [J]. GENOME BIOLOGY, 2021, 22 (01)