Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform

被引:25
|
作者
Balazs, Zsolt [1 ]
Tombacz, Dora [1 ]
Szucs, Attila [1 ]
Snyder, Michael [2 ]
Boldogkoi, Zsolt [1 ]
机构
[1] Univ Szeged, Fac Med, Dept Med Biol, H-6720 Szeged, Hungary
[2] Stanford Univ, Sch Med, Dept Genet, Stanford, CA 94305 USA
关键词
MESSENGER-RNA; REVEALS;
D O I
10.1038/sdata.2017.194
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Long-read RNA sequencing allows for the precise characterization of full-length transcripts, which makes it an indispensable tool in transcriptomics. The human cytomegalovirus (HCMV) genome has been first sequenced in 1989 and although short-read sequencing studies have uncovered much of the complexity of its transcriptome, only few of its transcripts have been fully annotated. We hereby present a long-read RNA sequencing dataset of HCMV infected human lung fibroblast cells sequenced by the Pacific Biosciences RSII platform. Seven SMRT cells were sequenced using oligo(dT) primers to reverse transcribe poly(A)-selected RNA molecules and one library was prepared using random primers for the reverse transcription of the rRNA-depleted sample. Our dataset contains 122,636 human and 33,086 viral (HMCV strain Towne) reads. The described data include raw and processed sequencing files, and combined with other datasets, they can be used to validate transcriptome analysis tools, to compare library preparation methods, to test base calling algorithms or to identify genetic variants.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform
    Zsolt Balázs
    Dóra Tombácz
    Attila Szűcs
    Michael Snyder
    Zsolt Boldogkői
    Scientific Data, 4
  • [2] Correction: Corrigendum: Long-read sequencing of the human cytomegalovirus transcriptome with the pacific biosciences RSII platform
    Zsolt Balázs
    Dóra Tombácz
    Attila Szűcs
    Michael Snyder
    Zsolt Boldogkői
    Scientific Data, 5
  • [3] Long-read sequencing of the human cytomegalovirus transcriptome with the pacific biosciences RSII platform (vol 4, 170194, 2017)
    Balazs, Zsolt
    Tombacz, Dora
    Szucs, Attila
    Snyder, Michael
    Boldogkoi, Zsolt
    SCIENTIFIC DATA, 2018, 5
  • [4] Dual Platform Long-Read RNA-Sequencing Dataset of the Human Cytomegalovirus Lytic Transcriptome
    Balazs, Zsolt
    Tombacz, Dora
    Szucs, Attila
    Snyder, Michael
    Boldogkoi, Zsolt
    FRONTIERS IN GENETICS, 2018, 9
  • [5] Mind the Gap: Upgrading Genomes with Pacific Biosciences RS Long-Read Sequencing Technology
    English, Adam C.
    Richards, Stephen
    Han, Yi
    Wang, Min
    Vee, Vanesa
    Qu, Jiaxin
    Qin, Xiang
    Muzny, Donna M.
    Reid, Jeffrey G.
    Worley, Kim C.
    Gibbs, Richard A.
    PLOS ONE, 2012, 7 (11):
  • [6] Transcriptome variation in human tissues revealed by long-read sequencing
    Glinos, Dafni A.
    Garborcauskas, Garrett
    Hoffman, Paul
    Ehsan, Nava
    Jiang, Lihua
    Gokden, Alper
    Dai, Xiaoguang
    Aguet, Francois
    Brown, Kathleen L.
    Garimella, Kiran
    Bowers, Tera
    Costello, Maura
    Ardlie, Kristin
    Jian, Ruiqi
    Tucker, Nathan R.
    Ellinor, Patrick T.
    Harrington, Eoghan D.
    Tang, Hua
    Snyder, Michael
    Juul, Sissel
    Mohammadi, Pejman
    MacArthur, Daniel G.
    Lappalainen, Tuuli
    Cummings, Beryl
    NATURE, 2022, 608 (7922) : 353 - +
  • [7] Transcriptome variation in human tissues revealed by long-read sequencing
    Dafni A. Glinos
    Garrett Garborcauskas
    Paul Hoffman
    Nava Ehsan
    Lihua Jiang
    Alper Gokden
    Xiaoguang Dai
    François Aguet
    Kathleen L. Brown
    Kiran Garimella
    Tera Bowers
    Maura Costello
    Kristin Ardlie
    Ruiqi Jian
    Nathan R. Tucker
    Patrick T. Ellinor
    Eoghan D. Harrington
    Hua Tang
    Michael Snyder
    Sissel Juul
    Pejman Mohammadi
    Daniel G. MacArthur
    Tuuli Lappalainen
    Beryl B. Cummings
    Nature, 2022, 608 : 353 - 359
  • [8] Long-Read Sequencing of Human Cytomegalovirus Transcriptome Reveals RNA Isoforms Carrying Distinct Coding Potentials
    Balazs, Zsolt
    Tombacz, Dora
    Szucs, Attila
    Csabai, Zsolt
    Megyeri, Klara
    Petrov, Alexey N.
    Snyder, Michael
    Boldogkoi, Zsolt
    SCIENTIFIC REPORTS, 2017, 7
  • [9] Long-Read Sequencing of Human Cytomegalovirus Transcriptome Reveals RNA Isoforms Carrying Distinct Coding Potentials
    Zsolt Balázs
    Dóra Tombácz
    Attila Szűcs
    Zsolt Csabai
    Klára Megyeri
    Alexey N. Petrov
    Michael Snyder
    Zsolt Boldogkői
    Scientific Reports, 7
  • [10] Long-read cDNA sequencing identifies functional pseudogenes in the human transcriptome
    Robin-Lee Troskie
    Yohaann Jafrani
    Tim R. Mercer
    Adam D. Ewing
    Geoffrey J. Faulkner
    Seth W. Cheetham
    Genome Biology, 22