Full-length SMRT transcriptome sequencing and microsatellite characterization in Paulownia catalpifolia

被引:12
作者
Feng, Yanzhi [1 ,2 ,3 ,4 ]
Zhao, Yang [1 ,2 ,3 ,4 ]
Zhang, Jiajia [1 ,2 ,3 ,4 ]
Wang, Baoping [1 ,2 ,3 ,4 ]
Yang, Chaowei [1 ,2 ,3 ,4 ]
Zhou, Haijiang [1 ,2 ,3 ,4 ]
Qiao, Jie [1 ,2 ,3 ,4 ]
机构
[1] Paulownia Res & Dev Ctr State Adm Forestry & Gras, Zhengzhou 450003, Peoples R China
[2] Chinese Acad Forestry, Nontimber Forestry Res & Dev Ctr, Zhengzhou 450003, Peoples R China
[3] Key Lab Nontimber Forest Germplasm Enhancement &, Zhengzhou 450003, Peoples R China
[4] Natl Innovat Alliance Paulownia, Zhengzhou 450003, Peoples R China
关键词
EST-SSR MARKERS; CROSS-SPECIES/GENERA TRANSFERABILITY; GENE-EXPRESSION; GENOME; COMPLEXITY; ISOFORMS; L;
D O I
10.1038/s41598-021-87538-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Paulownia catalpifolia is an important, fast-growing timber species known for its high density, color and texture. However, few transcriptomic and genetic studies have been conducted in P. catalpifolia. In this study, single-molecule real-time sequencing technology was applied to obtain the full-length transcriptome of P. catalpifolia leaves treated with varying degrees of drought stress. The sequencing data were then used to search for microsatellites, or simple sequence repeats (SSRs). A total of 28.83 Gb data were generated, 25,969 high-quality (HQ) transcripts with an average length of 1624 bp were acquired after removing the redundant reads, and 25,602 HQ transcripts (98.59%) were annotated using public databases. Among the HQ transcripts, 16,722 intact coding sequences, 149 long non-coding RNAs and 179 alternative splicing events were predicted, respectively. A total of 7367 SSR loci were distributed throughout 6293 HQ transcripts, of which 763 complex SSRs and 6604 complete SSRs. The SSR appearance frequency was 28.37%, and the average distribution distance was 5.59 kb. Among the 6604 complete SSR loci, 1-3 nucleotide repeats were dominant, occupying 97.85% of the total SSR loci, of which mono-, di- and tri-nucleotide repeats were 44.68%, 33.86% and 19.31%, respectively. We detected 112 repeat motifs, of which A/T (42.64%), AG/CT (12.22%), GA/TC (9.63%), GAA/TTC (1.57%) and CCA/TGG (1.54%) were most common in mono-, di- and tri-nucleotide repeats, respectively. The length of the repeat SSR motifs was 10-88 bp, and 4997 (75.67%) were <= 20 bp. This study provides a novel full-length transcriptome reference for P. catalpifolia and will facilitate the identification of germplasm resources and breeding of new drought-resistant P. catalpifolia varieties.
引用
收藏
页数:12
相关论文
共 56 条
[1]   A survey of the sorghum transcriptome using single-molecule long reads [J].
Abdel-Ghany, Salah E. ;
Hamilton, Michael ;
Jacobi, Jennifer L. ;
Ngam, Peter ;
Devitt, Nicholas ;
Schilkey, Faye ;
Ben-Hur, Asa ;
Reddy, Anireddy S. N. .
NATURE COMMUNICATIONS, 2016, 7
[2]   Discovery, validation, and in silico functional characterization of EST-SSR markers in Eucalyptus globulus [J].
Acuna, Cintia V. ;
Fernandez, Paula ;
Villalba, Pamela V. ;
Garcia, Martin N. ;
Esteban Hopp, H. ;
Marcucci Poltri, Susana N. .
TREE GENETICS & GENOMES, 2012, 8 (02) :289-301
[3]   Development of novel EST-SSR markers in the macauba palm (Acrocomia aculeata) using transcriptome sequencing and cross-species transferability in Arecaceae species [J].
Bazzo, Barbara Regina ;
de Carvalho, Lucas Miguel ;
Carazzolle, Marcelo Falsarella ;
Guimaraes Pereira, Goncalo Amarante ;
Colombo, Carlos Augusto .
BMC PLANT BIOLOGY, 2018, 18
[4]   The rise, fall and renaissance of microsatellites in eukaryotic genomes [J].
Buschiazzo, Emmanuel ;
Gemmell, Neil J. .
BIOESSAYS, 2006, 28 (10) :1040-1050
[5]   Analysis of transcripts and splice isoforms in Medicago sativa L. by single-molecule long-read sequencing [J].
Chao, Yuehui ;
Yuan, Jianbo ;
Guo, Tao ;
Xu, Lixin ;
Mu, Zhiyuan ;
Han, Liebao .
PLANT MOLECULAR BIOLOGY, 2019, 99 (03) :219-235
[6]   Analysis of transcripts and splice isoforms in red clover (Trifolium pratense L.) by single-molecule long-read sequencing [J].
Chao, Yuehui ;
Yuan, Jianbo ;
Li, Sifeng ;
Jia, Siqiao ;
Han, Liebao ;
Xu, Lixin .
BMC PLANT BIOLOGY, 2018, 18
[7]   A transcriptome atlas of rabbit revealed by PacBio single-molecule long-read sequencing [J].
Chen, Shi-Yi ;
Deng, Feilong ;
Jia, Xianbo ;
Li, Cao ;
Lai, Song-Jia .
SCIENTIFIC REPORTS, 2017, 7
[8]   A Complete Transcriptional Landscape Analysis of Pinus elliottii Engelm. Using Third-Generation Sequencing and Comparative Analysis in the Pinus Phylogeny [J].
Diao, Shu ;
Ding, Xianying ;
Luan, Qifu ;
Jiang, Jingmin .
FORESTS, 2019, 10 (11)
[9]   Transcriptome Expression Profiling in Response to Drought Stress in Paulownia australis [J].
Dong, Yanpeng ;
Fan, Guoqiang ;
Zhao, Zhenli ;
Deng, Minjie .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2014, 15 (03) :4583-4607
[10]   Exploiting the Transcriptome of Euphrates Poplar, Populus euphratica (Salicaceae) to Develop and Characterize New EST-SSR Markers and Construct an EST-SSR Database [J].
Du, Fang K. ;
Xu, Fang ;
Qu, Hong ;
Feng, Sisi ;
Tang, Jijun ;
Wu, Rongling .
PLOS ONE, 2013, 8 (04)