Exploiting genome variation to improve next-generation sequencing data analysis and genome editing efficiency in Populus tremulaxalba 717-1B4

被引:36
作者
Xue, Liang-Jiao [1 ,2 ,3 ]
Alabady, Magdy S. [4 ]
Mohebbi, Mohammad [5 ]
Tsai, Chung-Jui [1 ,2 ,3 ]
机构
[1] Univ Georgia, Sch Forestry & Nat Resources, Athens, GA 30602 USA
[2] Univ Georgia, Dept Genet, Athens, GA 30602 USA
[3] Univ Georgia, Inst Bioinformat, Athens, GA 30602 USA
[4] Univ Georgia, Dept Plant Biol, Athens, GA 30602 USA
[5] Univ Georgia, Dept Comp Sci, Athens, GA 30602 USA
关键词
Re-sequencing; SNP; Substituted genome; RNA-Seq; CRISPR; RNA-SEQ DATA; EXPRESSION; ALIGNMENT; DESIGN; FORMAT; GROWTH;
D O I
10.1007/s11295-015-0907-5
中图分类号
S7 [林业];
学科分类号
0829 ; 0907 ;
摘要
Populus species are widely distributed across the Northern Hemisphere. The genetic diversity makes the genus an ideal study system for traits of ecological or agronomic significance. However, sequence variation between the genome-sequenced Populus trichocarpa Nisqually-1 and many other Populus species and hybrids poses significant challenges for research that employs sequence-sensitive approaches, such as next-generation sequencing and sitespecific genome editing. Using the routinely transformed genotype Populus tremulaxalba 717-1B4 as a test case, we utilized established variant-calling pipelines with affordable re-sequencing (similar to 20x) and publicly available transcriptome data to generate a variant-substituted custom genome (sPta717). The sPta717 genome harbors over 10 million SNPs or small indels relative to the P. trichocarpa v3 reference genome. When applied to RNA-Seq analysis, the fraction of uniquely mapped reads increased by 13-28 % relative to that obtained with the P. trichocarpa reference genome, depending on read length and sequence type. The enhanced mapping rates enabled detection of several hundred more expressed genes and improved the differential expression analysis. Similar improvements were observed for DNA-Seq and ChIP-Seq data mapping. The sPta717 genome is also instrumental in guide RNA (gRNA) design for CRISPR-mediated genome editing. We showed that a majority of gRNAs designed from the P. trichocarpa reference genome contain mismatches with the corresponding target sequences of sPta717, likely rendering those gRNAs ineffective in transgenic 717. Awebsite is provided for querying the sPta717 genome by gene model or homology search. The same approach should be applicable to other outcrossing species with a closely related reference genome.
引用
收藏
页数:8
相关论文
共 38 条
[1]   An integrated map of genetic variation from 1,092 human genomes [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Schmidt, Jeanette P. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Dinh, Huyen ;
Kovar, Christie ;
Lee, Sandra ;
Lewis, Lora ;
Muzny, Donna ;
Reid, Jeff ;
Wang, Min ;
Wang, Jun ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Li, Zhuo ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Su, Zhe ;
Tai, Shuaishuai ;
Tang, Meifang .
NATURE, 2012, 491 (7422) :56-65
[2]   HTSeq-a Python']Python framework to work with high-throughput sequencing data [J].
Anders, Simon ;
Pyl, Paul Theodor ;
Huber, Wolfgang .
BIOINFORMATICS, 2015, 31 (02) :166-169
[3]   Stress-responsive hydroxycinnamate glycosyltransferase modulates phenylpropanoid metabolism in Populus [J].
Babst, Benjamin A. ;
Chen, Han-Yi ;
Wang, Hong-Qiang ;
Payyavula, Raja S. ;
Thomas, Tina P. ;
Harding, Scott A. ;
Tsai, Chung-Jui .
JOURNAL OF EXPERIMENTAL BOTANY, 2014, 65 (15) :4191-4200
[4]   The variant call format and VCFtools [J].
Danecek, Petr ;
Auton, Adam ;
Abecasis, Goncalo ;
Albers, Cornelis A. ;
Banks, Eric ;
DePristo, Mark A. ;
Handsaker, Robert E. ;
Lunter, Gerton ;
Marth, Gabor T. ;
Sherry, Stephen T. ;
McVean, Gilean ;
Durbin, Richard .
BIOINFORMATICS, 2011, 27 (15) :2156-2158
[5]   A framework for variation discovery and genotyping using next-generation DNA sequencing data [J].
DePristo, Mark A. ;
Banks, Eric ;
Poplin, Ryan ;
Garimella, Kiran V. ;
Maguire, Jared R. ;
Hartl, Christopher ;
Philippakis, Anthony A. ;
del Angel, Guillermo ;
Rivas, Manuel A. ;
Hanna, Matt ;
McKenna, Aaron ;
Fennell, Tim J. ;
Kernytsky, Andrew M. ;
Sivachenko, Andrey Y. ;
Cibulskis, Kristian ;
Gabriel, Stacey B. ;
Altshuler, David ;
Daly, Mark J. .
NATURE GENETICS, 2011, 43 (05) :491-+
[6]   Population genomics of Populus trichocarpa identifies signatures of selection and adaptive trait associations [J].
Evans, Luke M. ;
Slavov, Gancho T. ;
Rodgers-Melnick, Eli ;
Martin, Joel ;
Ranjan, Priya ;
Muchero, Wellington ;
Brunner, Amy M. ;
Schackwitz, Wendy ;
Gunter, Lee ;
Chen, Jin-Gui ;
Tuskan, Gerald A. ;
DiFazio, Stephen P. .
NATURE GENETICS, 2014, 46 (10) :1089-1096
[7]   The ENCODE (ENCyclopedia of DNA elements) Project [J].
Feingold, EA ;
Good, PJ ;
Guyer, MS ;
Kamholz, S ;
Liefer, L ;
Wetterstrand, K ;
Collins, FS ;
Gingeras, TR ;
Kampa, D ;
Sekinger, EA ;
Cheng, J ;
Hirsch, H ;
Ghosh, S ;
Zhu, Z ;
Pate, S ;
Piccolboni, A ;
Yang, A ;
Tammana, H ;
Bekiranov, S ;
Kapranov, P ;
Harrison, R ;
Church, G ;
Struhl, K ;
Ren, B ;
Kim, TH ;
Barrera, LO ;
Qu, C ;
Van Calcar, S ;
Luna, R ;
Glass, CK ;
Rosenfeld, MG ;
Guigo, R ;
Antonarakis, SE ;
Birney, E ;
Brent, M ;
Pachter, L ;
Reymond, A ;
Dermitzakis, ET ;
Dewey, C ;
Keefe, D ;
Denoeud, F ;
Lagarde, J ;
Ashurst, J ;
Hubbard, T ;
Wesselink, JJ ;
Castelo, R ;
Eyras, E ;
Myers, RM ;
Sidow, A ;
Batzoglou, S .
SCIENCE, 2004, 306 (5696) :636-640
[8]   The Tonoplast-Localized Sucrose Transporter in Populus (PtaSUT4) Regulates Whole-Plant Water Relations, Responses to Water Stress, and Photosynthesis [J].
Frost, Christopher J. ;
Nyamdari, Batbayar ;
Tsai, Chung-Jui ;
Harding, Scott A. .
PLOS ONE, 2012, 7 (08)
[9]   Full-length transcriptome assembly from RNA-Seq data without a reference genome [J].
Grabherr, Manfred G. ;
Haas, Brian J. ;
Yassour, Moran ;
Levin, Joshua Z. ;
Thompson, Dawn A. ;
Amit, Ido ;
Adiconis, Xian ;
Fan, Lin ;
Raychowdhury, Raktima ;
Zeng, Qiandong ;
Chen, Zehua ;
Mauceli, Evan ;
Hacohen, Nir ;
Gnirke, Andreas ;
Rhind, Nicholas ;
di Palma, Federica ;
Birren, Bruce W. ;
Nusbaum, Chad ;
Lindblad-Toh, Kerstin ;
Friedman, Nir ;
Regev, Aviv .
NATURE BIOTECHNOLOGY, 2011, 29 (07) :644-U130
[10]   E-CRISP: fast CRISPR target site identification [J].
Heigwer, Florian ;
Kerr, Grainne ;
Boutros, Michael .
NATURE METHODS, 2014, 11 (02) :122-124