Next-generation sequencing analysis with a population-specific human reference genome

被引:0
作者
Suzuki, Tomohisa [1 ,2 ]
Ninomiya, Kota [1 ]
Funayama, Takamitsu [3 ,4 ]
Okamura, Yasunobu [3 ,5 ]
Tadaka, Shu [3 ]
Tohoku Med Megabank Project Study Grp, Kengo
Yamamoto, Masayuki [3 ,7 ]
Kure, Shigeo [2 ,8 ]
Kikuchi, Atsuo [2 ]
Tamiya, Gen [1 ,3 ,4 ]
Takayama, Jun [1 ,3 ,4 ]
机构
[1] Tohoku Univ, Dept AI & Innovat Med, Sch Med, Sendai, Miyagi 9808573, Japan
[2] Tohoku Univ, Sch Med, Dept Pediat, Sendai, Miyagi 9808574, Japan
[3] Tohoku Univ, Dept Integrat Genom, Tohoku Med Megabank Org, Sendai, Miyagi 9808573, Japan
[4] RIKEN Ctr Adv Intelligence Project, Chuo Ku, Tokyo 1030027, Japan
[5] Tohoku Univ, Adv Res Ctr Innovat Next Generat Med, Sendai, Miyagi 9808573, Japan
[6] Tohoku Univ, Inst Dev Aging & Canc, Dept Silico Anal, Sendai, Miyagi 9808575, Japan
[7] Tohoku Univ, Dept Biochem & Mol Biol, Tohoku Med Megabank Org, Sendai, Miyagi 9808573, Japan
[8] Miyagi Childrens Hosp, Sendai, Miyagi 9893126, Japan
关键词
population-specific reference genome; Japanese reference genome; genome resource; next-generation sequencing data analysis; variant calling; DISCOVERY;
D O I
10.1266/ggs.24-00112
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Next-generation sequencing (NGS) has become widely available and is routinely used in basic research and clinical practice. The reference genome sequence is an essential resource for NGS analysis, and several population-specific reference genomes have recently been constructed to provide a choice to deal with the vast genetic diversity of human samples. However, resources supporting population- specific references are insufficient, and it is burdensome to perform analysis using these reference genomes. Here, we constructed a set of resources to support NGS analysis using the Japanese reference genome, JG. We created resources for variant calling, variant effect prediction, gene and repeat element annotations, read mappability and RNA-seq analysis. We also provide a resource for reference coordinate conversion for further annotation enrichment. We then provide a variant calling protocol with JG. Our resources provide a guide to prepare sufficient resources for the use of population-specific reference genomes and can facilitate the migration of reference genomes.
引用
收藏
页数:8
相关论文
共 52 条
[11]   Coming of age: ten years of next-generation sequencing technologies [J].
Goodwin, Sara ;
McPherson, John D. ;
McCombie, W. Richard .
NATURE REVIEWS GENETICS, 2016, 17 (06) :333-351
[12]   A Draft Sequence of the Neandertal Genome [J].
Green, Richard E. ;
Krause, Johannes ;
Briggs, Adrian W. ;
Maricic, Tomislav ;
Stenzel, Udo ;
Kircher, Martin ;
Patterson, Nick ;
Li, Heng ;
Zhai, Weiwei ;
Fritz, Markus Hsi-Yang ;
Hansen, Nancy F. ;
Durand, Eric Y. ;
Malaspinas, Anna-Sapfo ;
Jensen, Jeffrey D. ;
Marques-Bonet, Tomas ;
Alkan, Can ;
Pruefer, Kay ;
Meyer, Matthias ;
Burbano, Hernan A. ;
Good, Jeffrey M. ;
Schultz, Rigo ;
Aximu-Petri, Ayinuer ;
Butthof, Anne ;
Hoeber, Barbara ;
Hoeffner, Barbara ;
Siegemund, Madlen ;
Weihmann, Antje ;
Nusbaum, Chad ;
Lander, Eric S. ;
Russ, Carsten ;
Novod, Nathaniel ;
Affourtit, Jason ;
Egholm, Michael ;
Verna, Christine ;
Rudan, Pavao ;
Brajkovic, Dejana ;
Kucan, Zeljko ;
Gusic, Ivan ;
Doronichev, Vladimir B. ;
Golovanova, Liubov V. ;
Lalueza-Fox, Carles ;
de la Rasilla, Marco ;
Fortea, Javier ;
Rosas, Antonio ;
Schmitz, Ralf W. ;
Johnson, Philip L. F. ;
Eichler, Evan E. ;
Falush, Daniel ;
Birney, Ewan ;
Mullikin, James C. .
SCIENCE, 2010, 328 (5979) :710-722
[13]   The UCSC Genome Browser Database: update 2006 [J].
Hinrichs, A. S. ;
Karolchik, D. ;
Baertsch, R. ;
Barber, G. P. ;
Bejerano, G. ;
Clawson, H. ;
Diekhans, M. ;
Furey, T. S. ;
Harte, R. A. ;
Hsu, F. ;
Hillman-Jackson, J. ;
Kuhn, R. M. ;
Pedersen, J. S. ;
Pohl, A. ;
Raney, B. J. ;
Rosenbloom, K. R. ;
Siepel, A. ;
Smith, K. E. ;
Sugnet, C. W. ;
Sultan-Qurraie, A. ;
Thomas, D. J. ;
Trumbower, H. ;
Weber, R. J. ;
Weirauch, M. ;
Zweig, A. S. ;
Haussler, D. ;
Kent, W. J. .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D590-D598
[14]   Pan-human consensus genome significantly improves the accuracy of RNA-seq analyses [J].
Kaminow, Benjamin ;
Ballouz, Sara ;
Gillis, Jesse ;
Dobin, Alexander .
GENOME RESEARCH, 2022, 32 (04) :738-749
[15]   Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype [J].
Kim, Daehwan ;
Paggi, Joseph M. ;
Park, Chanhee ;
Bennett, Christopher ;
Salzberg, Steven L. .
NATURE BIOTECHNOLOGY, 2019, 37 (08) :907-+
[16]   The Next-Generation Sequencing Revolution and Its Impact on Genomics [J].
Koboldt, Daniel C. ;
Steinberg, Karyn Meltz ;
Larson, David E. ;
Wilson, Richard K. ;
Mardis, Elaine R. .
CELL, 2013, 155 (01) :27-38
[17]   Exome variant discrepancies due to reference-genome differences [J].
Li, He ;
Dawood, Moez ;
Khayat, Michael M. ;
Farek, Jesse R. ;
Jhangiani, Shalini N. ;
Khan, Ziad M. ;
Mitani, Tadahiro ;
Coban-Akdemir, Zeynep ;
Lupski, James R. ;
Venner, Eric ;
Posey, Jennifer E. ;
Sabo, Aniko ;
Gibbs, Richard A. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2021, 108 (07) :1239-1250
[18]   New strategies to improve minimap2 alignment accuracy [J].
Li, Heng .
BIOINFORMATICS, 2021, 37 (23) :4572-4574
[19]   Minimap2: pairwise alignment for nucleotide sequences [J].
Li, Heng .
BIOINFORMATICS, 2018, 34 (18) :3094-3100
[20]   The Sequence Alignment/Map format and SAMtools [J].
Li, Heng ;
Handsaker, Bob ;
Wysoker, Alec ;
Fennell, Tim ;
Ruan, Jue ;
Homer, Nils ;
Marth, Gabor ;
Abecasis, Goncalo ;
Durbin, Richard .
BIOINFORMATICS, 2009, 25 (16) :2078-2079