Haplotype-resolved assembly of diploid genomes without parental data

被引:285
作者
Cheng, Haoyu [1 ,2 ]
Jarvis, Erich D. [3 ,4 ]
Fedrigo, Olivier [3 ]
Koepfli, Klaus-Peter [5 ,6 ,7 ]
Urban, Lara [8 ]
Gemmell, Neil J. [8 ]
Li, Heng [1 ,2 ]
机构
[1] Dana Farber Canc Inst, Dept Data Sci, Boston, MA 02115 USA
[2] Harvard Med Sch, Dept Biomed Informat, Boston, MA 02115 USA
[3] Rockefeller Univ, Vertebrate Genome Lab, 1230 York Ave, New York, NY 10021 USA
[4] Howard Hughes Med Inst, Chevy Chase, MD USA
[5] George Mason Univ, Smithsonian Mason Sch Conservat, Front Royal, VA USA
[6] Smithsonian Conservat Biol Inst, Ctr Species Survival, Natl Zool Pk, Washington, DC USA
[7] ITMO Univ, Comp Technol Lab, St Petersburg, Russia
[8] Univ Otago, Dept Anat, Dunedin, New Zealand
基金
美国国家卫生研究院;
关键词
20;
D O I
10.1038/s41587-022-01261-x
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Routine haplotype-resolved genome assembly from single samples remains an unresolved problem. Here we describe an algorithm that combines PacBio HiFi reads and Hi-C chromatin interaction data to produce a haplotype-resolved assembly without the sequencing of parents. Applied to human and other vertebrate samples, our algorithm consistently outperforms existing single-sample assembly pipelines and generates assemblies of similar quality to the best pedigree-based assemblies.
引用
收藏
页码:1332 / +
页数:7
相关论文
共 20 条
[2]   Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm [J].
Cheng, Haoyu ;
Concepcion, Gregory T. ;
Feng, Xiaowen ;
Zhang, Haowen ;
Li, Heng .
NATURE METHODS, 2021, 18 (02) :170-+
[3]  
Chin C. S, 2019, HUMAN GENOME ASSEMBL, DOI [10.1101/705616v1, 10.1101/705616]
[4]  
Chin CS, 2016, NAT METHODS, V13, P1050, DOI [10.1038/NMETH.4035, 10.1038/nmeth.4035]
[5]   The sterlet sturgeon genome sequence and the mechanisms of segmental rediploidization [J].
Du, Kang ;
Stoeck, Matthias ;
Kneitz, Susanne ;
Klopp, Christophe ;
Woltering, Joost M. ;
Adolfi, Mateus Contar ;
Feron, Romain ;
Prokopov, Dmitry ;
Makunin, Alexey ;
Kichigin, Ilya ;
Schmidt, Cornelia ;
Fischer, Petra ;
Kuhl, Heiner ;
Wuertz, Sven ;
Gessner, Joern ;
Kloas, Werner ;
Cabau, Cedric ;
Iampietro, Carole ;
Parrinello, Hugues ;
Tomlinson, Chad ;
Journot, Laurent ;
Postlethwait, John H. ;
Braasch, Ingo ;
Trifonov, Vladimir ;
Warren, Wesley C. ;
Meyer, Axel ;
Guiguen, Yann ;
Schartl, Manfred .
NATURE ECOLOGY & EVOLUTION, 2020, 4 (06) :841-852
[6]   HapCUT2: robust and accurate haplotype assembly for diverse sequencing technologies [J].
Edge, Peter ;
Bafna, Vineet ;
Bansal, Vikas .
GENOME RESEARCH, 2017, 27 (05) :801-812
[7]   Chromosome-scale, haplotype-resolved assembly of human genomes [J].
Garg, Shilpa ;
Fungtammasan, Arkarachai ;
Carroll, Andrew ;
Chou, Mike ;
Schmitt, Anthony ;
Zhou, Xiang ;
Mac, Stephen ;
Peluso, Paul ;
Hatas, Emily ;
Ghurye, Jay ;
Maguire, Jared ;
Mahmoud, Medhat ;
Cheng, Haoyu ;
Heller, David ;
Zook, Justin M. ;
Moemke, Tobias ;
Marschall, Tobias ;
Sedlazeck, Fritz J. ;
Aach, John ;
Chin, Chen-Shan ;
Church, George M. ;
Li, Heng .
NATURE BIOTECHNOLOGY, 2021, 39 (03) :309-312
[8]   Identifying and removing haplotypic duplication in primary genome assemblies [J].
Guan, Dengfeng ;
McCarthy, Shane A. ;
Wood, Jonathan ;
Howe, Kerstin ;
Wang, Yadong ;
Durbin, Richard .
BIOINFORMATICS, 2020, 36 (09) :2896-2898
[9]   De novo assembly of haplotype-resolved genomes with trio binning [J].
Koren, Sergey ;
Rhie, Arang ;
Walenz, Brian P. ;
Dilthey, Alexander T. ;
Bickhart, Derek M. ;
Kingan, Sarah B. ;
Hiendleder, Stefan ;
Williams, John L. ;
Smith, Timothy P. L. ;
Phillippy, Adam M. .
NATURE BIOTECHNOLOGY, 2018, 36 (12) :1174-+
[10]   Extended haplotype-phasing of long-read de novo genome assemblies using Hi-C [J].
Kronenberg, Zev N. ;
Rhie, Arang ;
Koren, Sergey ;
Concepcion, Gregory T. ;
Peluso, Paul ;
Munson, Katherine M. ;
Porubsky, David ;
Kuhn, Kristen ;
Mueller, Kathryn A. ;
Low, Wai Yee ;
Hiendleder, Stefan ;
Fedrigo, Olivier ;
Liachko, Ivan ;
Hall, Richard J. ;
Phillippy, Adam M. ;
Eichler, Evan E. ;
Williams, John L. ;
Smith, Timothy P. L. ;
Jarvis, Erich D. ;
Sullivan, Shawn T. ;
Kingan, Sarah B. .
NATURE COMMUNICATIONS, 2021, 12 (01)