GreenHill: a de novo chromosome-level scaffolding and phasing tool using Hi-C

被引:4
作者
Ouchi, Shun [1 ]
Kajitani, Rei [1 ]
Itoh, Takehiko [1 ]
机构
[1] Tokyo Inst Technol, Sch Life Sci & Technol, 2-12-1 Ookayama,Meguro Ku, Tokyo 1528550, Japan
关键词
Genome assembly; Haplotype; Hi-C; Scaffolding; Phasing;
D O I
10.1186/s13059-023-03006-8
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Chromosome-level haplotype-resolved genome assembly is an important resource in molecular biology. However, current de novo haplotype assemblers require parental data or reference genomes and often fail to provide chromosome-level results. We present GreenHill, a novel scaffolding and phasing tool that considers various assemblers' contigs as input to reconstruct chromosome-level haplotypes using Hi-C without parental or reference data. Its unique functions include new error correction based on Hi-C contacts and the simultaneous use of Hi-C and long reads. Benchmarks reveal that GreenHill outperforms other approaches in contiguity and phasing accuracy, and the majority of chromosome arms are entirely phased.
引用
收藏
页数:27
相关论文
共 66 条
[1]  
[Anonymous], 2012, NCBI BIOPROJECT
[2]   A haplotype-led approach to increase the precision of wheat breeding [J].
Brinton, Jemima ;
Ramirez-Gonzalez, Ricardo H. ;
Simmonds, James ;
Wingen, Luzie ;
Orford, Simon ;
Griffiths, Simon ;
Haberer, Georg ;
Spannagl, Manuel ;
Walkowiak, Sean ;
Pozniak, Curtis ;
Uauy, Cristobal .
COMMUNICATIONS BIOLOGY, 2020, 3 (01)
[3]  
C. elegans Sequencing Consortium, 2013, WBCEL235 NCBI ASS
[4]   Haplotype-resolved assembly of diploid genomes without parental data [J].
Cheng, Haoyu ;
Jarvis, Erich D. ;
Fedrigo, Olivier ;
Koepfli, Klaus-Peter ;
Urban, Lara ;
Gemmell, Neil J. ;
Li, Heng .
NATURE BIOTECHNOLOGY, 2022, 40 (09) :1332-+
[5]   Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm [J].
Cheng, Haoyu ;
Concepcion, Gregory T. ;
Feng, Xiaowen ;
Zhang, Haowen ;
Li, Heng .
NATURE METHODS, 2021, 18 (02) :170-+
[6]   Haplotype-resolved genome assembly and allele-specific gene expression in cultivated ginger [J].
Cheng, Shi-Ping ;
Jia, Kai-Hua ;
Liu, Hui ;
Zhang, Ren-Gang ;
Li, Zhi-Chao ;
Zhou, Shan-Shan ;
Shi, Tian-Le ;
Ma, Ai-Chu ;
Yu, Cong-Wen ;
Gao, Chan ;
Cao, Guang-Lei ;
Zhao, Wei ;
Nie, Shuai ;
Guo, Jing-Fang ;
Jiao, Si-Qian ;
Tian, Xue-Chan ;
Yan, Xue-Mei ;
Bao, Yu-Tao ;
Yun, Quan-Zheng ;
Wang, Xin-Zhu ;
Porth, Ilga ;
El-Kassaby, Yousry A. ;
Wang, Xiao-Ru ;
Li, Zhen ;
Van de Peer, Yves ;
Mao, Jian-Feng .
HORTICULTURE RESEARCH, 2021, 8 (01)
[7]  
Chin CS, 2016, NAT METHODS, V13, P1050, DOI [10.1038/nmeth.4035, 10.1038/NMETH.4035]
[8]   Sim3C: simulation of Hi-C and Meta3C proximity ligation sequencing technologies [J].
DeMaere, Matthew Z. ;
Darling, Aaron E. .
GIGASCIENCE, 2017, 7 (02) :1-12
[9]   De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds [J].
Dudchenko, Olga ;
Batra, Sanjit S. ;
Omer, Arina D. ;
Nyquist, Sarah K. ;
Hoeger, Marie ;
Durand, Neva C. ;
Shamim, Muhammad S. ;
Machol, Ido ;
Lander, Eric S. ;
Aiden, Aviva Presser ;
Aiden, Erez Lieberman .
SCIENCE, 2017, 356 (6333) :92-95
[10]   Juicer Provides a One-Click System for Analyzing Loop-Resolution Hi-C Experiments [J].
Durand, Neva C. ;
Shamim, Muhammad S. ;
Machol, Ido ;
Rao, Suhas S. P. ;
Huntley, Miriam H. ;
Lander, Eric S. ;
Aiden, Erez Lieberman .
CELL SYSTEMS, 2016, 3 (01) :95-98