HUPAN: a pan-genome analysis pipeline for human genomes

被引:51
|
作者
Duan, Zhongqu [1 ,2 ]
Qiao, Yuyang [1 ]
Lu, Jinyuan [1 ]
Lu, Huimin [1 ]
Zhang, Wenmin [1 ]
Yan, Fazhe [1 ]
Sun, Chen [1 ]
Hu, Zhiqiang [1 ]
Zhang, Zhen [3 ,4 ]
Li, Guichao [3 ,4 ]
Chen, Hongzhuan [5 ]
Xiang, Zhen [6 ]
Zhu, Zhenggang [6 ]
Zhao, Hongyu [2 ,7 ]
Yu, Yingyan [6 ]
Wei, Chaochun [1 ,2 ,8 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Life Sci & Biotechnol, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, SJTU Yale Joint Ctr Biostat & Data Sci, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
[3] Fudan Univ, Shanghai Canc Ctr, Shanghai Med Coll, Dept Radiat Oncol, 270 Dong An Rd, Shanghai 200032, Peoples R China
[4] Fudan Univ, Shanghai Canc Ctr, Shanghai Med Coll, Dept Oncol, 270 Dong An Rd, Shanghai 200032, Peoples R China
[5] Shanghai Jiao Tong Univ, Sch Med, Shanghai Key Lab Translat Med, Dept Pharmacol, 227 South Chongqing Rd, Shanghai 200025, Peoples R China
[6] Shanghai Jiao Tong Univ, Shanghai Key Lab Gastr Neoplasms, Sch Med, Dept Surg,Ruijin Hosp, 197 Ruijin Rd, Shanghai 200025, Peoples R China
[7] Yale Univ, Dept Biostat, 60 Coll St, New Haven, CT 06520 USA
[8] Shanghai Ctr Bioinformat Technol, 1278 Keyuan Rd, Shanghai 201203, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Pan-genome; Core genome; Presence-absence variation (PAV); Genome assembly; Population-specific variation; READ ALIGNMENT; SEQUENCE; DIVERSITY; ANNOTATION; CHIMPANZEE; INSIGHTS;
D O I
10.1186/s13059-019-1751-y
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The human reference genome is still incomplete, especially for those population-specific or individual-specific regions, which may have important functions. Here, we developed a HUman Pan-genome ANalysis (HUPAN) system to build the human pan-genome. We applied it to 185 deep sequencing and 90 assembled Han Chinese genomes and detected 29.5Mb novel genomic sequences and at least 188 novel protein-coding genes missing in the human reference genome (GRCh38). It can be an important resource for the human genome-related biomedical studies, such as cancer genome analysis. HUPAN is freely available at http://cgm.sjtu.edu.cn/hupan/ and https://github.com/SJTU-CGM/HUPAN.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Pangloss: A Tool for Pan-Genome Analysis of Microbial Eukaryotes
    McCarthy, Charley G. P.
    Fitzpatrick, David A.
    GENES, 2019, 10 (07):
  • [32] Plasmodium vinckei genomes provide insights into the pan-genome and evolution of rodent malaria parasites
    Ramaprasad, Abhinay
    Klaus, Severina
    Douvropoulou, Olga
    Culleton, Richard
    Pain, Arnab
    BMC BIOLOGY, 2021, 19 (01)
  • [33] Evolutionary history and pan-genome dynamics of strawberry (Fragaria spp.)
    Qiao, Qin
    Edger, Patrick P.
    Xue, Li
    Qiong, La
    Lu, Jie
    Zhang, Yichen
    Cao, Qiang
    Yocca, Alan E.
    Platts, Adrian E.
    Knapp, Steven J.
    Van Montagu, Marc
    Van de Peer, Yves
    Lei, Jiajun
    Zhang, Ticao
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2021, 118 (45)
  • [34] Aspergillus fumigatus pan-genome analysis identifies genetic variants associated with human infection
    Barber, Amelia E.
    Sae-Ong, Tongta
    Kang, Kang
    Seelbinder, Bastian
    Li, Jun
    Walther, Grit
    Panagiotou, Gianni
    Kurzai, Oliver
    NATURE MICROBIOLOGY, 2021, 6 (12) : 1526 - +
  • [35] Pan-genome analysis of Clostridium botulinum reveals unique targets for drug development
    Bhardwaj, Tulika
    Somvanshi, Pallavi
    GENE, 2017, 623 : 48 - 62
  • [36] Population genetics and evolution of the pan-genome of Streptococcus pneumoniae
    Muzzi, Alessandro
    Donati, Claudio
    INTERNATIONAL JOURNAL OF MEDICAL MICROBIOLOGY, 2011, 301 (08) : 619 - 622
  • [37] A Novel Approach to Helicobacter pylori Pan-Genome Analysis for Identification of Genomic Islands
    Uchiyama, Ikuo
    Albritton, Jacob
    Fukuyo, Masaki
    Kojima, Kenji K.
    Yahara, Koji
    Kobayashi, Ichizo
    PLOS ONE, 2016, 11 (08):
  • [38] Integrating pan-genome with metagenome for microbial community profiling
    Zhong, Chaofang
    Chen, Chaoyun
    Wang, Lusheng
    Ning, Kang
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 1458 - 1466
  • [39] Pan-genome and phylogeny of Bacillus cereus sensu lato
    Bazinet, Adam L.
    BMC EVOLUTIONARY BIOLOGY, 2017, 17
  • [40] Insights into the population structure and pan-genome of Haemophilus influenzae
    Pinto, M.
    Gonzalez-Diaz, A.
    Machado, M. P.
    Duarte, S.
    Vieira, L.
    Carrico, J. A.
    Marti, S.
    Bajanca-Lavado, M. P.
    Gomes, J. P.
    INFECTION GENETICS AND EVOLUTION, 2019, 67 : 126 - 135