HUPAN: a pan-genome analysis pipeline for human genomes

被引:51
|
作者
Duan, Zhongqu [1 ,2 ]
Qiao, Yuyang [1 ]
Lu, Jinyuan [1 ]
Lu, Huimin [1 ]
Zhang, Wenmin [1 ]
Yan, Fazhe [1 ]
Sun, Chen [1 ]
Hu, Zhiqiang [1 ]
Zhang, Zhen [3 ,4 ]
Li, Guichao [3 ,4 ]
Chen, Hongzhuan [5 ]
Xiang, Zhen [6 ]
Zhu, Zhenggang [6 ]
Zhao, Hongyu [2 ,7 ]
Yu, Yingyan [6 ]
Wei, Chaochun [1 ,2 ,8 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Life Sci & Biotechnol, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, SJTU Yale Joint Ctr Biostat & Data Sci, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
[3] Fudan Univ, Shanghai Canc Ctr, Shanghai Med Coll, Dept Radiat Oncol, 270 Dong An Rd, Shanghai 200032, Peoples R China
[4] Fudan Univ, Shanghai Canc Ctr, Shanghai Med Coll, Dept Oncol, 270 Dong An Rd, Shanghai 200032, Peoples R China
[5] Shanghai Jiao Tong Univ, Sch Med, Shanghai Key Lab Translat Med, Dept Pharmacol, 227 South Chongqing Rd, Shanghai 200025, Peoples R China
[6] Shanghai Jiao Tong Univ, Shanghai Key Lab Gastr Neoplasms, Sch Med, Dept Surg,Ruijin Hosp, 197 Ruijin Rd, Shanghai 200025, Peoples R China
[7] Yale Univ, Dept Biostat, 60 Coll St, New Haven, CT 06520 USA
[8] Shanghai Ctr Bioinformat Technol, 1278 Keyuan Rd, Shanghai 201203, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Pan-genome; Core genome; Presence-absence variation (PAV); Genome assembly; Population-specific variation; READ ALIGNMENT; SEQUENCE; DIVERSITY; ANNOTATION; CHIMPANZEE; INSIGHTS;
D O I
10.1186/s13059-019-1751-y
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The human reference genome is still incomplete, especially for those population-specific or individual-specific regions, which may have important functions. Here, we developed a HUman Pan-genome ANalysis (HUPAN) system to build the human pan-genome. We applied it to 185 deep sequencing and 90 assembled Han Chinese genomes and detected 29.5Mb novel genomic sequences and at least 188 novel protein-coding genes missing in the human reference genome (GRCh38). It can be an important resource for the human genome-related biomedical studies, such as cancer genome analysis. HUPAN is freely available at http://cgm.sjtu.edu.cn/hupan/ and https://github.com/SJTU-CGM/HUPAN.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] IPGA: A handy integrated prokaryotes genome and pan-genome analysis web service
    Liu, Dongmei
    Zhang, Yifei
    Fan, Guomei
    Sun, Dingzhong
    Zhang, Xingjiao
    Yu, Zhengfei
    Wang, Jinfeng
    Wu, Linhuan
    Shi, Wenyu
    Ma, Juncai
    IMETA, 2022, 1 (04):
  • [42] Methods in genome, pan-genome, pan-transcriptome, and gene regulatory network (GRN) construction and analysis
    Gupta, Parul
    Li, Song
    FRONTIERS IN PLANT SCIENCE, 2023, 14
  • [43] Pan-Genome of Wild and Cultivated Soybeans
    Liu, Yucheng
    Du, Huilong
    Li, Pengcheng
    Shen, Yanting
    Peng, Hua
    Liu, Shulin
    Zhou, Guo-An
    Zhang, Haikuan
    Liu, Zhi
    Shi, Miao
    Huang, Xuehui
    Li, Yan
    Zhang, Min
    Wang, Zheng
    Zhu, Baoge
    Han, Bin
    Liang, Chengzhi
    Tian, Zhixi
    CELL, 2020, 182 (01) : 162 - +
  • [44] Pan-Genome Identification and Expression Analysis of Lipoxygenase Genes in Cucumber
    Xu, Haiyu
    Liu, Kun
    Zhao, Lili
    Chen, Chunhua
    Wang, Lina
    Ren, Zhonghai
    AGRICULTURE-BASEL, 2025, 15 (03):
  • [45] Cotton pan-genome retrieves the lost sequences and genes during domestication and selection
    Li, Jianying
    Yuan, Daojun
    Wang, Pengcheng
    Wang, Qiongqiong
    Sun, Mengling
    Liu, Zhenping
    Si, Huan
    Xu, Zhongping
    Ma, Yizan
    Zhang, Boyang
    Pei, Liuling
    Tu, Lili
    Zhu, Longfu
    Chen, Ling-Ling
    Lindsey, Keith
    Zhang, Xianlong
    Jin, Shuangxia
    Wang, Maojun
    GENOME BIOLOGY, 2021, 22 (01)
  • [46] Comprehensive analysis of genomic variation, pan-genome and biosynthetic potential of Corynebacterium glutamicum strains
    Rahman, Md. Shahedur
    Shimul, Md. Ebrahim Khalil
    Parvez, Md. Anowar Khasru
    PLOS ONE, 2024, 19 (05):
  • [47] Use of pan-genome analysis for the identification of lineage-specific genes of Helicobacter pylori
    van Vliet, Arnoud H. M.
    FEMS MICROBIOLOGY LETTERS, 2017, 364 (02)
  • [48] Analysis of pan-genome to identify the core genes and essential genes of Brucella spp.
    Xiaowen Yang
    Yajie Li
    Juan Zang
    Yexia Li
    Pengfei Bie
    Yanli Lu
    Qingmin Wu
    Molecular Genetics and Genomics, 2016, 291 : 905 - 912
  • [49] Comparative Pan-Genome Analysis of Piscirickettsia salmonis Reveals Genomic Divergences within Genogroups
    Nourdin-Galindo, Guillermo
    Sanchez, Patricio
    Molina, Cristian F.
    Espinoza-Rojas, Daniela A.
    Oliver, Cristian
    Ruiz, Pamela
    Vargas-Chacoff, Luis
    Carcamo, Juan G.
    Figueroa, Jaime E.
    Mancilla, Marcos
    Maracaja-Coutinho, Vinicius
    Yanez, Alejandro J.
    FRONTIERS IN CELLULAR AND INFECTION MICROBIOLOGY, 2017, 7
  • [50] Pan-Genome Analysis Links the Hereditary Variation of Leptospirillum ferriphilum With Its Evolutionary Adaptation
    Zhang, Xian
    Liu, Xueduan
    Yang, Fei
    Chen, Lv
    FRONTIERS IN MICROBIOLOGY, 2018, 9