Chromosome-level genome assembly of Triplophysa tibetana, a fish adapted to the harsh high-altitude environment of the Tibetan Plateau

被引:43
作者
Yang, Xuefen [1 ]
Liu, Haiping [2 ]
Ma, Zhihong [1 ]
Zou, Yu [3 ]
Zou, Ming [3 ]
Mao, Youzhi [4 ]
Li, Xiaomei [5 ]
Wang, Huan [4 ]
Chen, Tiansheng [1 ]
Wang, Weimin [1 ]
Yang, Ruibin [1 ]
机构
[1] Huazhong Agr Univ, Coll Fisheries, Wuhan, Hubei, Peoples R China
[2] Tibet Acad Agr & Anim Husb Sci, Inst Fisheries Sci, Lhasa, Peoples R China
[3] Wuhan Univ Technol, Sch Comp Sci & Technol, Wuhan, Hubei, Peoples R China
[4] Wuhan Frasergen Bioinformat Co Ltd, Wuhan, Hubei, Peoples R China
[5] Guangzhou Univ, Sch Life Sci, Guangzhou, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
assembly; genome; Hi-C; PacBio; Tibetan Plateau; Triplophysa; ADAPTATION; ANNOTATION; SEQUENCE; REVEALS; PIPELINE; INSIGHTS; PROGRAM; TOPHAT; GENES;
D O I
10.1111/1755-0998.13021
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Triplophysa is an endemic fish genus of the Tibetan Plateau in China. Triplophysa tibetana, which lives at a recorded altitude of 4,000 m and plays an important role in the highland aquatic ecosystem, serves as an excellent model for investigating high-altitude environmental adaptation. However, evolutionary and conservation studies of T. tibetana have been limited by scarce genomic resources for the genus Triplophysa. In the present study, we applied PacBio sequencing and the Hi-C technique to assemble the T. tibetana genome. A 652-Mb genome with 1,325 contigs with an N50 length of 3.1 Mb was obtained. The 1,137 contigs were further assembled into 25 chromosomes, representing 98.7% and 80.47% of all contigs at the base and sequence number level, respectively. Approximately 260 Mb of sequence, accounting for 39.8% of the genome, was identified as repetitive elements. DNA transposons (16.3%), long interspersed nuclear elements (12.4%) and long terminal repeats (11.0%) were the most repetitive types. In total, 24,372 protein-coding genes were predicted in the genome, and 95% of the genes were functionally annotated via a search in public databases. Using whole genome sequence information, we found that T. tibetana diverged from its common ancestor with Danio rerio 121.4 million years ago. The high-quality genome assembled in this work not only provides a valuable genomic resource for future population and conservation studies of T. tibetana, but it also lays a solid foundation for further investigation into the mechanisms of environmental adaptation of endemic fishes in the Tibetan Plateau.
引用
收藏
页码:1027 / 1036
页数:10
相关论文
共 67 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] 何德奎, 2006, [自然科学进展, Progress in Natural Science], V16, P1395
  • [3] [Anonymous], J DATA MINING GENOMI
  • [4] [Anonymous], CURRENT PROTOCOLS BI
  • [5] [Anonymous], NATURE GENETICS
  • [6] [Anonymous], FISH FISH RES XIZ CH
  • [7] Repbase Update, a database of repetitive elements in eukaryotic genomes
    Bao, Weidong
    Kojima, Kenji K.
    Kohany, Oleksiy
    [J]. MOBILE DNA, 2015, 6
  • [8] Hi-C 2.0: An optimized Hi-C procedure for high-resolution genome-wide mapping of chromosome conformation
    Belaghzal, Houda
    Dekker, Job
    Gibcus, Johan H.
    [J]. METHODS, 2017, 123 : 56 - 65
  • [9] Tandem repeats finder: a program to analyze DNA sequences
    Benson, G
    [J]. NUCLEIC ACIDS RESEARCH, 1999, 27 (02) : 573 - 580
  • [10] GeneWise and genomewise
    Birney, E
    Clamp, M
    Durbin, R
    [J]. GENOME RESEARCH, 2004, 14 (05) : 988 - 995