A chromosome-level genome assembly of the redfin culter (Chanodichthys erythropterus)

被引：7

作者：

Zhao, Shihu ^{[1
]}

Yang, Xiufeng ^{[1
]}

Pang, Bo ^{[2
]}

Zhang, Lei ^{[1
]}

Wang, Qi ^{[2
]}

He, Shangbin ^{[1
]}

Dou, Huashan ^{[2
]}

Zhang, Honghai ^{[1
]}

机构：

[1] Qufu Normal Univ, Coll Life Sci, Qufu 273165, Shandong, Peoples R China

[2] Hulunbuir Acad Inland Lakes Northern Cold & Arid, Hulunbuir 021000, Inner Mongolia, Peoples R China

来源：

SCIENTIFIC DATA | 2022年 / 9卷 / 01期

基金：

中国国家自然科学基金;

关键词：

LENGTH-WEIGHT; PREDICTION; SEQUENCE; GENES; CARP; TOOL; DIVERSITY; ALIGNMENT; PROGRAM; FISHES;

D O I：

10.1038/s41597-022-01648-0

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Chanodichthys erythropterus is a fierce carnivorous fish widely found in East Asian waters. It is not only a popular food fish in China, it is also a representative victim of overfishing. Genetic breeding programs launched to meet market demands urgently require high-quality genomes to facilitate genomic selection and genetic research. In this study, we constructed a chromosome-level reference genome of C. erythropterus by taking advantage of long-read single-molecule sequencing and de novo assembly by Oxford Nanopore Technology (ONT) and Hi-C. The 1.085 Gb C. erythropterus genome was assembled from 132 Gb of Nanopore sequence. The assembled genome represents 98.5% completeness (BUSCO) with a contig N50 length of 23.29 Mb. The contigs were clustered and ordered onto 24 chromosomes covering roughly 99.49% of the genome assembly with Hi-C data. Additionally, 33,041 (98.0%) genes were functionally annotated from a total of 33,706 predicted protein-coding sequences by combining transcriptome data from seven tissues. This high-quality assembled genome will be a precious resource for future molecular breeding and functional genomics research of C. erythropterus.

引用

页数：9

共 51 条

[1]

ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999

[2]

[Anonymous], 2022, NCBI Sequence Read Archive

[3]

Arai R., 2011, FISH KARYOTYPES, DOI [10.1007/978-4-431-53877-6, DOI 10.1007/978-4-431-53877-6]

[4] Gene Ontology: tool for the unification of biology [J].

Ashburner, M ;

Ball, CA ;

Blake, JA ;

Botstein, D ;

Butler, H ;

Cherry, JM ;

Davis, AP ;

Dolinski, K ;

Dwight, SS ;

Eppig, JT ;

Harris, MA ;

Hill, DP ;

Issel-Tarver, L ;

Kasarskis, A ;

Lewis, S ;

Matese, JC ;

Richardson, JE ;

Ringwald, M ;

Rubin, GM ;

Sherlock, G .

NATURE GENETICS, 2000, 25 (01) :25-29

[5] The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].

Bairoch, A ;

Apweiler, R .

NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48

[6] Repbase Update, a database of repetitive elements in eukaryotic genomes [J].

Bao, Weidong ;

Kojima, Kenji K. ;

Kohany, Oleksiy .

MOBILE DNA, 2015, 6

[7] Hi-C: A comprehensive technique to capture the conformation of genomes [J].

Belton, Jon-Matthew ;

McCord, Rachel Patton ;

Gibcus, Johan Harmen ;

Naumova, Natalia ;

Zhan, Ye ;

Dekker, Job .

METHODS, 2012, 58 (03) :268-276

[8] Tandem repeats finder: a program to analyze DNA sequences [J].

Benson, G .

NUCLEIC ACIDS RESEARCH, 1999, 27 (02) :573-580

[9] GeneWise and genomewise [J].

Birney, E ;

Clamp, M ;

Durbin, R .

GENOME RESEARCH, 2004, 14 (05) :988-995

[10] Prediction of complete gene structures in human genomic DNA [J].

Burge, C ;

Karlin, S .

JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) :78-94

← 1 2 3 4 5 6 →