Chromosome-level assembly and gene annotation of Kappaphycus striatus genome

Cited by: 0

Authors:

Zhou, Zhiyin [1,2,3]

Ma, Yu [1,2,3]

Zhang, Jie [1,2]

Firdaus, Muhammad [4]

Roleda, Michael Y. [5]

Duan, Delin [1,2]

Affiliations:

[1] Chinese Acad Sci, Inst Oceanol, Shandong Prov Key Lab Expt Marine Biol, Key Lab Breeding Biotechnol & Sustainable Aquacult, Qingdao 266071, Peoples R China

[2] Qingdao Marine Sci & Technol Ctr, Lab Marine Biol & Biotechnol, Qingdao 266237, Peoples R China

[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[4] Natl Res & Innovat Agcy BRIN, Res Ctr Marine & Land Bioind, Lombok Utara 83352, Indonesia

[5] Univ Philippines Diliman, Marine Sci Inst, Quezon City 1101, Philippines

Source:

SCIENTIFIC DATA | 2025 / Volume 12 / Issue 01

Keywords:

DE-NOVO IDENTIFICATION; PROGRAM; TRANSCRIPTOME; ACCURATE; FAMILIES; FINDER;

DOI:

10.1038/s41597-025-04583-y

Chinese Library Classification (CLC) codes:

O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];

Discipline classification codes:

07 ; 0710 ; 09 ;

Abstract:

Kappaphycus striatus is a carrageenan-producing red alga found primarily in tropical and subtropical coastal regions. It is distributed mainly in the Philippines, Indonesia, and Malaysia, among other locations. Here, using PacBio HiFi and Hi-C sequencing data, we produced a high-quality chromosome-level genome assembly with a total size of 211.46 Mb, a contig N50 of 5.04 Mb, and a scaffold N50 of 5.39 Mb. After Hi-C scaffolding and manual adjustment of the heatmap, 199.42 Mb of genomic sequence was anchored to 33 presumed chromosomes, accounting for 94.31% of the entire genome. A total of 14,596 protein-coding genes and 1,673 non-coding RNAs were identified, and 100.96 Mb of repetitive sequences accounted for 47.73% of the assembled genome. This chromosome-level genome assembly provides a valuable reference for future nursery cultivation and breeding of K. striatus, and will be useful for functional genomics and evolutionary studies of eukaryotes.

Pages: 9
