Chromosome-level assembly and gene annotation of Kappaphycus striatus genome

Times cited: 0
Authors
Zhou, Zhiyin [1, 2, 3]
Ma, Yu [1, 2, 3]
Zhang, Jie [1, 2]
Firdaus, Muhammad [4]
Roleda, Michael Y. [5]
Duan, Delin [1, 2]
Affiliations
[1] Chinese Acad Sci, Inst Oceanol, Shandong Prov Key Lab Expt Marine Biol, Key Lab Breeding Biotechnol & Sustainable Aquacult, Qingdao 266071, Peoples R China
[2] Qingdao Marine Sci & Technol Ctr, Lab Marine Biol & Biotechnol, Qingdao 266237, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[4] Natl Res & Innovat Agcy BRIN, Res Ctr Marine & Land Bioind, Lombok Utara 83352, Indonesia
[5] Univ Philippines Diliman, Marine Sci Inst, Quezon City 1101, Philippines
Keywords
DE-NOVO IDENTIFICATION; PROGRAM; TRANSCRIPTOME; ACCURATE; FAMILIES; FINDER;
DOI
10.1038/s41597-025-04583-y
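Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification codes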
07 ; 0710 ; 09 ;
Abstract
Kappaphycus striatus is a carrageenan-producing red alga found primarily in tropical and subtropical coastal regions, mainly in the Philippines, Indonesia, and Malaysia, among other locations. Here, using PacBio HiFi and Hi-C sequencing data, we assembled a high-quality chromosome-level genome with a total size of 211.46 Mb, a contig N50 of 5.04 Mb, and a scaffold N50 of 5.39 Mb. After Hi-C assembly and manual adjustment of the heatmap, 199.42 Mb of genomic sequence was anchored to 33 presumed chromosomes, accounting for 94.31% of the entire genome. A total of 14,596 protein-coding genes and 1,673 non-coding RNAs were identified, and 100.96 Mb of repetitive sequences accounted for 47.73% of the assembled genome. This chromosome-level assembly provides a valuable reference for future K. striatus nursery and breeding work, and will be useful for functional genomics and evolutionary studies of eukaryotes.
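The N50 and anchoring figures quoted in the abstract follow standard definitions, which can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names `n50` and `percent_of_assembly` and the toy contig lengths are hypothetical, and only the 199.42 Mb and 211.46 Mb values come from the abstract.

```python
def n50(lengths):
    """Return the N50 of a list of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembly length.
    """
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating covered length.
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("lengths must be a non-empty list")


def percent_of_assembly(part_mb, total_mb):
    """Return part_mb as a percentage of total_mb, rounded to two decimals."""
    return round(100.0 * part_mb / total_mb, 2)


# Hypothetical toy contig lengths (arbitrary units), not data from the paper.
toy_contigs = [8, 6, 5, 4, 3, 2, 2]
toy_n50 = n50(toy_contigs)

# Values taken from the abstract: anchored sequence and total assembly size in Mb.
anchored_pct = percent_of_assembly(199.42, 211.46)

print(toy_n50, anchored_pct)
```

With the toy lengths, the three longest contigs (8 + 6 + 5 = 19) are the first to cover half of the total of 30, so the N50 is 5; the anchored fraction reproduces the 94.31% reported in the abstract.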
Pages: 9