Chromosome-level assembly and gene annotation of Kappaphycus striatus genome

被引:0
作者
Zhou, Zhiyin [1 ,2 ,3 ]
Ma, Yu [1 ,2 ,3 ]
Zhang, Jie [1 ,2 ]
Firdaus, Muhammad [4 ]
Roleda, Michael Y. [5 ]
Duan, Delin [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Oceanol, Shandong Prov Key Lab Expt Marine Biol, Key Lab Breeding Biotechnol & Sustainable Aquacult, Qingdao 266071, Peoples R China
[2] Qingdao Marine Sci & Technol Ctr, Lab Marine Biol & Biotechnol, Qingdao 266237, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[4] Natl Res & Innovat Agcy BRIN, Res Ctr Marine & Land Bioind, Lombok Utara 83352, Indonesia
[5] Univ Philippines Diliman, Marine Sci Inst, Quezon City 1101, Philippines
关键词
DE-NOVO IDENTIFICATION; PROGRAM; TRANSCRIPTOME; ACCURATE; FAMILIES; FINDER;
D O I
10.1038/s41597-025-04583-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Kappaphycus striatus is one of the carrageenan-producing red algae, and found primarily in tropical and subtropical coastal regions. Its global distribution is mainly in the Philippines, Indonesia, and Malaysia, among other locations. Here, through the high-quality chromosome-level genome sequences and assembly with PacBio HiFi and Hi-C sequencing data, we assembled one genome with a total of 211.46 Mb in size, containing a contig N50 length of 5.04 Mb and a scaffold N50 length of 5.39 Mb. After Hi-C assembly and manual adjustment to the heatmap, we deduced that 199.42 Mb of genomic sequences were anchored to 33 presumed chromosomes, which accounting for 94.31% of the entire genome. One total of 14,596 protein-coding genes and 1,673 non-coding RNAs were identified, and the 100.96 Mb of repetitive sequences accounting for 47.73% of the assembled genome. Our chromosome-level genome assembly data provide valuable references for K. striatus future nursery and breeding, and will be useful for the functional genomics interpretations and evolutionary studies of eukaryotes.
引用
收藏
页数:9
相关论文
共 41 条
[11]   Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm [J].
Cheng, Haoyu ;
Concepcion, Gregory T. ;
Feng, Xiaowen ;
Zhang, Haowen ;
Li, Heng .
NATURE METHODS, 2021, 18 (02) :170-+
[12]  
DOTY MS, 1975, MAR TECHNOL SOC J, V9, P30
[13]   LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons [J].
Ellinghaus, David ;
Kurtz, Stefan ;
Willhoeft, Ute .
BMC BIOINFORMATICS, 2008, 9 (1)
[14]   Full-length transcriptome assembly from RNA-Seq data without a reference genome [J].
Grabherr, Manfred G. ;
Haas, Brian J. ;
Yassour, Moran ;
Levin, Joshua Z. ;
Thompson, Dawn A. ;
Amit, Ido ;
Adiconis, Xian ;
Fan, Lin ;
Raychowdhury, Raktima ;
Zeng, Qiandong ;
Chen, Zehua ;
Mauceli, Evan ;
Hacohen, Nir ;
Gnirke, Andreas ;
Rhind, Nicholas ;
di Palma, Federica ;
Birren, Bruce W. ;
Nusbaum, Chad ;
Lindblad-Toh, Kerstin ;
Friedman, Nir ;
Regev, Aviv .
NATURE BIOTECHNOLOGY, 2011, 29 (07) :644-U130
[15]   Rfam: annotating non-coding RNAs in complete genomes [J].
Griffiths-Jones, S ;
Moxon, S ;
Marshall, M ;
Khanna, A ;
Eddy, SR ;
Bateman, A .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D121-D124
[16]   Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments [J].
Haas, Brian J. ;
Salzberg, Steven L. ;
Zhu, Wei ;
Pertea, Mihaela ;
Allen, Jonathan E. ;
Orvis, Joshua ;
White, Owen ;
Buell, C. Robin ;
Wortman, Jennifer R. .
GENOME BIOLOGY, 2008, 9 (01)
[17]   eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses [J].
Huerta-Cepas, Jaime ;
Szklarczyk, Damian ;
Heller, Davide ;
Hernandez-Plaza, Ana ;
Forslund, Sofia K. ;
Cook, Helen ;
Mende, Daniel R. ;
Letunic, Ivica ;
Rattei, Thomas ;
Jensen, Lars J. ;
von Mering, Christian ;
Bork, Peer .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D309-D314
[18]   Phyconomy: the extensive cultivation of seaweeds, their sustainability and economic value, with particular reference to important lessons to be learned and transferred from the practice of eucheumatoid farming [J].
Hurtado, Anicia Q. ;
Neish, Iain C. ;
Critchley, Alan T. .
PHYCOLOGIA, 2019, 58 (05) :472-483
[19]  
Jia S, 2020, bioRxiv, DOI [10.1101/2020.02.15.950402, 10.1101/2020.02.15.950402, DOI 10.1101/2020.02.15.950402]
[20]   KEGG as a reference resource for gene and protein annotation [J].
Kanehisa, Minoru ;
Sato, Yoko ;
Kawashima, Masayuki ;
Furumichi, Miho ;
Tanabe, Mao .
NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) :D457-D462