Systematic identification and characterization of repeat sequences in African swine fever virus genomes

被引:0
作者
Zhaozhong Zhu
Shengqiang Ge
Zena Cai
Yifan Wu
Congyu Lu
Zheng Zhang
Ping Fu
Longfei Mao
Xiaodong Wu
Yousong Peng
机构
[1] Hunan University,Bioinformatics Center, College of Biology, Hunan Provincial Key Laboratory of Medical Virology
[2] China Animal Health and Epidemiology Center,Key Laboratory of Animal Biosafety Risk Prevention and Control (South)
[3] Ministry of Agriculture and Rural Affairs,undefined
来源
Veterinary Research | / 53卷
关键词
ASFV; repeat sequences; evolution; genetic diversity;
D O I
暂无
中图分类号
学科分类号
摘要
African swine fever virus (ASFV) is a large DNA virus that infects domestic pigs with high morbidity and mortality rates. Repeat sequences, which are DNA sequence elements that are repeated more than twice in the genome, play an important role in the ASFV genome. The majority of repeat sequences, however, have not been identified and characterized in a systematic manner. In this study, three types of repeat sequences, including microsatellites, minisatellites and short interspersed nuclear elements (SINEs), were identified in the ASFV genome, and their distribution, structure, function, and evolutionary history were investigated. Most repeat sequences were observed in noncoding regions and at the 5’ end of the genome. Noncoding repeat sequences tended to form enhancers, whereas coding repeat sequences had a lower ratio of alpha-helix and beta-sheet and a higher ratio of loop structure and surface amino acids than nonrepeat sequences. In addition, the repeat sequences tended to encode penetrating and antimicrobial peptides. Further analysis of the evolution of repeat sequences revealed that the pan-repeat sequences presented an open state, showing the diversity of repeat sequences. Finally, CpG islands were observed to be negatively correlated with repeat sequence occurrences, suggesting that they may affect the generation of repeat sequences. Overall, this study emphasizes the importance of repeat sequences in ASFVs, and these results can aid in understanding the virus's function and evolution.
引用
收藏
相关论文
共 357 条
[31]  
Ge S(2018)iPromoter-2L: a two-layer predictor for identifying promoters and their types by multi-window-based PseKNC Bioinformatics 34 503-1549
[32]  
Li J(2012)ESpritz: accurate and fast prediction of protein disorder Bioinformatics 28 520-4574
[33]  
Fan X(2019)NetSurfP-2.0: improved prediction of protein structural features by integrated deep learning Proteins 87 D1094-1299
[34]  
Liu F(2016)CAMPR3: a database on sequences, structures and signatures of antimicrobial peptides Nucleic Acids Res 44 2715-277
[35]  
Li L(2018)Machine-learning-based prediction of cell-penetrating peptides and their uptake efficiency with improved accuracy J Proteome Res 17 693-2256
[36]  
Wang Q(2003)Genotyping field strains of African swine fever virus by partial p72 gene characterisation Arch Virol 148 772-154
[37]  
Ren W(2013)MAFFT multiple sequence alignment software version 7: improvements in performance and usability Mol Biol Evol 30 1547-8087
[38]  
Bao J(2018)MEGA X: molecular evolutionary genetics analysis across computing platforms Mol Biol Evol 35 4566-489
[39]  
Liu C(2015)Phylogeographic analysis of African swine fever virus based on the p72 gene sequence Genet Mol Res 14 1297-87
[40]  
Wang H(2014)PanGP: a tool for quickly analyzing bacterial pan-genome profile Bioinformatics 30 276-281