A large-scale phylogeny-guided analysis of pseudogenes in Pseudomonas aeruginosa bacterium

Times cited: 0
Authors
Cohen, Nimrod [1]
Veksler-Lublinsky, Isana [1]
Affiliations
[1] Ben Gurion Univ Negev, Dept Software & Informat Syst Engn, Fac Engn, Beer Sheva, Israel
Keywords
pseudogenes; phylogenetics; bacteria; Pseudomonas aeruginosa; comparative genomics; IDENTIFICATION; VISUALIZATION; DATABASE; GENES; TREES; MLST;
DOI
10.1128/spectrum.01704-23
Chinese Library Classification number
Q93 [Microbiology];
Discipline classification code
071005 ; 100705 ;
Abstract
Pseudogenes, once considered "junk DNA" based on the incorrect assumption that the absence of full coding potential means a complete lack of functionality, have recently become a subject of significant interest in the scientific community. Concurrently, it is widely assumed that bacterial genomes are compact and have a high density of coding genes with little room for non-coding genes, including pseudogenes. A key aspect of genome annotation is the correct identification of genes and the distinction between coding genes and pseudogenes, as it directly impacts functional and comparative genomics studies. In this study, we analyzed the genomic data of 4,699 strains of the bacterium Pseudomonas aeruginosa (P. aeruginosa), which exhibit high variability in the number of annotated pseudogenes. In particular, we looked for correlations between the number of pseudogenes and other genomic and meta-features of the strains. We identified clusters of orthologous genes and pseudogenes and compared cluster size distributions and length homogeneity within clusters. We then mapped and examined orthology relationships between genes and pseudogenes. Additionally, we generated a phylogenetic tree of the strains and found that phylogenetically related strains are more homogeneous in the number of pseudogenes and share a significant number of pseudogenes. Finally, we delved into clusters of orthologous genes and pseudogenes and quantified their phylogenetic neighborhood, classifying pseudogenes into evolutionarily preserved pseudogenes, mis-annotated pseudogenes, or pseudogenes formed by failed horizontal transfer events. This in-depth study provides important insights that can be incorporated into pseudogene annotation pipelines in the future.
IMPORTANCE Accurate annotation of genes and pseudogenes is vital for comparative genomics analysis. Recent studies have shown that bacterial pseudogenes have an important role in regulatory processes and can provide insight into the evolutionary history of homologous genes or the genome as a whole. Due to pseudogenes' nature as non-functional genes, there is no commonly accepted definition of a pseudogene, which poses difficulties in verifying the annotation through experimental methods and resolving discrepancies among different annotation techniques. Our study introduces an in-depth analysis of annotated genes and pseudogenes and insights that can be incorporated into improved pseudogene annotation pipelines in the future.
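The abstract reports that phylogenetically related strains share many pseudogenes, and the reference list includes Jaccard (1912). The record does not say how sharing was measured, so the following is only a minimal sketch, assuming it is quantified as a Jaccard index over the sets of ortholog clusters that contain a pseudogene in each strain. The strain names, cluster IDs, and the `jaccard` helper are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard index |A & B| / |A | B|; defined here as 0.0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


# Hypothetical toy data (not from the paper): strain name -> IDs of
# ortholog clusters that contain a pseudogene annotated in that strain.
pseudogene_clusters = {
    "strain_A": {"c1", "c2", "c3", "c4"},
    "strain_B": {"c2", "c3", "c4", "c5"},
    "strain_C": {"c7", "c8"},
}

# Pairwise similarity for every unordered pair of strains.
pairwise = {
    (s, t): jaccard(pseudogene_clusters[s], pseudogene_clusters[t])
    for s, t in combinations(sorted(pseudogene_clusters), 2)
}

for (s, t), score in pairwise.items():
    print(f"{s} vs {t}: {score:.2f}")
```

In this toy data, strain_A and strain_B share three of five distinct clusters, while strain_C shares none with either. Comparing such scores against distances on a phylogenetic tree would be one way to probe the relationship the abstract describes.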
Pages: 15
Related papers
30 items in total
[1]   Release factor 2 frameshifting sites in different bacteria [J].
Baranov, PV ;
Gesteland, RF ;
Atkins, JF .
EMBO REPORTS, 2002, 3 (04) :373-377
[2]   Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis [J].
Castresana, J .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (04) :540-552
[3]   MULTIPLE COMPARISONS USING RANK SUMS [J].
DUNN, OJ .
TECHNOMETRICS, 1964, 6 (03) :241-&
[4]   "Pseudo-pseudogenes" in bacterial genomes: Proteogenomics reveals a wide but low protein expression of pseudogenes in Salmonella enterica [J].
Feng, Ye ;
Wang, Zeyu ;
Chien, Kun-Yi ;
Chen, Hsiu-Ling ;
Liang, Yi-Hua ;
Hua, Xiaoting ;
Chiu, Cheng-Hsun .
NUCLEIC ACIDS RESEARCH, 2022, 50 (09) :5158-5170
[5]   CD-HIT: accelerated for clustering the next-generation sequencing data [J].
Fu, Limin ;
Niu, Beifang ;
Zhu, Zhengwei ;
Wu, Sitao ;
Li, Weizhong .
BIOINFORMATICS, 2012, 28 (23) :3150-3152
[6]   Large-scale and significant expression from pseudogenes in Sodalis glossinidius - a facultative bacterial endosymbiont [J].
Goodhead, Ian ;
Blow, Frances ;
Brownridge, Philip ;
Hughes, Margaret ;
Kenny, John ;
Krishna, Ritesh ;
McLean, Lynn ;
Pongchaikul, Pisut ;
Beynon, Rob ;
Darby, Alistair C. .
MICROBIAL GENOMICS, 2020, 6 (01)
[7]   Taking the pseudo out of pseudogenes [J].
Goodhead, Ian ;
Darby, Alistair C. .
CURRENT OPINION IN MICROBIOLOGY, 2015, 23 :102-109
[8]   Digging for dead genes: an analysis of the characteristics of the pseudogene population in the Caenorhabditis elegans genome [J].
Harrison, PM ;
Echols, N ;
Gerstein, MB .
NUCLEIC ACIDS RESEARCH, 2001, 29 (03) :818-830
[9]   A comprehensive evaluation of assembly scaffolding tools [J].
Hunt, Martin ;
Newbold, Chris ;
Berriman, Matthew ;
Otto, Thomas D. .
GENOME BIOLOGY, 2014, 15 (03)
[10]   Jaccard P., 1912, The New Phytologist, V11, P37, DOI 10.1111/j.1469-8137.1912.tb05611.x