Fungal genomes: suffering with functional annotation errors

Cited by: 3
Authors
Mohanta, Tapan Kumar [1]
Al-Harrasi, Ahmed [1]
Affiliations
[1] Univ Nizwa, Nat & Med Sci Res Ctr, Nizwa 616, Oman
Keywords
Fungal genome; Fungi; Genome; Annotation; Selenoprotein; WRKY; Calcium signaling; Calcium dependent protein kinase; DEPENDENT PROTEIN-KINASE; CALCIUM; EVOLUTION; CDPK;
DOI
10.1186/s43008-021-00083-x
Chinese Library Classification number
Q93 [Microbiology];
Discipline classification code
071005 ; 100705 ;
Abstract
Background: Genome sequence data for more than 65985 species were publicly available as of October 2021 in the National Center for Biotechnology Information (NCBI) database alone; additional genome sequences are available in other databases and continue to accumulate at a rapid pace. However, error-free functional annotation of these genomes is essential if research communities are to use these data efficiently.
Results: An analysis of proteome sequence data from 689 fungal species (7.15 million protein sequences) was conducted to identify functional annotation errors. The analysis targeted proteins associated with calcium signaling events, including calcium-dependent protein kinases (CDPKs), calmodulins (CaMs), and calmodulin-like (CML) proteins, as well as WRKY transcription factors, selenoproteins, and proteins associated with the terpene biosynthesis pathway. Genes encoding CDPKs and selenoproteins are known to be absent from fungal genomes. Our analysis, however, revealed proteins that were functionally annotated as CDPKs. InterProScan analysis indicated that none of the protein sequences annotated as "calcium dependent protein kinase" contained calcium-binding EF-hands in the regulatory domain. Similarly, none of the protein sequences annotated as "selenocysteine" contained a Sec (U) amino acid. Proteins annotated as CaMs and CMLs also showed significant discrepancies. CaM proteins should contain four calcium-binding EF-hands; however, the fungal proteins annotated as CaM contained between two and four. Similarly, CMLs should possess four calcium-binding EF-hands, but some of the CML-annotated fungal proteins possessed either three or four. WRKY transcription factors are characterized by the presence of a WRKY domain and are confined to the plant kingdom. Several fungal proteins, however, were annotated as WRKY transcription factors even though they did not contain a WRKY domain.
Conclusion: The presence of functional annotation errors in fungal genome and proteome databases is of considerable concern and needs to be addressed in a timely manner.
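The selenoprotein check described in the abstract (an annotation that names a selenoprotein, but a sequence with no Sec residue, written U) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the keyword list, and the toy FASTA records below are all assumptions made for illustration, and a real screen would also need domain-level evidence such as InterProScan output.

```python
from typing import Iterator, List, Sequence, Tuple


def parse_fasta(text: str) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header = None
    chunks: List[str] = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def flag_suspect_selenoproteins(
    fasta_text: str,
    keywords: Sequence[str] = ("selenoprotein", "selenocysteine"),  # assumed keyword list
) -> List[str]:
    """Return headers whose annotation mentions a keyword but whose sequence has no 'U'."""
    suspects = []
    for header, seq in parse_fasta(fasta_text):
        annotated = any(k in header.lower() for k in keywords)
        if annotated and "U" not in seq.upper():
            suspects.append(header)
    return suspects


# Toy records with made-up sequences, for illustration only.
EXAMPLE = """>p1 selenoprotein (toy, contains U)
MALAVRVVYCGAUGYKSKY
>p2 selenoprotein-like protein (toy, no U)
MKTLLVAGCSAAD
>p3 calmodulin (toy)
MADQLTEEQ
"""

if __name__ == "__main__":
    for header in flag_suspect_selenoproteins(EXAMPLE):
        print("possible annotation error:", header)
```

A flagged header is only a candidate for review. Some legitimately named proteins mention selenocysteine without containing it, so keyword matching alone will produce false positives.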
Pages: 7