Towards Automatic Detecting of Overlapping Genes - Clustered BLAST Analysis of Viral Genomes

Authors
Neuhaus, Klaus [1]
Oelke, Daniela [2]
Fuerst, David [3]
Scherer, Siegfried [1]
Keim, Daniel A. [2]
Affiliations
[1] Tech Univ Munich, Chair Microbial Ecol, Weihenstephaner Berg 3, D-85354 Freising Weihenstephan, Germany
[2] Univ Konstanz, Chair Data Anal & Visualizat, D-78457 Constance 78457, Germany
[3] Rheinisch Westf Techn Hochsch Aachen, Chair Data Management & Data Explorat, Aachen 52056, Germany
Source
EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, PROCEEDINGS | 2010 / Vol. 6023
Keywords
overlapping genes; clustering; BLAST analysis; EVOLUTION; COMPRESSION; STABILITY; BACTERIA; SEQUENCE; DATABASE;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Overlapping genes (encoded on the same DNA locus but in different frames) are thought to be rare and, therefore, were largely neglected in the past. In a test set of 800 viruses we found more than 350 potential overlapping open reading frames of >500 bp which generate BLAST hits, indicating a possible biological function. Interestingly, five overlaps of more than 2000 bp were found; the largest may even contain triple overlaps. In order to perform the vast number of BLAST searches required to test all detected open reading frames, we compared two clustering strategies (BLASTCLUST and k-means) and queried the database with one representative only. Our results show that this approach achieves a significant speed-up while retaining a high quality of the results (>99% precision compared to single queries) for both clustering methods. Future wet lab experiments are needed to show whether the detected overlapping reading frames are biologically functional.
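
The abstract does not say how the overlapping open reading frames were detected, so the following Python sketch is only an illustration of the general idea: scan all six reading frames for ORFs and report pairs that share sequence but lie in different frames. The function names (`find_orfs`, `overlapping_pairs`), the use of ATG as the only start codon, the standard stop codons, and the default 500 bp cutoff (loosely mirroring the ">500 bp" figure above) are assumptions made for this example, not the authors' method. The sketch does not perform the BLAST searches or the clustering step.

```python
from itertools import combinations

# Assumption: standard genetic code stop codons, ATG as the only start codon.
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_orfs(seq, min_len=500):
    """Find ORFs (first ATG after a stop, up to and including the next stop).

    Returns (start, end, frame) tuples with 0-based, end-exclusive
    coordinates on the forward strand; frame is +1..+3 or -1..-3.
    """
    seq = seq.upper()
    n = len(seq)
    orfs = []
    for strand, s in ((1, seq), (-1, reverse_complement(seq))):
        for offset in range(3):
            start = None
            for i in range(offset, n - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and codon in STOP_CODONS:
                    end = i + 3
                    if end - start >= min_len:
                        if strand == 1:
                            orfs.append((start, end, offset + 1))
                        else:
                            # Map reverse-strand coordinates back to the forward strand.
                            orfs.append((n - end, n - start, -(offset + 1)))
                    start = None
    return orfs


def overlapping_pairs(orfs, min_overlap=1):
    """Return (orf_a, orf_b, overlap_bp) for ORF pairs in different frames."""
    pairs = []
    for a, b in combinations(orfs, 2):
        if a[2] == b[2]:
            continue  # same frame: not an overlapping-gene candidate
        overlap = min(a[1], b[1]) - max(a[0], b[0])
        if overlap >= min_overlap:
            pairs.append((a, b, overlap))
    return pairs
```

For a toy input such as `find_orfs("ATGCATGCCCCCTAACTGA", min_len=12)`, the two forward-strand ORFs in frames +1 and +2 would be passed to `overlapping_pairs` to obtain the shared region. In a real pipeline each candidate would then still need a BLAST query (or a query for its cluster representative) before any biological interpretation.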
Pages: 228 / +
Number of pages: 4