Accurate annotation of protein-coding genes in mitochondrial genomes

被引：29

作者：

Al Arab, Marwa ^{[1
,2
,8
]}

zu Siederdissen, Christian Hoener ^{[1
,2
,3
]}

Tout, Kifah ^{[8
]}

Sahyoun, Abdullah H. ^{[1
,2
,8
,9
]}

Stadler, Peter F. ^{[1
,2
,3
,4
,5
,6
,7
]}

Bernt, Matthias ^{[1
,10
]}

机构：

[1] Univ Leipzig, Dept Comp Sci, Bioinformat Grp, Hartelstr 16-18, D-04107 Leipzig, Germany

[2] Univ Leipzig, Interdisciplinary Ctr Bioinformat, Hartelstr 16-18, D-04107 Leipzig, Germany

[3] Univ Vienna, Inst Theoret Chem, Wahringerstr 17, A-1090 Vienna, Austria

[4] Max Planck Inst Math Sci, Inselstr 22, D-04103 Leipzig, Germany

[5] Fraunhofer Inst Zelltherapie & Immunol, Perlickstr 1, D-04103 Leipzig, Germany

[6] Univ Copenhagen, Ctr Noncoding RNA Technol & Hlth, Gronnegardsvej 3, DK-1870 Frederiksberg C, Denmark

[7] Santa Fe Inst, 1399 Hyde Pk Rd, Santa Fe, NM 87501 USA

[8] Lebanese Univ, Doctoral Sch Sci & Technol, AZM Ctr Biotechnol Res, Tripoli, Lebanon

[9] Johannes Gutenberg Univ Mainz gGmbH, Univ Med Ctr, TRON Translat Oncol, Mainz, Germany

[10] Univ Leipzig, Parallel Comp & Complex Syst Grp, Dept Comp Sci, Augustuspl 10, D-04103 Leipzig, Germany

来源：

MOLECULAR PHYLOGENETICS AND EVOLUTION | 2017年 / 106卷

关键词：

Protein coding genes; Metazoa; Mitochondrial DNA; Annotation; Hidden Markov models; AUTOMATIC ANNOTATION; SEQUENCE; PHYLOGENY; DNA; TRANSCRIPTS; ALIGNMENTS; DATABASE; TURTLES; BIRDS; CODE;

D O I：

10.1016/j.ympev.2016.09.024

中图分类号：

Q5 [生物化学]; Q7 [分子生物学];

学科分类号：

071010 ; 081704 ;

摘要：

Mitochondrial genome sequences are available in large number and new sequences become published nowadays with increasing pace. Fast, automatic, consistent, and high quality annotations are a prerequisite for downstream analyses. Therefore, we present an automated pipeline for fast de novo annotation of mitochondrial protein-coding genes. The annotation is based on enhanced phylogeny-aware hidden Markov models (HMMs). The pipeline builds taxon-specific enhanced multiple sequence alignments (MSA) of already annotated sequences and corresponding HMMs using an approximation of the phylogeny. The MSAs are enhanced by fixing unannotated frameshifts, purging of wrong sequences, and removal of non-conserved columns from both ends. A comparison with reference annotations highlights the high quality of the results. The frameshift correction method predicts a large number of frameshifts, many of which are unknown. A detailed analysis of the frameshifts in nad3 of the Archosauria-Testudines group has been conducted. (C) 2016 Elsevier Inc. All rights reserved.

引用

页码：209 / 216

页数：8

共 50 条

[31] Protein-Coding Genes' Retrocopies and Their Functions
Kubiak, Magdalena Regina
Makalowska, Izabela
VIRUSES-BASEL, 2017, 9 (04):
[32] Introns in protein-coding genes in Archaea
Watanabe, Y
Yokobori, S
Inaba, T
Yamagishi, A
Oshima, T
Kawarabayasi, Y
Kikuchi, H
Kita, K
FEBS LETTERS, 2002, 510 (1-2) : 27 - 30
[33] Origins of new protein-coding genes
不详
SCIENCE, 2021, 371 (6531) : 779 - 780
[34] Phylogenetic performance of mitochondrial protein-coding genes in resolving relationships among vertebrates
Zardoya, R
Meyer, A
MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (07) : 933 - 942
[35] Analysis of codon usage pattern of mitochondrial protein-coding genes in different hookworms
Deb, Bornali
Uddin, Arif
Mazumder, Gulshana Akthar
Chakraborty, Supriyo
MOLECULAR AND BIOCHEMICAL PARASITOLOGY, 2018, 219 : 24 - 32
[36] Analysis of mitochondrial protein-coding genes ofAntheraea assamensis: Muga silkworm of Assam
Uddin, Arif
Chakraborty, Supriyo
ARCHIVES OF INSECT BIOCHEMISTRY AND PHYSIOLOGY, 2021, 106 (01)
[37] Nucleotide substitution rates for the full set of mitochondrial protein-coding genes in Coleoptera
Pons, Joan
Ribera, Ignacio
Bertranpetit, Jaume
Balke, Michael
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2010, 56 (02) : 796 - 807
[38] Comparative analysis of sequences preceding protein-coding mitochondrial genes in flowering plants
Hazle, Thomas
Bonen, Linda
MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (05) : 1101 - 1112
[39] Mitochondrial-encoded endonucleases drive recombination of protein-coding genes in yeast
Wu, Baojun
Hao, Weilong
ENVIRONMENTAL MICROBIOLOGY, 2019, 21 (11) : 4233 - 4240
[40] Two mitochondrial genomes from the families Bethylidae and Mutillidae: Independent rearrangement of protein-coding genes and higher-level phylogeny of the Hymenoptera
Wei, Shu-Jun
Li, Qian
van Achterberg, Kees
Chen, Xue-Xin
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2014, 77 : 1 - 10

← 1 2 3 4 5 →