CPGView: A package for visualizing detailed chloroplast genome structures

被引:320
作者
Liu, Shengyu [1 ,2 ]
Ni, Yang [1 ]
Li, Jingling [1 ]
Zhang, Xinyi [1 ]
Yang, Heyu [1 ]
Chen, Haimei [1 ]
Liu, Chang [1 ,3 ]
机构
[1] Chinese Acad Med Sci & Peking Union Med Coll, Inst Med Plant Dev, Beijing, Peoples R China
[2] Chinese Acad Med Sci & Peking Union Med Coll, Inst Med Informat & Lib, Dept Med Data Sharing, Beijing, Peoples R China
[3] Chinese Acad Med Sci & Peking Union Med Coll, Inst Med Plant Dev, Beijing 100193, Peoples R China
基金
美国国家科学基金会;
关键词
cis-splicing genes; coordinate scaling algorithm; repeats; rps12; trans-splicing genes; ANNOTATION; TOOLS; ALIGNMENT; VERSATILE; PROGRAM; FORMAT;
D O I
10.1111/1755-0998.13729
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Chloroplast genomes have been widely used in studying plant phylogeny and evolution. Several chloroplast genome visualization tools have been developed to display the distribution of genes on the genome. However, these tools do not draw features, such as exons, introns, repetitive elements, and variable sites, disallowing in-depth examination of the genome structures. Here, we developed and validated a software package called Chloroplast Genome Viewers (CPGView). CPGView can draw three maps showing (i) the distributions of genes, variable sites, and repetitive sequences, including microsatellites, tandem and dispersed repeats; (ii) the structure of the cis-splicing genes after adjusting the exon-intron boundary positions using a coordinate scaling algorithm, and (iii) the structure of the trans-splicing gene rps12. To test the accuracy of CPGView, we sequenced, assembled, and annotated 31 chloroplast genomes from 31 genera of 22 families. CPGView drew maps correctly for all the 31 chloroplast genomes. Lastly, we used CPGView to examine 5998 publicly released chloroplast genomes from 2513 genera of 553 families. CPGView succeeded in plotting maps for 5882 but failed to plot maps for 116 chloroplast genomes. Further examination showed that the annotations of these 116 genomes had various errors needing manual correction. The test on newly generated data and publicly available data demonstrated the ability of CPGView to identify errors in the annotations of chloroplast genomes. CPGView will become a widely used tool to study the detailed structure of chloroplast genomes. The web version of CPGView can be accessed from .
引用
收藏
页码:694 / 704
页数:11
相关论文
共 29 条
  • [1] MISA-web: a web server for microsatellite prediction
    Beier, Sebastian
    Thiel, Thomas
    Muench, Thomas
    Scholz, Uwe
    Mascher, Martin
    [J]. BIOINFORMATICS, 2017, 33 (16) : 2583 - 2585
  • [2] Tandem repeats finder: a program to analyze DNA sequences
    Benson, G
    [J]. NUCLEIC ACIDS RESEARCH, 1999, 27 (02) : 573 - 580
  • [3] Biopython']python: freely available Python']Python tools for computational molecular biology and bioinformatics
    Cock, Peter J. A.
    Antao, Tiago
    Chang, Jeffrey T.
    Chapman, Brad A.
    Cox, Cymon J.
    Dalke, Andrew
    Friedberg, Iddo
    Hamelryck, Thomas
    Kauff, Frank
    Wilczynski, Bartek
    de Hoon, Michiel J. L.
    [J]. BIOINFORMATICS, 2009, 25 (11) : 1422 - 1423
  • [4] The variant call format and VCFtools
    Danecek, Petr
    Auton, Adam
    Abecasis, Goncalo
    Albers, Cornelis A.
    Banks, Eric
    DePristo, Mark A.
    Handsaker, Robert E.
    Lunter, Gerton
    Marth, Gabor T.
    Sherry, Stephen T.
    McVean, Gilean
    Durbin, Richard
    [J]. BIOINFORMATICS, 2011, 27 (15) : 2156 - 2158
  • [5] PLANN: A COMMAND-LINE APPLICATION FOR ANNOTATING PLASTOME SEQUENCES
    Huang, Daisie I.
    Cronk, Quentin C. B.
    [J]. APPLICATIONS IN PLANT SCIENCES, 2015, 3 (08):
  • [6] GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes
    Jin, Jian-Jun
    Yu, Wen-Bin
    Yang, Jun-Bo
    Song, Yu
    dePamphilis, Claude W.
    Yi, Ting-Shuang
    Li, De-Zhu
    [J]. GENOME BIOLOGY, 2020, 21 (01)
  • [7] AGORA: organellar genome annotation from the amino acid and nucleotide references
    Jung, Jaehee
    Kim, Jong Im
    Jeong, Young-Sik
    Yi, Gangman
    [J]. BIOINFORMATICS, 2018, 34 (15) : 2661 - 2663
  • [8] TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions
    Kim, Daehwan
    Pertea, Geo
    Trapnell, Cole
    Pimentel, Harold
    Kelley, Ryan
    Salzberg, Steven L.
    [J]. GENOME BIOLOGY, 2013, 14 (04):
  • [9] Gepard: a rapid and sensitive tool for creating dotplots on genome scale
    Krumsiek, Jan
    Arnold, Roland
    Rattei, Thomas
    [J]. BIOINFORMATICS, 2007, 23 (08) : 1026 - 1028
  • [10] Kurtz S., 2003, REF TYPE COMPUTER PR, P4