Estimating Gene Gain and Loss Rates in the Presence of Error in Genome Assembly and Annotation Using CAFE 3

Times cited: 660
Authors
Han, Mira V. [1, 2]
Thomas, Gregg W. C. [2 ]
Lugo-Martinez, Jose [2 ]
Hahn, Matthew W. [2, 3]
Affiliations
[1] Natl Evolutionary Synth Ctr, Durham, NC USA
[2] Indiana Univ, Sch Informat & Comp, Bloomington, IN 47405 USA
[3] Indiana Univ, Dept Biol, Bloomington, IN 47405 USA
Funding
National Science Foundation (USA)
Keywords
duplication; gene family; adaptive evolution; copy-number polymorphism; evolutionary trees; maximum-likelihood; family evolution; insights; divergence; sequence; death; life
DOI
10.1093/molbev/mst100
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular biology]
Discipline classification code
071010; 081704
Abstract
Current sequencing methods produce large amounts of data, but genome assemblies constructed from these data are often fragmented and incomplete. Incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. This means that methods attempting to estimate rates of gene duplication and loss often will be misled by such errors and that rates of gene family evolution will be consistently overestimated. Here, we present a method that takes these errors into account, allowing one to accurately infer rates of gene gain and loss among genomes even with low assembly and annotation quality. The method is implemented in the newest version of the software package CAFE, along with several other novel features. We demonstrate the accuracy of the method with extensive simulations and reanalyze several previously published data sets. Our results show that errors in genome annotation do lead to higher inferred rates of gene gain and loss but that CAFE 3 sufficiently accounts for these errors to provide accurate estimates of important evolutionary parameters.
Pages: 1987-1997
Number of pages: 11