Breaking Good: Accounting for Fragility of Genomic Regions in Rearrangement Distance Estimation

被引:33
作者
Biller, Priscila [1 ,2 ]
Gueguen, Laurent [3 ]
Knibbe, Carole [1 ,4 ]
Tannier, Eric [1 ,3 ]
机构
[1] INRIA Grenoble Rhone Alpes, Montbonnot St Martin, France
[2] Univ Estadual Campinas, Sao Paulo, Brazil
[3] Univ Lyon 1, LBBE, UMR5558, F-69622 Villeurbanne, France
[4] Univ Lyon 1, LIRIS, UMR5205, F-69622 Villeurbanne, France
来源
GENOME BIOLOGY AND EVOLUTION | 2016年 / 8卷 / 05期
基金
巴西圣保罗研究基金会;
关键词
rearrangements; inversions; random graphs; amniote genomes; uniform breakpoint model; fragile breakpoint model; BREAKPOINT REUSE; BREAKAGE MODELS; EVOLUTION; DISTRIBUTIONS;
D O I
10.1093/gbe/evw083
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Models of evolution by genome rearrangements are prone to two types of flaws: One is to ignore the diversity of susceptibility to breakage across genomic regions, and the other is to suppose that susceptibility values are given. Without necessarily supposing their precise localization, we call "solid" the regions that are improbably broken by rearrangements and "fragile" the regions outside solid ones. We propose a model of evolution by inversions where breakage probabilities vary across fragile regions and over time. It contains as a particular case the uniform breakage model on the nucleotidic sequence, where breakage probabilities are proportional to fragile region lengths. This is very different from the frequently used pseudouniform model where all fragile regions have the same probability to break. Estimations of rearrangement distances based on the pseudouniform model completely fail on simulations with the truly uniform model. On pairs of amniote genomes, we show that identifying coding genes with solid regions yields incoherent distance estimations, especially with the pseudouniform model, and to a lesser extent with the truly uniform model. This incoherence is solved when we coestimate the number of fragile regions with the rearrangement distance. The estimated number of fragile regions is surprisingly small, suggesting that a minority of regions are recurrently used by rearrangements. Estimations for several pairs of genomes at different divergence times are in agreement with a slowly evolvable colocalization of active genomic regions in the cell.
引用
收藏
页码:1427 / 1439
页数:13
相关论文
共 49 条
[1]   Are there rearrangement hotspots in the human genome? [J].
Alekseyev, Max A. ;
Pevzner, Pavel A. .
PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (11) :2111-2121
[2]   Comparative genomics reveals birth and death of fragile regions in mammalian evolution [J].
Alekseyev, Max A. ;
Pevzner, Pavel A. .
GENOME BIOLOGY, 2010, 11 (11)
[3]  
Alexeev N, 2015, EVOLUTIONARY DISTANC
[4]  
Alexeev N, 2015, LECT N BIOINFORMAT, V9043, P471, DOI 10.1007/978-3-319-16483-0_46
[5]   The rise and fall of breakpoint reuse depending on genome resolution [J].
Attie, Oliver ;
Darling, Aaron E. ;
Yancopoulos, Sophia .
BMC BIOINFORMATICS, 2011, 12
[6]  
Baudet C, 2014, LECT N BIOINFORMAT, V8826, P99, DOI 10.1007/978-3-319-12418-6_13
[7]   The random versus fragile breakage models of chromosome evolution: a matter of resolution [J].
Becker, Thomas S. ;
Lenhard, Boris .
MOLECULAR GENETICS AND GENOMICS, 2007, 278 (05) :487-491
[8]   Paleontological evidence to date the tree of life [J].
Benton, Michael J. ;
Donoghue, Philip C. J. .
MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (01) :26-53
[9]   A phase transition in the random transposition random walk [J].
Berestycki, Nathanael ;
Durrett, Rick .
PROBABILITY THEORY AND RELATED FIELDS, 2006, 136 (02) :203-233
[10]  
Bergeron A, 2006, P REC COMP GEN 2008, P226