DCJ-Indel sorting revisited

被引:26
作者
Compeau, Phillip E. C. [1 ]
机构
[1] Univ Calif San Diego, Dept Math, La Jolla, CA 92093 USA
关键词
Genome rearrangements; DCJ; Indels; Sorting; Solution space;
D O I
10.1186/1748-7188-8-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The introduction of the double cut and join operation (DCJ) caused a flurry of research into the study of multichromosomal rearrangements. However, little of this work has incorporated indels (i.e., insertions and deletions of chromosomes and chromosomal intervals) into the calculation of genomic distance functions, with the exception of Braga et al., who provided a linear time algorithm for the problem of DCJ-indel sorting. Although their algorithm only takes linear time, its derivation is lengthy and depends on a large number of possible cases. Results: We note the simple idea that a deletion of a chromosomal interval can be viewed as a DCJ that creates a new circular chromosome. This framework will allow us to amortize indels as DCJs, which in turn permits the application of the classical breakpoint graph to obtain a simplified indel model that still solves the problem of DCJ-indel sorting in linear time via a more concise formulation that relies on the simpler problem of DCJ sorting. Furthermore, we can extend this result to fully characterize the solution space of DCJ-indel sorting. Conclusions: Encoding indels as DCJ operations offers a new insight into why the problem of DCJ-indel sorting is not ultimately any more difficult than that of sorting by DCJs alone. There is still room for research in this area, most notably the problem of sorting when the cost of indels is allowed to vary with respect to the cost of a DCJ and we demand a minimum cost transformation of one genome into another.
引用
收藏
页数:9
相关论文
共 17 条
[1]   Genome rearrangements and sorting by reversals [J].
Bafna, V ;
Pevzner, PA .
SIAM JOURNAL ON COMPUTING, 1996, 25 (02) :272-289
[2]  
Bergeron A, 2006, LECT NOTES COMPUT SC, V4175, P163
[3]   On the weight of indels in genomic distances [J].
Braga, Marilia D. V. ;
Machado, Raphael ;
Ribeiro, Leonardo C. ;
Stoye, Jens .
BMC BIOINFORMATICS, 2011, 12
[4]  
Braga MDV, 2010, LECT N BIOINFORMAT, V6293, P90, DOI 10.1007/978-3-642-15294-8_8
[5]   The Solution Space of Sorting by DCJ [J].
Braga, Marilia D. V. ;
Stoye, Jens .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2010, 17 (09) :1145-1165
[6]  
Bulteau L, PREPRINT
[7]   An (18/11)n upper bound for sorting by prefix reversals [J].
Chitturi, B. ;
Fahle, W. ;
Meng, Z. ;
Morales, L. ;
Shields, C. O. ;
Sudborough, I. H. ;
Voit, W. .
THEORETICAL COMPUTER SCIENCE, 2009, 410 (36) :3372-3390
[8]  
Compeau Phillip E. C., 2012, Algorithms in Bioinformatics. Proceedings of the12th International Workshop, WABI 2012, P365, DOI 10.1007/978-3-642-33122-0_29
[9]  
Dobzhansky T, 1938, GENETICS, V23, P28
[10]  
Dweighter H., 1975, Am. Math. Mon., V82, P1010