Sorting Permutations by Intergenic Operations

被引:8
作者
Oliveira, Andre Rodrigues [1 ]
Jean, Geraldine [2 ]
Fertin, Guillaume [2 ]
Brito, Klairton Lima [1 ]
Dias, Ulisses [3 ]
Dias, Zanoni [1 ]
机构
[1] Univ Estadual Campinas, Inst Comp, BR-13083970 Campinas, Brazil
[2] Univ Nantes, LS2N, CNRS, UMR 6004, F-44035 Nantes, France
[3] Univ Estadual Campinas, Sch Technol, BR-13484350 Limeira, Brazil
基金
巴西圣保罗研究基金会;
关键词
Genomics; Sorting; Approximation algorithms; Transforms; Bioinformatics; DNA; Phylogeny; Genome rearrangements; intergenic regions; reversals; transpositions; POLYNOMIAL ALGORITHM; REVERSALS; TRANSPOSITIONS;
D O I
10.1109/TCBB.2021.3077418
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Genome Rearrangements are events that affect large stretches of genomes during evolution. Many mathematical models have been used to estimate the evolutionary distance between two genomes based on genome rearrangements. However, most of them focused on the (order of the) genes of a genome, disregarding other important elements in it. Recently, researchers have shown that considering regions between each pair of genes, called intergenic regions, can enhance distance estimation in realistic data. Two of the most studied genome rearrangements are the reversal, which inverts a sequence of genes, and the transposition, which occurs when two adjacent gene sequences swap their positions inside the genome. In this work, we study the transposition distance between two genomes, but we also consider intergenic regions, a problem we name Sorting by Intergenic Transpositions. We show that this problem is NP-hard and propose two approximation algorithms, with factors 3.5 and 2.5, considering two distinct definitions for the problem. We also investigate the signed reversal and transposition distance between two genomes considering their intergenic regions. This second problem is called Sorting by Signed Intergenic Reversals and Intergenic Transpositions. We show that this problem is NP-hard and develop two approximation algorithms, with factors 3 and 2.5. We check how these algorithms behave when assigning weights for genome rearrangements. Finally, we implemented all these algorithms and tested them on real and simulated data.
引用
收藏
页码:2080 / 2093
页数:14
相关论文
共 30 条
[1]   A GRASP-Based Heuristic for the Sorting by Length-Weighted Inversions Problem [J].
Arruda, Thiago da Silva ;
Dias, Ulisses ;
Dias, Zanoni .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (02) :352-363
[2]   Sorting by weighted reversals, transpositions, and inverted transpositions [J].
Bader, Martin ;
Ohlebusch, Enno .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2007, 14 (05) :615-636
[3]   Genome rearrangements and sorting by reversals [J].
Bafna, V ;
Pevzner, PA .
SIAM JOURNAL ON COMPUTING, 1996, 25 (02) :272-289
[4]   Sorting by transpositions [J].
Bafna, V ;
Pevzner, PA .
SIAM JOURNAL ON DISCRETE MATHEMATICS, 1998, 11 (02) :224-240
[5]   Comparative Genomics on Artificial Life [J].
Biller, Priscila ;
Knibbe, Carole ;
Beslon, Guillaume ;
Tannier, Eric .
PURSUIT OF THE UNIVERSAL, 2016, 9709 :35-44
[6]   Breaking Good: Accounting for Fragility of Genomic Regions in Rearrangement Distance Estimation [J].
Biller, Priscila ;
Gueguen, Laurent ;
Knibbe, Carole ;
Tannier, Eric .
GENOME BIOLOGY AND EVOLUTION, 2016, 8 (05) :1427-1439
[7]   Computing the Rearrangement Distance of Natural Genomes [J].
Bohnenkamper, Leonard ;
Braga, Marilia D. V. ;
Doerr, Daniel ;
Stoye, Jens .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2021, 28 (04) :410-431
[8]   Sorting by Genome Rearrangements on Both Gene Order and Intergenic Sizes [J].
Brito, Klairton Lima ;
Jean, Geraldine ;
Fertin, Guillaume ;
Oliveira, Andre Rodrigues ;
Dias, Ulisses ;
Dias, Zanoni .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2020, 27 (02) :156-174
[9]   Genome rearrangements with indels in intergenes restrict the scenario space [J].
Bulteau, Laurent ;
Fertin, Guillaume ;
Tannier, Eric .
BMC BIOINFORMATICS, 2016, 17
[10]   SORTING BY TRANSPOSITIONS IS DIFFICULT [J].
Bulteau, Laurent ;
Fertin, Guillaume ;
Rusu, Irena .
SIAM JOURNAL ON DISCRETE MATHEMATICS, 2012, 26 (03) :1148-1180