Constructing a Gene Team Tree in Almost O(n lg n) Time

Cited by: 3

Authors:

Wang, Biing-Feng [1]

Lin, Chien-Hsin [1]

Yang, I-Tse [1]

Affiliation:

[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 30013, Taiwan

Source:

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS | 2014 / Vol. 11 / No. 01

Keywords:

Algorithms; data structures; gene teams; comparative genomics; conserved gene clusters; COMMON INTERVALS; CLUSTERS; ALGORITHMS; CONSERVATION; OPERONS; ORDER; SETS;

DOI:

10.1109/TCBB.2013.150

Chinese Library Classification:

Q5 [Biochemistry];

Discipline classification codes:

071010 ; 081704 ;

Abstract:

The gene team model is an important model of conserved gene clusters. In this model, a chromosome is defined to be a permutation of distinct genes, and a gene team is defined to be a set of genes that appears in two or more species such that, on each chromosome, the distance between adjacent genes of the team never exceeds a given threshold δ. A gene team tree is a succinct way to represent all gene teams for every possible value of δ. The previous fastest algorithm for constructing a gene team tree of two chromosomes, given by Wang and Lin, requires O(n lg n lg lg n) time. Its bottleneck is a problem called the maximum-gap problem. In this paper, by presenting an improved algorithm for the maximum-gap problem, we reduce the upper bound of the gene team tree problem to O(n lg n α(n)). Since α grows extremely slowly, this result is almost as efficient as the current best upper bound, O(n lg n), for finding the gene teams of a fixed δ value. Our new algorithm is very efficient from both the theoretical and practical points of view. Wang and Lin's gene-team-tree algorithm can be extended to k chromosomes with complexity O(kn lg n lg lg n). Similarly, our improved algorithm for the maximum-gap problem reduces this running time to O(kn lg n α(n)). In addition, it provides new upper bounds for the gene team tree problem on general sequences, in which multiple copies of the same gene are allowed.
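To make the gene team definition in the abstract concrete, the following is a minimal Python sketch that finds the teams of two chromosomes for one fixed threshold delta (δ) by naive repeated splitting. It is not the algorithm of the paper and does not reach its time bounds. The names `split_by_gaps` and `gene_teams`, and the representation of a chromosome as a dictionary from gene name to position, are assumptions made only for this illustration.

```python
def split_by_gaps(genes, pos, delta):
    """Split genes into maximal runs whose consecutive positions differ by <= delta."""
    ordered = sorted(genes, key=pos.__getitem__)
    runs, current = [], [ordered[0]]
    for gene in ordered[1:]:
        if pos[gene] - pos[current[-1]] > delta:
            runs.append(current)      # gap too large: close the current run
            current = [gene]
        else:
            current.append(gene)
    runs.append(current)
    return runs


def gene_teams(pos_a, pos_b, delta):
    """Return the gene teams for threshold delta as a set of frozensets.

    pos_a and pos_b map each gene to its position on chromosome A and B
    (hypothetical input format). A candidate set is a team once it can no
    longer be split by a gap larger than delta on either chromosome.
    """
    common = sorted(set(pos_a) & set(pos_b))
    if not common:
        return set()
    teams = set()
    pending = [common]
    while pending:
        genes = pending.pop()
        parts = split_by_gaps(genes, pos_a, delta)
        if len(parts) == 1:
            parts = split_by_gaps(genes, pos_b, delta)
        if len(parts) == 1:
            teams.add(frozenset(genes))   # no split on A or B: a team
        else:
            pending.extend(parts)         # refine each part further
    return teams


if __name__ == "__main__":
    pos_a = {"a": 1, "b": 2, "c": 4, "d": 8, "e": 9}
    pos_b = {"a": 1, "c": 2, "b": 6, "e": 7, "d": 8}
    for delta in (2, 3, 4):
        print(delta, sorted(sorted(team) for team in gene_teams(pos_a, pos_b, delta)))
```

On this toy input, the teams found for a larger delta are unions of the teams found for a smaller one. That nesting is what lets a single tree represent the teams for every threshold.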


Pages: 142-153

Number of pages: 12

