Quaternionic periodicity transform: an algebraic solution to the tandem repeat detection problem

被引:21
作者
Brodzik, Andrzej K. [1 ]
机构
[1] Mitre Corp, Bedford, MA 01730 USA
关键词
D O I
10.1093/bioinformatics/btl674
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: One of the main tasks of DNA sequence analysis is identification of repetitive patterns. DNA symbol repetitions play a key role in a number of applications, including prediction of gene and exon locations, identification of diseases, reconstruction of human evolutionary history and DNA forensics. Results: A new approach towards identification of tandem repeats in DNA sequences is proposed. The approach is a refinement of previously considered method, based on the complex periodicity transform. The refinement is obtained, among others, by mapping of DNA symbols to pure quaternions. This mapping results in an enhanced, symbol-balanced sensitivity of the transform to DNA patterns, and an unambiguous threshold selection criterion. Computational efficiency of the transform is further improved, and coupling of the computation with the period value is removed, thereby facilitating parallel implementation of the algorithm. Additionally, a post-processing stage is inserted into the algorithm, enabling unambiguous display of results in a convenient graphical format. Comparison of the quaternionic periodicity transform with two well-known pattern detection techniques shows that the new approach is competitive with these two techniques in detection of exact and approximate repeats.
引用
收藏
页码:694 / 700
页数:7
相关论文
共 27 条
[11]   Friedreich's ataxia: Autosomal recessive disease caused by an intronic GAA triplet repeat expansion [J].
Campuzano, V ;
Montermini, L ;
Molto, MD ;
Pianese, L ;
Cossee, M ;
Cavalcanti, F ;
Monros, E ;
Rodius, F ;
Duclos, F ;
Monticelli, A ;
Zara, F ;
Canizares, J ;
Koutnikova, H ;
Bidichandani, SI ;
Gellera, C ;
Brice, A ;
Trouillas, P ;
DeMichele, G ;
Filla, A ;
DeFrutos, R ;
Palau, F ;
Patel, PI ;
DiDonato, S ;
Mandel, JL ;
Cocozza, S ;
Koenig, M ;
Pandolfo, M .
SCIENCE, 1996, 271 (5254) :1423-1427
[12]   Genomics and microbiology - Microbial forensics - "Cross-examining pathogens" [J].
Cummings, CA ;
Relman, DA .
SCIENCE, 2002, 296 (5575) :1976-+
[13]   AN UNSTABLE TRIPLET REPEAT IN A GENE RELATED TO MYOTONIC MUSCULAR-DYSTROPHY [J].
FU, YH ;
PIZZUTI, A ;
FENWICK, RG ;
KING, J ;
RAJNARAYAN, S ;
DUNNE, PW ;
DUBEL, J ;
NASSER, GA ;
ASHIZAWA, T ;
DEJONG, P ;
WIERINGA, B ;
KORNELUK, R ;
PERRYMAN, MB ;
EPSTEIN, HF ;
CASKEY, CT .
SCIENCE, 1992, 255 (5049) :1256-1258
[14]   Myelin basic protein gene is associated with MS in DR4- and DR5-positive Italians and Russians [J].
Guerini, FR ;
Ferrante, P ;
Losciale, L ;
Caputo, D ;
Lombardi, ML ;
Pirozzi, G ;
Luongo, V ;
Sudomoina, MA ;
Andreewski, TV ;
Alekseenkov, AD ;
Boiko, AN ;
Gusev, EI ;
Favorova, OO .
NEUROLOGY, 2003, 61 (04) :520-526
[15]  
HAMILTON W. R., 1866, Elements of quaternions
[16]  
Hauth Amy M, 2002, Bioinformatics, V18 Suppl 1, pS31
[17]  
Kantor I. L., 1989, HYPERCOMPLEX NUMBERS
[18]   Exhaustive whole-genome tandem repeats search [J].
Krishnan, A ;
Tang, F .
BIOINFORMATICS, 2004, 20 (16) :2702-2710
[19]   Initial sequencing and analysis of the human genome [J].
Lander, ES ;
Int Human Genome Sequencing Consortium ;
Linton, LM ;
Birren, B ;
Nusbaum, C ;
Zody, MC ;
Baldwin, J ;
Devon, K ;
Dewar, K ;
Doyle, M ;
FitzHugh, W ;
Funke, R ;
Gage, D ;
Harris, K ;
Heaford, A ;
Howland, J ;
Kann, L ;
Lehoczky, J ;
LeVine, R ;
McEwan, P ;
McKernan, K ;
Meldrim, J ;
Mesirov, JP ;
Miranda, C ;
Morris, W ;
Naylor, J ;
Raymond, C ;
Rosetti, M ;
Santos, R ;
Sheridan, A ;
Sougnez, C ;
Stange-Thomann, N ;
Stojanovic, N ;
Subramanian, A ;
Wyman, D ;
Rogers, J ;
Sulston, J ;
Ainscough, R ;
Beck, S ;
Bentley, D ;
Burton, J ;
Clee, C ;
Carter, N ;
Coulson, A ;
Deadman, R ;
Deloukas, P ;
Dunham, A ;
Dunham, I ;
Durbin, R ;
French, L .
NATURE, 2001, 409 (6822) :860-921
[20]   Interleukin-6 gene alleles affect the risk of Alzheimer's disease and levels of the cytokine in blood and brain [J].
Licastro, F ;
Grimaldi, LME ;
Bonafè, M ;
Martina, C ;
Olivieri, F ;
Cavallone, L ;
Giovanietti, S ;
Masliah, E ;
Franceschi, C .
NEUROBIOLOGY OF AGING, 2003, 24 (07) :921-926