Quaternionic periodicity transform: an algebraic solution to the tandem repeat detection problem

被引:21
作者
Brodzik, Andrzej K. [1 ]
机构
[1] Mitre Corp, Bedford, MA 01730 USA
关键词
D O I
10.1093/bioinformatics/btl674
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: One of the main tasks of DNA sequence analysis is identification of repetitive patterns. DNA symbol repetitions play a key role in a number of applications, including prediction of gene and exon locations, identification of diseases, reconstruction of human evolutionary history and DNA forensics. Results: A new approach towards identification of tandem repeats in DNA sequences is proposed. The approach is a refinement of previously considered method, based on the complex periodicity transform. The refinement is obtained, among others, by mapping of DNA symbols to pure quaternions. This mapping results in an enhanced, symbol-balanced sensitivity of the transform to DNA patterns, and an unambiguous threshold selection criterion. Computational efficiency of the transform is further improved, and coupling of the computation with the period value is removed, thereby facilitating parallel implementation of the algorithm. Additionally, a post-processing stage is inserted into the algorithm, enabling unambiguous display of results in a convenient graphical format. Comparison of the quaternionic periodicity transform with two well-known pattern detection techniques shows that the new approach is competitive with these two techniques in detection of exact and approximate repeats.
引用
收藏
页码:694 / 700
页数:7
相关论文
共 27 条
[1]   Genomic signal processing [J].
Anastassiou, D .
IEEE SIGNAL PROCESSING MAGAZINE, 2001, 18 (04) :8-20
[2]  
[Anonymous], MATH METHODS DNA SEQ
[3]  
[Anonymous], [No title captured]
[4]  
[Anonymous], FORENSIC DNA TYPING
[5]   What can we learn with wavelets about DNA sequences? [J].
Arneodo, A ;
D'Aubenton-Carafa, Y ;
Audit, B ;
Bacry, E ;
Muzy, JF ;
Thermes, C .
PHYSICA A, 1998, 249 (1-4) :439-448
[6]  
BASSO K, STAT SIGNIFICANCE PA
[7]   Tandem repeats finder: a program to analyze DNA sequences [J].
Benson, G .
NUCLEIC ACIDS RESEARCH, 1999, 27 (02) :573-580
[8]   Extrapolation of band-limited signals and the finite Zak transform [J].
Brodzik, AK ;
Tolimieri, R .
SIGNAL PROCESSING, 2000, 80 (03) :413-423
[9]   Location of a major susceptibility locus for familiar schizophrenia on chromosome 1q21-q22 [J].
Brzustowicz, LM ;
Hodgkinson, KA ;
Chow, EWC ;
Honer, WG ;
Bassett, AS .
SCIENCE, 2000, 288 (5466) :678-682
[10]   Detection and visualization of tandem repeats in DNA sequences [J].
Buchner, M ;
Janjarasjitt, S .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2003, 51 (09) :2280-2287