Genomic Signal Processing Methods for Computation of Alignment-Free Distances from DNA Sequences

被引:23
作者
Borrayo, Ernesto [1 ]
Gerardo Mendizabal-Ruiz, E. [1 ]
Velez-Perez, Hugo [1 ]
Romo-Vazquez, Rebeca [1 ]
Mendizabal, Adriana P. [2 ]
Alejandro Morales, J. [1 ,3 ]
机构
[1] Univ Guadalajara, CUCEI, Dept Comp Sci, Guadalajara 44430, Jalisco, Mexico
[2] Univ Guadalajara, CUCEI, Mol Biol Lab, Farmacobiol Dept, Guadalajara 44430, Jalisco, Mexico
[3] Univ Guadalajara, Ctr Theoret Res & High Performance Comp, CUCEI, Guadalajara 44430, Jalisco, Mexico
来源
PLOS ONE | 2014年 / 9卷 / 11期
关键词
SIMILARITY; IDENTIFICATION; PREDICTION; SEARCH;
D O I
10.1371/journal.pone.0110954
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genomic signal processing (GSP) refers to the use of digital signal processing (DSP) tools for analyzing genomic data such as DNA sequences. A possible application of GSP that has not been fully explored is the computation of the distance between a pair of sequences. In this work we present GAFD, a novel GSP alignment-free distance computation method. We introduce a DNA sequence-to-signal mapping function based on the employment of doublet values, which increases the number of possible amplitude values for the generated signal. Additionally, we explore the use of three DSP distance metrics as descriptors for categorizing DNA signal fragments. Our results indicate the feasibility of employing GAFD for computing sequence distances and the use of descriptors for characterizing DNA fragments.
引用
收藏
页数:13
相关论文
共 53 条
[1]  
Akhtar M, 2007, IEEE INT WORK GENOM, P54
[2]   Signal Processing in Sequence Analysis: Advances in Eukaryotic Gene Prediction [J].
Akhtar, Mahmood ;
Epps, Julien ;
Ambikairajah, Eliathamby .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2008, 2 (03) :310-321
[3]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[4]   Genomic signal processing [J].
Anastassiou, D .
IEEE SIGNAL PROCESSING MAGAZINE, 2001, 18 (04) :8-20
[5]   Frequency-domain analysis of biomolecular sequences [J].
Anastassiou, D .
BIOINFORMATICS, 2000, 16 (12) :1073-1081
[6]  
[Anonymous], [No title captured]
[7]  
[Anonymous], 2006 IEEE INT C AC S
[8]  
[Anonymous], 2012, IACSIT COIMBATORE C
[9]   Wavelet methods in statistics: Some recent developments and their applications [J].
Antoniadis, Anestis .
STATISTICS SURVEYS, 2007, 1 :16-55
[10]  
Antoniadis Anestis, 2001, J. Stat. Softw., V6, P1