A Modified Multiple Alignment Fast Fourier Transform with Higher Efficiency

被引:4
|
作者
Zheng, Weihua [1 ,2 ]
Li, Kenli [1 ]
Li, Keqin [1 ]
So, Hing Cheung [3 ]
机构
[1] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China
[2] Natl Univ Def Technol, CIC HPC, Changsha 410073, Hunan, Peoples R China
[3] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金; 对外科技合作项目(国际科技项目);
关键词
Convolution; fast Fourier transform (FFT); MAFFT; multiple sequence alignment (MSA); SPLIT-RADIX FFT; SEQUENCE ALIGNMENT; PROTEIN SEQUENCES; PERFORMANCE; IMPROVEMENT; ALGORITHM; ACCURACY; MAFFT;
D O I
10.1109/TCBB.2016.2530064
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Multiple sequence alignment (MSA) is the most common task in bioinformatics. Multiple alignment fast Fourier transform (MAFFT) is the fastest MSA program among those the accuracy of the resulting alignments can be comparable with the most accurate MSA programs. In this paper, we modify the correlation computation scheme of the MAFFT for further efficiency improvement in three aspects. First, novel complex number based amino acid and nucleotide expressions are utilized in the modified correlation. Second, linear convolution with a limitation is proposed for computing the correlation of amino acid and nucleotide sequences. Third, we devise a fast Fourier transform (FFT) algorithm for computing linear convolution. The FFT algorithm is based on conjugate pair split-radix FFT and does not require the permutation of order, and it is new as only real parts of the final outputs are required. Simulation results show that the speed of the modified scheme is 107.58 to 365.74 percent faster than that of the original MAFFT for one execution of the function Falign() of MAFFT, indicating its faster realization.
引用
收藏
页码:634 / 645
页数:12
相关论文
共 50 条