Power spectrum and dynamic time warping for DNA sequences classification

被引:3
作者
Dakhli, Abdesselem [1 ]
Ben Amar, Chokri [2 ]
机构
[1] Hail Univ, Community Coll, Hail, Saudi Arabia
[2] Univ Sfax, Natl Engn Sch Sfax ENIS, REGIM Res Grp Intelligent Machines, Sfax 3038, Tunisia
关键词
DNA sequences; Power spectrum; Dynamic time warping; Binary; Pairwise comparison; Discrete Fourier transform;
D O I
10.1007/s12530-019-09306-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Similarity and alignment and are often used to classify DNA sequences. We have developed a new classifier to classify DNA sequence. First, our approach is used to extract the features of DNA strands. Second, the goal of our approach is to classify DNA strands according to the similarity elaborated by the alignment. Frequently, the performance of the classification of DNA sequences depends on the method that allows to extract the characteristics and calculation of the genomic similarity. Particularly, our approach consists of three different methods for improving the classification of the DNA sequences. This paper presents a new approach of classification of DNA sequence based on dynamic time warping (DTW) method. First, the binary indicator is used to code each nucleotide and the power spectrum is used to extract the characteristics. Secondly, the DNA sequence similarity matrix is evaluated by the dynamic temporal Warping. Third, pairwise comparison is used to classify DNA strands. Our approach solves the complex problem of presentation and structure of different groups of organisms. The experimental results of our classifier obtained are compared with other approaches based on the alignment and similarity of the DNA sequences. These results showed that our approach outperformed other approaches in terms of classification and running time. Here is a summary of the main contributions of this article: (1) Convert nucleotides from DNA sequences by applying binary coding. (2) Using power spectrum our approach extracts the characteristics of DNA sequences. (3) Elaborate the similarity matrix of the DNA strand signal by the Dynamic Time Warping method. (4) Use pairwise comparison to classify DNA sequences. The approach developed is efficient to solve the problems of classification of DNA sequences.
引用
收藏
页码:637 / 646
页数:10
相关论文
共 18 条
[1]   Scalable metagenomic taxonomy classification using a reference genome database [J].
Ames, Sasha K. ;
Hysom, David A. ;
Gardner, Shea N. ;
Lloyd, G. Scott ;
Gokhale, Maya B. ;
Allen, Jonathan E. .
BIOINFORMATICS, 2013, 29 (18) :2253-2260
[2]   A comparative evaluation of sequence classification programs [J].
Bazinet, Adam L. ;
Cummings, Michael P. .
BMC BIOINFORMATICS, 2012, 13
[3]   Wavelet Neural Networks for DNA Sequence Classification Using the Genetic Algorithms and the Least Trimmed Square [J].
Dakhli, Abdesselem ;
Bellil, Wajdi ;
Ben Amar, Chokri .
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS: PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE KES-2016, 2016, 96 :427-436
[4]   Deep learning models for bacteria taxonomic classification of metagenomic data [J].
Fiannaca, Antonino ;
La Paglia, Laura ;
La Rosa, Massimo ;
Lo Bosco, Giosue' ;
Renda, Giovanni ;
Rizzo, Riccardo ;
Gaglio, Salvatore ;
Urso, Alfonso .
BMC BIOINFORMATICS, 2018, 19
[5]   CoMeta: Classification of Metagenomes Using k-mers [J].
Kawulok, Jolanta ;
Deorowicz, Sebastian .
PLOS ONE, 2015, 10 (04)
[6]  
Kwan, 2012, INT C BIOSC BIOCH BI, V31, P1
[7]   Chromosome classification using dynamic time warping [J].
Legrand, Benoit ;
Chang, C. S. ;
Ong, S. H. ;
Neo, Soek-Ying ;
Palanisamy, Nallasivarn .
PATTERN RECOGNITION LETTERS, 2008, 29 (03) :215-222
[8]  
Lei Ch, 2012, ZOOL RES, V33, P463
[9]  
Liu B, 2010, IEEE INT C BIOINFORM, P95, DOI 10.1109/BIBM.2010.5706544
[10]  
Lo Bosco G, 2017, LECT NOTES COMPUTER