Visual representation of DNA sequences for exon detection using non-parametric spectral estimation techniques

Cited by: 5
Authors
Dessouky, Ahmed M. [1]
Taha, Taha E. [2]
Dessouky, Mohamed M. [3]
Eltholth, Ashraf A. [4]
Hassan, Emadeldeen [2, 5]
Abd El-Samie, Fathi E. [2]
Affiliations
[1] Al Alson Acad, Cairo, Egypt
[2] Menoufia Univ, Fac Elect Engn, Dept Elect & Elect Commun Engn, Menoufia, Egypt
[3] Menoufia Univ, Fac Elect Engn, Dept Comp Sci & Engn, Menoufia, Egypt
[4] Natl Telecommun Inst, Cairo, Egypt
[5] Umea Univ, Dept Comp Sci, Umea, Sweden
Keywords
DNA; signal modeling; non-parametric spectral estimation; bioinformatics
DOI
10.1080/15257770.2018.1536270
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
This paper presents a new approach for modeling DNA sequences for the purpose of exon detection. The proposed model adopts the sum-of-sinusoids concept for the representation of DNA sequences. The objective of the modeling process is to represent the DNA sequence with few coefficients. The modeling process can be performed on the DNA signal as a whole or on a segment-by-segment basis. The created models can be used instead of the original sequences in a further spectral estimation process for exon detection. The accuracy of modeling is evaluated using the Root Mean Square Error (RMSE) and R-square metrics. In addition, non-parametric spectral estimation methods are used for estimating the spectra of both original and modeled DNA sequences. The results of exon detection based on original and modeled DNA sequences coincide to a great extent, which supports the success of the proposed sum-of-sinusoids method for modeling DNA sequences.
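The workflow summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several choices are assumptions made only for illustration: the sequence is synthetic (random bases with an artificial period-3 bias), the numerical mapping is a binary indicator sequence for the base G, the sum-of-sinusoids model is obtained by keeping the strongest DFT components rather than by whatever fitting procedure the paper uses, and the plain periodogram stands in for the non-parametric spectral estimators. All function names are hypothetical.

```python
import numpy as np


def indicator(seq, base):
    """Binary indicator sequence: 1 where seq has `base`, else 0 (assumed mapping)."""
    return np.array([1.0 if ch == base else 0.0 for ch in seq])


def sum_of_sinusoids_model(x, n_terms):
    """Approximate x by its mean plus the n_terms strongest DFT sinusoids.

    This is a stand-in for a sum-of-sinusoids fit, not the paper's procedure.
    """
    n = len(x)
    spectrum = np.fft.rfft(x)
    mags = np.abs(spectrum)
    mags[0] = 0.0  # the DC term (mean) is always kept separately
    keep = np.argsort(mags)[-n_terms:]
    reduced = np.zeros_like(spectrum)
    reduced[0] = spectrum[0]
    reduced[keep] = spectrum[keep]
    return np.fft.irfft(reduced, n=n)


def rmse(x, x_hat):
    """Root Mean Square Error between a signal and its model."""
    return float(np.sqrt(np.mean((x - x_hat) ** 2)))


def r_square(x, x_hat):
    """Coefficient of determination (R-square) of the model."""
    ss_res = np.sum((x - x_hat) ** 2)
    ss_tot = np.sum((x - np.mean(x)) ** 2)
    return float(1.0 - ss_res / ss_tot)


def periodogram(x):
    """Plain periodogram of the mean-removed signal (non-parametric estimate)."""
    return np.abs(np.fft.rfft(x - np.mean(x))) ** 2 / len(x)


# Synthetic sequence (illustrative only): random bases, with G favoured at
# every third position to mimic the period-3 property of coding regions.
rng = np.random.default_rng(0)
n = 300
bases = rng.choice(list("ACGT"), size=n)
for i in range(0, n, 3):
    if rng.random() < 0.7:
        bases[i] = "G"

x = indicator(bases, "G")
x_hat = sum_of_sinusoids_model(x, n_terms=10)

peak_original = int(np.argmax(periodogram(x)))
peak_model = int(np.argmax(periodogram(x_hat)))

print("RMSE:", rmse(x, x_hat))
print("R-square:", r_square(x, x_hat))
print("Peak bins (original, model):", peak_original, peak_model)
print("Period-3 bin:", n // 3)
```

In this sketch the period-3 component appears at DFT bin N/3, so comparing the peak bin of the original and modeled periodograms gives a rough check that the reduced model preserves the feature used for exon detection. Increasing `n_terms` trades a larger coefficient count for a lower RMSE and a higher R-square.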
Pages: 321-337
Number of pages: 17