MULTIFRACTAL ANALYSIS AND FEATURE EXTRACTION OF DNA SEQUENCES

被引:4
作者
Kinsner, Witold [1 ,2 ]
Zhang, Hong [1 ,2 ]
机构
[1] Univ Manitoba, Signal & Data Compress Lab, Dept Elect & Comp Engn, Winnipeg, MB R3T 5V6, Canada
[2] Inst Ind Math Sci, TRLabs, Winnipeg, MB R3T 5V6, Canada
来源
PROCEEDINGS OF THE 8TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS | 2009年
基金
加拿大自然科学与工程研究理事会;
关键词
DNA sequences; multifractal analysis; feature extraction for classification; LONG-RANGE CORRELATIONS; FRACTAL CORRELATIONS; CODING REGIONS; GENOMIC DNA; PREDICTION; EXONS;
D O I
10.1109/COGINF.2009.5250696
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents feature extraction and estimations of multifractal measures for deoxyribonucleic acid (DNA) sequences, and demonstrates the intriguing possibilitiy of identifying biological functionality using information contained within the DNA sequence. We have developed a technique that seeks patterns or correlations in the DNA sequence at a higher level. The technique has three main steps: (i) transforms the DNA sequence symbols into a modified Levy walk, (ii) transforms the Levy walk a signal spectrum, and (iii) break the spectrum into subspectra and treats each of these as an attractor from which the multifractal dimension spectrum is c estimated. An optimal minimum window size and volume element size are found for estimation of the multifractal measures. Experimental results show that DNA is a multifractal, and that the multifractality), changes depending upon the location (coding or noncoding region) in the sequence.
引用
收藏
页码:29 / +
页数:3
相关论文
共 43 条
[1]  
[Anonymous], 1997, FRACTALS CHAOS GEOLO, DOI DOI 10.1017/CBO9781139174695
[2]  
[Anonymous], 1977, FRACTAL GEOMETRY NAT
[3]  
ARNEODO A, 2002, SCI DISASTERS CLIMAT, V453, P27
[4]   Long-range correlations in genomic DNA: A signature of the nucleosomal structure [J].
Audit, B ;
Thermes, C ;
Vaillant, C ;
d'Aubenton-Carafa, Y ;
Muzy, JF ;
Ameodo, A .
PHYSICAL REVIEW LETTERS, 2001, 86 (11) :2471-2474
[5]   LOCAL SELF-SIMILARITY OF SEQUENCE IN MAMMALIAN NUCLEAR-DNA IS MODULATED BY A 180-BP PERIODICITY [J].
BAINS, W .
JOURNAL OF THEORETICAL BIOLOGY, 1993, 161 (02) :137-143
[6]   Compositional segmentation and long-range fractal correlations in DNA sequences [J].
BernaolaGalvan, P ;
RomanRoldan, R ;
Oliver, JL .
PHYSICAL REVIEW E, 1996, 53 (05) :5181-5189
[7]   FRACTALITY OF DNA TEXTS [J].
BOROVK, AS ;
FRANKKAMENETSKII, MD ;
GROSBERG, AY .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1994, 12 (03) :655-669
[8]   Evaluation of gene structure prediction programs [J].
Burset, M ;
Guigo, R .
GENOMICS, 1996, 34 (03) :353-367
[9]  
EBELING W, 2002, SCI DISASTERS CLIMAT, V453, P2
[10]  
Gibas C., 2001, BIOINFORMATICS COMPU