A novel technique for analyzing the similarity and dissimilarity of DNA sequences

被引:2
作者
Liu, Y. W. [1 ]
Peng, Y. [2 ]
机构
[1] Hunan Agr Univ, Coll Sci, Changsha, Hunan, Peoples R China
[2] Hunan Agr Univ, Key Lab Crop Germplasm Innovat & Utilizat Hunan P, Changsha, Hunan, Peoples R China
关键词
Graphical representation; Sequence comparison; Invariant; Average and variance; DNA sequence; 2D GRAPHICAL REPRESENTATION;
D O I
10.4238/2014.January.28.2
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
l(i,j) denotes the distance between the point (x(i), y(i)) and the point (x(j), y(i)) in graphical representation. By classifying l(i,j), i, j = 1, 2, ... , N according to the number of points between (x(i), y(i)) and (x(j), y(i)), N - 1 types are obtained. The average and variance of every type are assembled by the novel invariant v = (a(1), d(1), a(2), d(2), ... , a(N), d(N)). Compared with the traditional invariants, the leading eigenvalue, the max-min (eigenvalue), the leading eigenvalue/N, the average matrix element, and the average row sum, this strategy complies with the rule of using the average, extracts more information about biological sequences, and reduces the amounts of computation. It is superior to the traditional invariants in predicting similarity and dissimilarity among different species.
引用
收藏
页码:570 / 577
页数:8
相关论文
共 17 条
  • [1] Novel 4D numerical representation of DNA sequences
    Chi, R
    Ding, KQ
    [J]. CHEMICAL PHYSICS LETTERS, 2005, 407 (1-3) : 63 - 67
  • [2] A new method to analyze the similarity of the DNA sequences
    Guo, Ying
    Wang, Tian-Ming
    [J]. JOURNAL OF MOLECULAR STRUCTURE-THEOCHEM, 2008, 853 (1-3): : 62 - 67
  • [3] He P.-an, 2002, INTERNET ELECT J MOL, V1, P668
  • [4] He PA, 2011, MATCH-COMMUN MATH CO, V65, P445
  • [5] Similarity studies of DNA sequences based on a new 2D graphical representation
    Huang, Guohua
    Liao, Bo
    Li, Yongfan
    Yu, Yougui
    [J]. BIOPHYSICAL CHEMISTRY, 2009, 143 (1-2) : 55 - 59
  • [6] Li C, 2003, COMB CHEM HIGH T SCR, V6, P795
  • [7] New 2D graphical representation of DNA sequences
    Liao, B
    Wang, TM
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2004, 25 (11) : 1364 - 1368
  • [8] 3-D graphical representation of DNA sequences and their numerical characterization
    Liao, B
    Wang, TM
    [J]. JOURNAL OF MOLECULAR STRUCTURE-THEOCHEM, 2004, 681 (1-3): : 209 - 212
  • [9] A 2D graphical representation of DNA sequence
    Liao, B
    [J]. CHEMICAL PHYSICS LETTERS, 2005, 401 (1-3) : 196 - 199
  • [10] Analysis of similarity/dis similarity of DNA sequences based on 3-D graphical representation
    Liao, B
    Wang, TM
    [J]. CHEMICAL PHYSICS LETTERS, 2004, 388 (1-3) : 195 - 200