On gene prediction by cross-species comparative sequence analysis

被引:0
作者
Chen, R [1 ]
Ali, H [1 ]
机构
[1] Univ Nebraska, Coll Informat Sci & Technol, Dept Comp Sci, Omaha, NE 68182 USA
来源
PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE | 2003年
关键词
D O I
10.1109/CSB.2003.1227366
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Sequencing of large fragments of genomic DNA makes it possible to perform comparisons of genomic sequences for identification of protein-coding regions. We have conducted a comparative analysis of homologous genomic sequences of organisms with different evolutionary distances and determined the degree of conservation of the non-coding regions between closely related organisms. In contrast, more distance shows much less intron similarity but less conservation on the exon structures. Based on this finding and training of data sets, we proposed a model by which coding sequences could be identified by comparing sequences of multiple species, both close and approximately distant. The reliability of the proposed method is evaluated in terms of sensitivity and specificity, and results are compared to those obtained by other popular gene prediction programs. Provided sequences can be found from other species at appropriate evolutionary distances, this approach could be applied in newly sequenced organisms where no species-dependent statistical models are available.
引用
收藏
页码:446 / 447
页数:2
相关论文
共 9 条
[1]  
Bafna V, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P3
[2]   Human and mouse gene structure: Comparative analysis and application to exon prediction [J].
Batzoglou, S ;
Pachter, L ;
Mesirov, JP ;
Berger, B ;
Lander, ES .
GENOME RESEARCH, 2000, 10 (07) :950-958
[3]   Prediction of complete gene structures in human genomic DNA [J].
Burge, C ;
Karlin, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) :78-94
[4]   Evaluation of gene structure prediction programs [J].
Burset, M ;
Guigo, R .
GENOMICS, 1996, 34 (03) :353-367
[5]   Computational methods for the identification of genes in vertebrate genomic sequences [J].
Claverie, JM .
HUMAN MOLECULAR GENETICS, 1997, 6 (10) :1735-1744
[6]   An assessment of gene prediction accuracy in large DNA sequences [J].
Guigó, R ;
Agarwal, P ;
Abril, JF ;
Burset, M ;
Fickett, JW .
GENOME RESEARCH, 2000, 10 (10) :1631-1642
[7]   Comparison of genomic DNA sequences: solved and unsolved problems [J].
Miller, W .
BIOINFORMATICS, 2001, 17 (05) :391-397
[8]  
MORGENSTERN B, 2001, HUM GEN M 2001 ED PR, P146
[9]   Gene recognition in eukaryotic DNA by comparison of genomic sequences [J].
Novichkov, PS ;
Gelfand, MS ;
Mironov, AA .
BIOINFORMATICS, 2001, 17 (11) :1011-1018