The Prediction of Human Genes in DNA Based on a Generalized Hidden Markov Model

被引:1
作者
Guo, Rui [1 ]
Yan, Ke [1 ]
He, Wei [1 ]
Zhang, Jian [1 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, Biocomp Res Ctr, Shenzhen, Peoples R China
来源
BIOMETRIC RECOGNITION | 2016年 / 9967卷
关键词
Gene prediction; WWAM; IMM; GHMM; The prefix sum arrays; The method based on similarity weighting of sequence patterns;
D O I
10.1007/978-3-319-46654-5_82
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Generalized Hidden Markov Model (GHMM) has been proved to be an excellently general probabilistic model of the gene structure of human genomic sequences. It can simultaneously incorporate different signal descriptions like splicing sites and content descriptions, for instance, compositional features of exons and introns. Enjoying its flexibility and convincing probabilistic underpinnings, we integrate some other modification of submodels and then implement a prediction program of Human Genes in DNA. The program has the capacity to predict multiple genes in a sequence, to deal with partial as well as complete genes, and to predict consistent sets of genes occurring on either or both DNA strands. More importantly, it also can perform well for longer sequences with an unknown number of genes in them. In the experiments, the results show that the proposed method has better performance in prediction accuracy than some existing methods, and over 70 % of exons can be identified exactly.
引用
收藏
页码:747 / 755
页数:9
相关论文
empty
未找到相关数据