Hidden Markov Model Optimized by PSO Algorithm for Gene Sequence Clustering

被引:1
作者
Soruri, Mohammad [1 ,2 ]
Sadri, Javad [3 ]
Zahiri, S. Hamid [4 ]
机构
[1] Univ Birjand, Ferdows Fac Engn, Birjand, Iran
[2] Univ Birjand, Ferdows Fac Engn, Ferdows, South Khorasan, Iran
[3] McGill Univ, McGill Ctr Bioinformat, Montreal, PQ, Canada
[4] Univ Birjand, Dept Elect & Comp Engn, Birjand, South Khorasan, Iran
来源
PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, DATA AND CLOUD COMPUTING (ICC 2017) | 2017年
关键词
Hidden Markov Model (HMM); Particle Swarm Optimization (PSO); Gene Clustering; Sequence Modeling; Distance measure;
D O I
10.1145/3018896.3025147
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gene sequence modeling and clustering is one of the most important problems in bioinformatics. Hidden Markov Models (HMMs) have been widely used to find similarity between sequences with large and various lengths. In this paper a novel gene sequence clustering method based on HMMs optimized by Particle Swarm Optimization (PSO) algorithm is introduced. In this approach, each gene sequence is described by a specific HMM, and then its probability to generate individual sequence is evaluated for each model. A hierarchical clustering algorithm based on a new definition of a distance measure, has been applied to find the best clusters. Experiments carried out on lung cancer related genes dataset show that the proposed approach can be successfully utilized for gene clustering.
引用
收藏
页数:6
相关论文
共 18 条
[1]  
Al-Hajj R, 2007, PROC INT CONF DOC, P959
[2]  
Alveraz-Valin F., 2015, PATTERN ANAL APPL, P1
[3]  
Angeline P. J., 1998, Evolutionary Programming VII. 7th International Conference, EP98. Proceedings, P601, DOI 10.1007/BFb0040811
[4]   Gene Clustering Using Metaheuristic Optimization Algorithms [J].
Banu, P. K. Nizar ;
Andrews, S. .
INTERNATIONAL JOURNAL OF APPLIED METAHEURISTIC COMPUTING, 2015, 6 (04) :14-38
[5]  
Bicego M, 2004, PATTERN RECOGN, V37, P2281, DOI [10.1016/S0031-3203(04)00162-1, 10.1016/j.patcog.2004.04.005]
[6]  
Durbin R., 1998, BIOL SEQUENCE ANAL P
[7]  
Ferles C, 2008, IEEE INT C BIOINF BI, P430
[8]  
Goldber D. E., 1988, Machine Learning, V3, P95, DOI 10.1023/A:1022602019183
[9]  
Kennedy J., 1995, 1995 IEEE International Conference on Neural Networks Proceedings (Cat. No.95CH35828), P1942, DOI 10.1109/ICNN.1995.488968
[10]  
KROGH A, 1993, UCSCCRL9332