Hidden Markov Model Optimized by PSO Algorithm for Gene Sequence Clustering

被引:1
作者
Soruri, Mohammad [1 ,2 ]
Sadri, Javad [3 ]
Zahiri, S. Hamid [4 ]
机构
[1] Univ Birjand, Ferdows Fac Engn, Birjand, Iran
[2] Univ Birjand, Ferdows Fac Engn, Ferdows, South Khorasan, Iran
[3] McGill Univ, McGill Ctr Bioinformat, Montreal, PQ, Canada
[4] Univ Birjand, Dept Elect & Comp Engn, Birjand, South Khorasan, Iran
来源
PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, DATA AND CLOUD COMPUTING (ICC 2017) | 2017年
关键词
Hidden Markov Model (HMM); Particle Swarm Optimization (PSO); Gene Clustering; Sequence Modeling; Distance measure;
D O I
10.1145/3018896.3025147
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gene sequence modeling and clustering is one of the most important problems in bioinformatics. Hidden Markov Models (HMMs) have been widely used to find similarity between sequences with large and various lengths. In this paper a novel gene sequence clustering method based on HMMs optimized by Particle Swarm Optimization (PSO) algorithm is introduced. In this approach, each gene sequence is described by a specific HMM, and then its probability to generate individual sequence is evaluated for each model. A hierarchical clustering algorithm based on a new definition of a distance measure, has been applied to find the best clusters. Experiments carried out on lung cancer related genes dataset show that the proposed approach can be successfully utilized for gene clustering.
引用
收藏
页数:6
相关论文
共 18 条
[11]   CONTEXT-DEPENDENT PHONETIC HIDDEN MARKOV-MODELS FOR SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION [J].
LEE, KF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (04) :599-609
[12]  
Li C., 2000, INT C MACHINE LEARNI, P543
[13]  
Panuccio A., 2002, Structural, Syntactic, and Statistical Pattern Recognition. Joint IAPR International Workshops SSPR 2002 and SPR 2002 (Lecture Notes in Computer Science Vol. 2396), P734
[14]  
Rabiner L. R., 1989, ICASSP-89: 1989 International Conference on Acoustics, Speech and Signal Processing (IEEE Cat. No.89CH2673-2), P405, DOI 10.1109/ICASSP.1989.266451
[15]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286
[16]   Gene Clustering via Integrated Markov Models Combining Individual and Pairwise Features [J].
Vignes, Matthieu ;
Forbes, Florence .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2009, 6 (02) :260-270
[17]  
Xue LP, 2006, INT CONF SIGN PROCES, P791
[18]   Binary matrix factorization for analyzing gene expression data [J].
Zhang, Zhong-Yuan ;
Li, Tao ;
Ding, Chris ;
Ren, Xian-Wen ;
Zhang, Xiang-Sun .
DATA MINING AND KNOWLEDGE DISCOVERY, 2010, 20 (01) :28-52