Efficient automatic exact motif discovery algorithms for biological sequences

被引:9
作者
Karci, Ali [1 ]
机构
[1] Firat Univ, Fac Engn, Dept Comp Engn, TR-23119 Elazig, Turkey
关键词
Motif discovery; Computational biology; DNA; Biological sequences; Algorithms; Bioinformatics;
D O I
10.1016/j.eswa.2008.10.087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objective: This paper presents an algorithm for the solution of the motif discovery problem (MDP). Methods and materials: Motif discovery problem can be considered in two cases: motifs with insertions/deletions, and motifs without insertions/deletions. The first group motifs can be found by stochastic and approximated methods. The second group can be found by using stochastic and approximated methods, but also deterministic method. We proved that the second group motifs can be found with a deterministic algorithm, and so, it can be said that the second motifs finding is a P-type problem as proved in this paper. Results and conclusions: An algorithm was proposed in this paper for motif discovery problem. The proposed algorithm finds all motifs which are occurred in the sequence at least two times, and it also finds motifs of various sizes. Due to this case, this algorithm is regarded as Automatic Exact Motif Discovery Algorithm. All motifs of different sizes can be found with this algorithm, and this case was proven in this paper. It shown that automatic exact motif discovery is a P-type problem in this paper. The application of the proposed algorithm has been shown that this algorithm is superior to MEME, MEME3, Motif Sampler, WEEDER, CONSENSUS, AlignACE. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:7952 / 7963
页数:12
相关论文
共 17 条
[1]   Motif discovery by monotone scores [J].
Apostolico, Alberto ;
Pizzi, Cinzia .
DISCRETE APPLIED MATHEMATICS, 2007, 155 (6-7) :695-706
[2]  
BAILEY TL, 1995, MACH LEARN, V21, P51, DOI 10.1007/BF00993379
[3]   An upper bound on the hardness of exact matrix based motif discovery [J].
Horton, Paul ;
Fujibuchi, Wataru .
JOURNAL OF DISCRETE ALGORITHMS, 2007, 5 (04) :706-713
[4]  
Hu J., 2006, NUCLEIC ACIDS RES, V33, P4899
[5]   Bridge and brick network motifs: Identifying significant building blocks from complex biological systems [J].
Huang, Chung-Yuan ;
Cheng, Chia-Ying ;
Sun, Chuen-Tsai .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2007, 41 (02) :117-127
[6]   An algorithm for searching RNA motifs in genomic sequences [J].
Liu, Jingping ;
Ma, Bin ;
Zhang, Kaizhong .
BIOMOLECULAR ENGINEERING, 2007, 24 (03) :343-350
[7]  
Liu X, 2001, Pac Symp Biocomput, P127
[8]   An algorithm for finding protein-DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments [J].
Liu, XS ;
Brutlag, DL ;
Liu, JS .
NATURE BIOTECHNOLOGY, 2002, 20 (08) :835-839
[9]   Self-organizing neural networks to support the discovery of DNA-binding motifs [J].
Mahony, Shaun ;
Benos, Panayiotis V. ;
Smith, Terry J. ;
Golden, Aaron .
NEURAL NETWORKS, 2006, 19 (6-7) :950-962
[10]   Shuffling biological sequences with motif constraints [J].
Riviere, Romain ;
Barth, Dominique ;
Cohen, Johanne ;
Denise, Alain .
JOURNAL OF DISCRETE ALGORITHMS, 2008, 6 (02) :192-204