A hyper-heuristic approach to sequencing by hybridization of DNA sequences

被引:12
作者
Blazewicz, Jacek [1 ,2 ]
Burke, Edmund K. [3 ]
Kendall, Graham [3 ]
Mruczkiewicz, Wojciech [1 ]
Oguz, Ceyda [4 ]
Swiercz, Aleksandra [1 ,2 ]
机构
[1] Poznan Univ Tech, Inst Comp Sci, Poznan, Poland
[2] Polish Acad Sci, Inst Bioorgan Chem, Poznan, Poland
[3] Univ Nottingham, Sch Comp Sci, Nottingham NG7 2RD, England
[4] Koc Univ, Dept Ind Engn, Istanbul, Turkey
关键词
Hyper-heuristics; Simulated annealing; Tabu search; Choice function; Sequencing by hybridization; TABU-SEARCH; GENETIC ALGORITHM;
D O I
10.1007/s10479-011-0927-y
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
In this paper we investigate the use of hyper-heuristic methodologies for predicting DNA sequences. In particular, we utilize Sequencing by Hybridization. We believe that this is the first time that hyper-heuristics have been investigated in this domain. A hyper-heuristic is provided with a set of low-level heuristics and the aim is to decide which heuristic to call at each decision point. We investigate three types of hyper-heuristics. Two of these (simulated annealing and tabu search) draw their inspiration from meta-heuristics. The choice function hyper-heuristic draws its inspiration from reinforcement learning. We utilize two independent sets of low-level heuristics. The first set is based on a previous tabu search method, with the second set being a significant extension to this basic set, including utilizing a different representation and introducing the definition of clusters. The datasets we use comprises two randomly generated datasets and also a publicly available biological dataset. In total, we carried out experiments using 70 different combinations of heuristics, using the three datasets mentioned above and investigating six different hyper-heuristic algorithms. Our results demonstrate the effectiveness of a hyper-heuristic approach to this problem domain. It is necessary to provide a good set of low-level heuristics, which are able to both intensify and diversify the search but this approach has demonstrated very encouraging results on this extremely difficult and important problem domain.
引用
收藏
页码:27 / 41
页数:15
相关论文
共 34 条
[1]  
[Anonymous], 2005, Search Methodologies: Introductory Tutorials in Optimization and Decision Support Techniques, DOI DOI 10.1007/0-387-28356-0_7
[2]  
[Anonymous], 2005, SEARCH METHODOLOGIES: Introductory Tutorials in Optimization and Decision Support Techniques, DOI DOI 10.1007/0-387-28356-0_6
[3]  
[Anonymous], 2002, AS PAC C SIM EV LEAR
[4]  
Ayob M., 2003, Proceedings of the International Conference on Intelligent Technologies, V3, P132
[5]  
Bai R., 2005, Metaheuristics: Progress as Real Problem Solvers, P87, DOI DOI 10.1007/0-387-25383-1_4
[6]  
BAI R., 2007, A simulated annealing hyper-heuristic methodology for fexible decision support
[7]  
Blazewicz J, 2004, INFORMS J COMPUT, V16, P232, DOI 10.1287/ijoc.1030.0049
[8]   Hybrid genetic algorithm for DNA sequencing with errors [J].
Blazewicz, J ;
Kasprzak, M ;
Kuroczycki, W .
JOURNAL OF HEURISTICS, 2002, 8 (05) :495-502
[9]   A heuristic managing errors for DNA sequencing [J].
Blazewicz, J ;
Formanowicz, P ;
Guinand, F ;
Kasprzak, M .
BIOINFORMATICS, 2002, 18 (05) :652-660
[10]   Complexity of DNA sequencing by hybridization [J].
Blazewicz, J ;
Kasprzak, M .
THEORETICAL COMPUTER SCIENCE, 2003, 290 (03) :1459-1473