An Efficient-Assembler Whale Optimization Algorithm for DNA Fragment Assembly Problem: Analysis and Validations

被引:18
作者
Abdel-Basset, Mohamed [1 ]
Mohamed, Reda [1 ]
Sallam, Karam M. [1 ]
Chakrabortty, Ripon K. [2 ]
Ryan, Michael J. [2 ]
机构
[1] Zagazig Univ, Fac Comp & Informat, Zagazig 44519, Egypt
[2] UNSW Canberra Australian Def Force Acad, Sch Engn & Informat Technol, Capabil Syst Ctr, Campbell, ACT 2612, Australia
关键词
DNA; Optimization; Standards; Whales; Search problems; Genetic algorithms; Heuristic algorithms; DNA sequence; DNA fragments assembly problem; overlap-layout-consensus; whale optimization algorithm; LOCAL SEARCH ALGORITHM;
D O I
10.1109/ACCESS.2020.3044857
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The study of deoxyribonucleic acid (DNA) is crucial in many fields, including medicine, biology, zoology, agriculture, and forensics. Since reading a DNA sequence is onerous because of its massive length, it is common in many DNA analysis applications to divide DNA strands into small segments or fragments which, after analysis, must be reassembled. Since this reassembly takes a non-specific polynomial time to solve, the DNA fragment assembly problem (DFAP) is NP-hard. This paper proposes a new assembler for tackling the DFAP based on the overlap-layout-consensus (OLC) approach. The proposed assembler adapts a discrete whale optimization algorithm (DWOA) using standard operators adopted from evolutionary algorithms to simulate the strategy adopted by humpback whales when searching for prey. For the first time, we formulate the behaviors of whales to be applied directly to any discrete optimization problem based on three primary operations: a swap-based best-position operator, an ordered crossover operator, and selection of a random whale operation to perform the exploitation and exploration phases of the algorithm. These operations were carefully designed to preserve the methodology of the original whale algorithm. DFAP is a multi-objective problem that seeks to reach the optimal order of segments that maximizes the overlap score and minimizes the number of contigs (set of overlapping DNA segments) to compose a one-contig DNA strand. Existing local search methods, such as problem aware local search (PALS) many non-conflicting movements (PALS2-many), suffer from being trapped in local optima. Hence, the integration of DWOA with PALS2-many improves the search capability for finding the optimal order of fragments. In addition, we propose a new variation of PALS2-many that achieves simultaneously the two objectives of DFAP. Our proposed DWOA was compared with a number of the most recent robust assemblers: a hybrid crow search algorithm for solving the DFAP (CSA-P2M*Fit), P2M*Fit, and a hybrid genetic algorithm (GA-P2M*Fit). The experimental results and statistical analyses of the proposed DWOA on thirty benchmark instances show that DWOA significantly outperforms those algorithms in reaching fewer contigs, in addition to being competitive with CSA-P2M*Fit and superior to P2M*Fit and GA-P2M*Fit for the overlap score.
引用
收藏
页码:222144 / 222167
页数:24
相关论文
共 76 条
[1]   Energy-Aware Marine Predators Algorithm for Task Scheduling in IoT-Based Fog Computing Applications [J].
Abdel-Basset, Mohamed ;
Mohamed, Reda ;
Elhoseny, Mohamed ;
Bashir, Ali Kashif ;
Jolfaei, Alireza ;
Kumar, Neeraj .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (07) :5068-5076
[2]   Solar photovoltaic parameter estimation using an improved equilibrium optimizer [J].
Abdel-Basset, Mohamed ;
Mohamed, Reda ;
Mirjalili, Seyedali ;
Chakrabortty, Ripon K. ;
Ryan, Michael J. .
SOLAR ENERGY, 2020, 209 :694-708
[3]   HSMA_WOA: A hybrid novel Slime mould algorithm with whale optimization algorithm for tackling the image segmentation problem of chest X-ray images [J].
Abdel-Basset, Mohamed ;
Chang, Victor ;
Mohamed, Reda .
APPLIED SOFT COMPUTING, 2020, 95
[4]   A Hybrid COVID-19 Detection Model Using an Improved Marine Predators Algorithm and a Ranking-Based Diversity Reduction Strategy [J].
Abdel-Basset, Mohamed ;
Mohamed, Reda ;
Elhoseny, Mohamed ;
Chakrabortty, Ripon K. ;
Ryan, Michael .
IEEE ACCESS, 2020, 8 :79521-79540
[5]   A modified hybrid whale optimization algorithm for the scheduling problem in multimedia data objects [J].
Abdel-Basset, Mohamed ;
El-Shahat, Doaa ;
El-henawy, Ibrahim .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (04)
[6]   A new fusion of grey wolf optimizer algorithm with a two-phase mutation for feature selection [J].
Abdel-Basset, Mohamed ;
El-Shahat, Doaa ;
El-henawy, Ibrahim ;
de Albuquerque, Victor Hugo C. ;
Mirjalili, Seyedali .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 139
[7]   A modified nature inspired meta-heuristic whale optimization algorithm for solving 0-1 knapsack problem [J].
Abdel-Basset, Mohamed ;
El-Shahat, Doaa ;
Sangaiah, Arun Kumar .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (03) :495-514
[8]   Quantum based Whale Optimization Algorithm for wrapper feature selection [J].
Agrawal, R. K. ;
Kaur, Baljeet ;
Sharma, Surbhi .
APPLIED SOFT COMPUTING, 2020, 89
[9]   Gradient-based optimizer: A new metaheuristic optimization algorithm [J].
Ahmadianfar, Iman ;
Bozorg-Haddad, Omid ;
Chu, Xuefeng .
INFORMATION SCIENCES, 2020, 540 :131-159
[10]  
Alba E., 2013, Metaheuristics for Dynamic Optimization