Parallel and distributed architecture of genetic algorithm on Apache Hadoop and Spark

被引:24
作者
Lu, Hau-Chun [1 ]
Hwang, F. J. [2 ]
Huang, Yao-Huei [1 ]
机构
[1] Fu Jen Catholic Univ, Dept Informat Management, New Taipei, Taiwan
[2] Univ Technol Sydney, Transport Res Ctr, Sch Math & Phys Sci, Ultimo, Australia
关键词
Genetic algorithm; Parallel and distributed computing; Traveling salesman problems; Apache Hadoop; Apache Spark; 2-MACHINE FLOWSHOP; OPTIMIZATION; SUBJECT; MODELS;
D O I
10.1016/j.asoc.2020.106497
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The genetic algorithm (GA), one of the best-known metaheuristic algorithms, has been extensively utilized in various fields of management science, operational research, and industrial engineering. The efficiency of GAs in solving large-scale optimization problems would be enhanced if the iterative processes required by the genetic operators can be implemented in a parallel and distributed computing architecture. Apache Hadoop has recently been one of the most popular systems for distributed storage and parallel processing of big data. By integrating the GA highly into Apache Hadoop, this study proposes an advanced GA parallel and distributed computing architecture that achieves the effectiveness and efficiency of GA evolution. Characterized by the sophisticated mechanism of dispatching the GA core operators into Apache Hadoop, the developed computing framework fits well with the cloud computing model. The presented GA parallelization architecture outperforms the state-of-the-art reference architectures according to the computational experiments where the testing instances of traveling salesman problems are employed. Our numerical experiments also demonstrate that the proposed architecture can readily be extended to Apache Spark. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
[31]   An Enhanced Parallelisation Model for Performance Prediction of Apache Spark on a Multinode Hadoop Cluster [J].
Ahmed, Nasim ;
Barczak, Andre L. C. ;
Rashid, Mohammad A. ;
Susnjak, Teo .
BIG DATA AND COGNITIVE COMPUTING, 2021, 5 (04)
[32]   Apache Spark Implementation of Whale Optimization Algorithm [J].
Maryam AlJame ;
Imtiaz Ahmad ;
Mohammad Alfailakawi .
Cluster Computing, 2020, 23 :2021-2034
[33]   Solving Traveling Salesman Problem Using Parallel River Formation Dynamics Optimization Algorithm on Multi-core Architecture Using Apache Spark [J].
Esra’a Alhenawi ;
Ruba Abu Khurma ;
Robertas Damaševic̆ius ;
Abdelazim G. Hussien .
International Journal of Computational Intelligence Systems, 17
[34]   ParaCA: A Speculative Parallel Crawling Approach on Apache Spark [J].
Li, Yuxiang ;
Zhang, Zhiyong ;
Niu, DanMei ;
Jing, Junchang .
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT I, 2020, 12452 :112-124
[35]   Apache Spark Implementation of Whale Optimization Algorithm [J].
AlJame, Maryam ;
Ahmad, Imtiaz ;
Alfailakawi, Mohammad .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (03) :2021-2034
[36]   Solving Traveling Salesman Problem Using Parallel River Formation Dynamics Optimization Algorithm on Multi-core Architecture Using Apache Spark [J].
Alhenawi, Esra'a ;
Abu Khurma, Ruba ;
Damasevicius, Robertas ;
Hussien, Abdelazim G. .
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
[37]   Distributed Data Processing on Microcomputers with Ascheduler and Apache Spark [J].
Korkhov, Vladimir ;
Gankevich, Ivan ;
Iakushkin, Oleg ;
Gushchanskiy, Dmitry ;
Khmel, Dmitry ;
Ivashchenko, Andrey ;
Pyayt, Alexander ;
Zobnin, Sergey ;
Loginov, Alexander .
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2017, PT V, 2017, 10408 :387-398
[38]   Distributed Classification of Text Documents on Apache Spark Platform [J].
Semberecki, Piotr ;
Maciejewski, Henryk .
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2016, 2016, 9692 :621-630
[39]   Efficient Distributed Range Query Processing in Apache Spark [J].
Papadopoulos, Apostolos N. ;
Sioutas, Spyros ;
Zacharatos, Nikolaos ;
Zaroliagis, Christos .
2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, :569-575
[40]   SparkScore: Leveraging Apache Spark for Distributed Genomic Inference [J].
Bahmani, Amir ;
Sibley, Alexander B. ;
Parsian, Mahmoud ;
Owzar, Kouros ;
Mueller, Frank .
2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, :435-442