Parallel and distributed architecture of genetic algorithm on Apache Hadoop and Spark

被引：24

作者：

Lu, Hau-Chun ^{[1
]}

Hwang, F. J. ^{[2
]}

Huang, Yao-Huei ^{[1
]}

机构：

[1] Fu Jen Catholic Univ, Dept Informat Management, New Taipei, Taiwan

[2] Univ Technol Sydney, Transport Res Ctr, Sch Math & Phys Sci, Ultimo, Australia

来源：

APPLIED SOFT COMPUTING | 2020年 / 95卷

关键词：

Genetic algorithm; Parallel and distributed computing; Traveling salesman problems; Apache Hadoop; Apache Spark; 2-MACHINE FLOWSHOP; OPTIMIZATION; SUBJECT; MODELS;

D O I：

10.1016/j.asoc.2020.106497

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The genetic algorithm (GA), one of the best-known metaheuristic algorithms, has been extensively utilized in various fields of management science, operational research, and industrial engineering. The efficiency of GAs in solving large-scale optimization problems would be enhanced if the iterative processes required by the genetic operators can be implemented in a parallel and distributed computing architecture. Apache Hadoop has recently been one of the most popular systems for distributed storage and parallel processing of big data. By integrating the GA highly into Apache Hadoop, this study proposes an advanced GA parallel and distributed computing architecture that achieves the effectiveness and efficiency of GA evolution. Characterized by the sophisticated mechanism of dispatching the GA core operators into Apache Hadoop, the developed computing framework fits well with the cloud computing model. The presented GA parallelization architecture outperforms the state-of-the-art reference architectures according to the computational experiments where the testing instances of traveling salesman problems are employed. Our numerical experiments also demonstrate that the proposed architecture can readily be extended to Apache Spark. (C) 2020 Elsevier B.V. All rights reserved.

引用

页数：15

共 50 条

[1] Parallel and Distributed Implementation of Sine Cosine Algorithm on Apache Spark Platform
Alfailakawi, Mohammad Gh.
Aljame, Maryam
Ahmad, Imtiaz
IEEE ACCESS, 2021, 9 : 77188 - 77202
[2] A Parallel distributed genetic algorithm using Apache Spark for flexible scheduling of multitasks in a cloud manufacturing environment
Elgendy, Abdelrahman
Yan, Jihong
Zhang, Mingyang
INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2024, 37 (05) : 652 - 667
[3] Performance Comparison of Apache Hadoop and Apache Spark
Singh, Amritpal
Khamparia, Aditya
Luhach, Ashish Kr
PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS FOR COMPUTING RESEARCH (ICAICR '19), 2019,
[4] A speculative parallel decompression algorithm on Apache Spark
Zhoukai Wang
Yinliang Zhao
Yang Liu
Zhong Chen
Cuocuo Lv
Yuxiang Li
The Journal of Supercomputing, 2017, 73 : 4082 - 4111
[5] A speculative parallel decompression algorithm on Apache Spark
Wang, Zhoukai
Zhao, Yinliang
Liu, Yang
Chen, Zhong
Lv, Cuocuo
Li, Yuxiang
JOURNAL OF SUPERCOMPUTING, 2017, 73 (09): : 4082 - 4111
[6] A distributed implementation using Apache Spark of a genetic algorithm applied to test data generation
Paduraru, Ciprian
Melemciuc, Marius-Constantin
Stefanescu, Alin
PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'17 COMPANION), 2017, : 1857 - 1863
[7] PRACTICAL RESULTS USING APACHE HADOOP PLATFORM FOR DISTRIBUTED AND PARALLEL COMPUTING
Toma, Cristian
INTERNATIONAL CONFERENCE ON INFORMATICS IN ECONOMY, 2012, : 30 - 35
[8] Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Spark
Azhir, Elham
Hosseinzadeh, Mehdi
Khan, Faheem
Mosavi, Amir
MATHEMATICS, 2022, 10 (19)
[9] Global expansion of apache hadoop/apache spark activities at NTT DATA
Ranaweera, Ravindra Sandaruwan
Ajisaka, Akira
NTT Technical Review, 2018, 16 (02):
[10] On the Usability of Hadoop MapReduce, Apache Spark & Apache Flink for Data Science
Akil, Bilal
Zhou, Ying
Roehm, Uwe
2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 303 - 310

← 1 2 3 4 5 →