Parallel and distributed architecture of genetic algorithm on Apache Hadoop and Spark

被引:24
|
作者
Lu, Hau-Chun [1 ]
Hwang, F. J. [2 ]
Huang, Yao-Huei [1 ]
机构
[1] Fu Jen Catholic Univ, Dept Informat Management, New Taipei, Taiwan
[2] Univ Technol Sydney, Transport Res Ctr, Sch Math & Phys Sci, Ultimo, Australia
关键词
Genetic algorithm; Parallel and distributed computing; Traveling salesman problems; Apache Hadoop; Apache Spark; 2-MACHINE FLOWSHOP; OPTIMIZATION; SUBJECT; MODELS;
D O I
10.1016/j.asoc.2020.106497
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The genetic algorithm (GA), one of the best-known metaheuristic algorithms, has been extensively utilized in various fields of management science, operational research, and industrial engineering. The efficiency of GAs in solving large-scale optimization problems would be enhanced if the iterative processes required by the genetic operators can be implemented in a parallel and distributed computing architecture. Apache Hadoop has recently been one of the most popular systems for distributed storage and parallel processing of big data. By integrating the GA highly into Apache Hadoop, this study proposes an advanced GA parallel and distributed computing architecture that achieves the effectiveness and efficiency of GA evolution. Characterized by the sophisticated mechanism of dispatching the GA core operators into Apache Hadoop, the developed computing framework fits well with the cloud computing model. The presented GA parallelization architecture outperforms the state-of-the-art reference architectures according to the computational experiments where the testing instances of traveling salesman problems are employed. Our numerical experiments also demonstrate that the proposed architecture can readily be extended to Apache Spark. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Parallel and Distributed Implementation of Sine Cosine Algorithm on Apache Spark Platform
    Alfailakawi, Mohammad Gh.
    Aljame, Maryam
    Ahmad, Imtiaz
    IEEE ACCESS, 2021, 9 : 77188 - 77202
  • [2] A Parallel distributed genetic algorithm using Apache Spark for flexible scheduling of multitasks in a cloud manufacturing environment
    Elgendy, Abdelrahman
    Yan, Jihong
    Zhang, Mingyang
    INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2024, 37 (05) : 652 - 667
  • [3] Performance Comparison of Apache Hadoop and Apache Spark
    Singh, Amritpal
    Khamparia, Aditya
    Luhach, Ashish Kr
    PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS FOR COMPUTING RESEARCH (ICAICR '19), 2019,
  • [4] A speculative parallel decompression algorithm on Apache Spark
    Zhoukai Wang
    Yinliang Zhao
    Yang Liu
    Zhong Chen
    Cuocuo Lv
    Yuxiang Li
    The Journal of Supercomputing, 2017, 73 : 4082 - 4111
  • [5] A speculative parallel decompression algorithm on Apache Spark
    Wang, Zhoukai
    Zhao, Yinliang
    Liu, Yang
    Chen, Zhong
    Lv, Cuocuo
    Li, Yuxiang
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (09): : 4082 - 4111
  • [6] A distributed implementation using Apache Spark of a genetic algorithm applied to test data generation
    Paduraru, Ciprian
    Melemciuc, Marius-Constantin
    Stefanescu, Alin
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'17 COMPANION), 2017, : 1857 - 1863
  • [7] PRACTICAL RESULTS USING APACHE HADOOP PLATFORM FOR DISTRIBUTED AND PARALLEL COMPUTING
    Toma, Cristian
    INTERNATIONAL CONFERENCE ON INFORMATICS IN ECONOMY, 2012, : 30 - 35
  • [8] Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Spark
    Azhir, Elham
    Hosseinzadeh, Mehdi
    Khan, Faheem
    Mosavi, Amir
    MATHEMATICS, 2022, 10 (19)
  • [9] Global expansion of apache hadoop/apache spark activities at NTT DATA
    Ranaweera, Ravindra Sandaruwan
    Ajisaka, Akira
    NTT Technical Review, 2018, 16 (02):
  • [10] On the Usability of Hadoop MapReduce, Apache Spark & Apache Flink for Data Science
    Akil, Bilal
    Zhou, Ying
    Roehm, Uwe
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 303 - 310