XHAMI - extended HDFS and MapReduce interface for Big Data image processing applications in cloud computing environments

被引:12
|
作者
Kune, Raghavendra [1 ]
Konugurthi, Pramod Kumar [1 ]
Agarwal, Arun [2 ]
Chillarige, Raghavendra Rao [2 ]
Buyya, Rajkumar [3 ]
机构
[1] Adv Data Proc Res Inst, Dept Space, Hyderabad 500009, Andhra Pradesh, India
[2] Univ Hyderabad, Sch Comp & Informat Sci, Hyderabad, Andhra Pradesh, India
[3] Univ Melbourne, Dept Comp & Informat Syst, Cloud Comp & Distributed Syst CLOUDS Lab, Melbourne, Vic, Australia
关键词
cloud computing; Big Data; Hadoop; MapReduce; extended MapReduce; XHAMI; image processing; scientific computing; remote sensing;
D O I
10.1002/spe.2425
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Hadoop distributed file system (HDFS) and MapReduce model have become popular technologies for large-scale data organization and analysis. Existing model of data organization and processing in Hadoop using HDFS and MapReduce are ideally tailored for search and data parallel applications, for which there is no need of data dependency with its neighboring/adjacent data. However, many scientific applications such as image mining, data mining, knowledge data mining, and satellite image processing are dependent on adjacent data for processing and analysis. In this paper, we identify the requirements of the overlapped data organization and propose a two-phase extension to HDFS and MapReduce programming model, called XHAMI, to address them. The extended interfaces are presented as APIs and implemented in the context of image processing application domain. We demonstrated effectiveness of XHAMI through case studies of image processing functions along with the results. Although XHAMI has little overhead in data storage and input/output operations, it greatly enhances the system performance and simplifies the application development process. Our proposed system, XHAMI, works without any changes for the existing MapReduce models and can be utilized by many applications where there is a requirement of overlapped data. Copyright (C) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:455 / 472
页数:18
相关论文
共 50 条
  • [31] PRIMEBALL: A Parallel Processing Framework Benchmark for Big Data Applications in the Cloud
    Ferrarons, Jaume
    Adhana, Mulu
    Colmenares, Carlos
    Pietrowska, Sandra
    Bentayeb, Fadila
    Darmont, Jerome
    PERFORMANCE CHARACTERIZATION AND BENCHMARKING, 2014, 8391 : 109 - 124
  • [32] Big Data Analytics Technology and Applications in Cloud Computing Perspective
    Wen, Xiangbin
    Wang, Zhenghui
    APPLIED MATHEMATICS AND NONLINEAR SCIENCES, 2023, 8 (02) : 1415 - 1432
  • [33] Google Earth Engine Cloud Computing Platform for Remote Sensing Big Data Applications: A Comprehensive Review
    Amani, Meisam
    Ghorbanian, Arsalan
    Ahmadi, Seyed Ali
    Kakooei, Mohammad
    Moghimi, Armin
    Mirmazloumi, S. Mohammad
    Moghaddam, Sayyed Hamed Alizadeh
    Mahdavi, Sahel
    Ghahremanloo, Masoud
    Parsian, Saeid
    Wu, Qiusheng
    Brisco, Brian
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 (13) : 5326 - 5350
  • [34] QoS-aware Resource Provisioning for Big Data Processing in Cloud Computing Environment
    Hassan, Mohammad Mehedi
    Song, Biao
    Hossain, M. Shamim
    Alamri, Atif
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), VOL 2, 2014, : 107 - 112
  • [35] Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing
    Ul Haq, Muhammad Inaam
    Li, Qianmu
    Hassan, Shoaib
    IEEE ACCESS, 2019, 7 : 162254 - 162267
  • [36] BigTrustScheduling: Trust-aware big data task scheduling approach in cloud computing environments
    Rjoub, Gaith
    Bentahar, Jamal
    Wahab, Omar Abdel
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 110 : 1079 - 1097
  • [37] Performance and energy efficiency of big data applications in cloud environments: A Hadoop case study
    Feller, Eugen
    Ramakrishnan, Lavanya
    Morin, Christine
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2015, 79-80 : 80 - 89
  • [38] APPLICATIONS OF BIG DATA IN RENEWABLE ENERGY SYSTEMS BASED ON CLOUD COMPUTING
    Sreedhar, Tarun Shakthi
    Islam, Saiful
    Atmosa, Meron
    Yazdandoust, Elaheh
    Elnaim, Mohamed Suliman
    Mishra, Shomesh
    Naresh, Venkata
    Bajpai, Vemparala Rupali
    INTERNATIONAL JOURNAL ON INFORMATION TECHNOLOGIES AND SECURITY, 2024, 16 (03): : 121 - 128
  • [39] Employing Vertical Elasticity for Efficient Big Data Processing in Container-Based Cloud Environments
    Choi, Jin-young
    Cho, Minkyoung
    Kim, Jik-Soo
    APPLIED SCIENCES-BASEL, 2021, 11 (13):
  • [40] Implementation of Image Processing System using Handover Technique with Map Reduce Based on Big Data in the Cloud Environment
    Ali, Mehraj
    Kumar, John
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2016, 13 (02) : 326 - 331