Towards building a data-intensive index for big data computing - A case study of Remote Sensing data processing

被引:50
|
作者
Ma, Yan [1 ]
Wang, Lizhe [1 ]
Liu, Peng [1 ]
Ranjan, Rajiv
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Beijing 100864, Peoples R China
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
Big data; Parallel computing; Data-intensive computing; Remote Sensing data processing; SYSTEM;
D O I
10.1016/j.ins.2014.10.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the recent advances in Remote Sensing (RS) techniques, continuous Earth Observation is generating tremendous volume of RS data. The proliferation of RS data is revolutionizing the way in which RS data are processed and understood. Data with higher dimensionality, as well as the increasing requirement for real-time processing capabilities, have also given rise to the challenging issue of "Data-Intensive (DI) Computing". However, how to properly identify and qualify the DI issue remains a significant problem that is worth exploring. DI computing is a complex issue. While the huge data volume may be one of the reasons for this, some other factors could also be important. In this paper, we propose an empirical model (DIRS) of DI index to estimate RS applications. DIRS here is a novel empirical model (DIRS) that could quantify the DI issues in RS data processing with a normalized DI index. Through experimental analysis of the typical algorithms across the whole RS data processing flow, we identify the key factors that affect the DI issues mostly. Finally, combined with the empirical knowledge of domain experts, we formulate DIRS model to describe the correlations between the key factors and DI index. By virtue of experimental validation on more selected RS applications, we have found that DIRS model is an easy but promising approach. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:171 / 188
页数:18
相关论文
共 50 条
  • [41] G-Hadoop: MapReduce across distributed data centers for data-intensive computing
    Wang, Lizhe
    Tao, Jie
    Ranjan, Rajiv
    Marten, Holger
    Streit, Achim
    Chen, Jingying
    Chen, Dan
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2013, 29 (03): : 739 - 750
  • [42] TomusBlobs: scalable data-intensive processing on Azure clouds
    Costan, Alexandru
    Tudoran, Radu
    Antoniu, Gabriel
    Brasche, Goetz
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (04) : 950 - 976
  • [43] Towards Big Data Solutions for Industrial Tomography Data Processing
    Kowalska, Aleksandra
    Luczak, Piotr
    Sielski, Dawid
    Kowalski, Tomasz
    Romanowski, Andrzej
    Sankowski, Dominik
    PROCEEDINGS OF THE 2019 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2019, : 427 - 431
  • [44] Towards Efficient Big Data: Hadoop Data Placing and Processing
    Bahadi, Jihane
    El Asri, Bouchra
    Courtine, Melanie
    Rhanoui, Maryem
    Kergosien, Yannick
    2ND INTERNATIONAL CONFERENCE ON SMART DIGITAL ENVIRONMENT (ICSDE'18), 2018, : 42 - 47
  • [45] Improving Load Balance for Data-Intensive Computing on Cloud Platforms
    Dai, Wei
    Ibrahim, Ibrahim
    Bassiouni, Mostafa
    2016 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2016, : 140 - 145
  • [46] A Modular Remote Sensing Big Data Framework
    Xu, Chen
    Du, Xiaoping
    Fan, Xiangtao
    Yan, Zhenzhen
    Kang, Xujie
    Zhu, Junjie
    Hu, Zhongyang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [47] BIG DATA ISSUES FOR REMOTE SENSING: VARIETY
    Pierce, Leland
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 7593 - 7596
  • [48] Alleviation of Disk I/O Contention in Virtualized Settings for Data-Intensive Computing
    Malensek, Matthew
    Pallickara, Sangmi Lee
    Pallickara, Shrideep
    2015 IEEE/ACM 2ND INTERNATIONAL SYMPOSIUM ON BIG DATA COMPUTING (BDC), 2015, : 1 - 10
  • [49] Big Data Processing in Cloud Computing Environments
    Ji, Changqing
    Li, Yu
    Qiu, Wenming
    Awada, Uchechukwu
    Li, Keqiu
    PROCEEDINGS OF THE 2012 12TH INTERNATIONAL SYMPOSIUM ON PERVASIVE SYSTEMS, ALGORITHMS, AND NETWORKS (I-SPAN 2012), 2012, : 17 - 23
  • [50] Big Data Processing in Cloud Computing Environments
    Noraziah, A.
    Fakherldin, Mohammed Adam Ibrahim
    Adam, Khalid
    Majid, Mazlina Abdul
    ADVANCED SCIENCE LETTERS, 2017, 23 (11) : 11092 - 11095