A Data Placement Strategy for Data-Intensive Cloud Storage

被引:3
|
作者
Ding, Jie [1 ]
Han, Haiyun [1 ]
Zhou, Aihua [1 ]
机构
[1] Res Inst Informat & Commun, Nanjing, Jiangsu, Peoples R China
关键词
Cloud computing; power systems; data placement; data movement; clustering; consistent hashing;
D O I
10.4028/www.scientific.net/AMR.354-355.896
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Data-Intensive applications in power systems often perform complex computations which always involve large amount of datasets. In a distributed environment, an application may needs several datasets located in different data centers which faces two challenges including the high cost of data movements between data centers and data dependencies within the same data centers. In this paper, a data placement strategy among and within data centers in a cloud environment is proposed. Datasets are placed in different centers by a clustering scheme based on the data dependencies. And within the center, data is partitioned and replicated using consistent hashing. Simulations show that the algorithm can effectively reduce the cost of data movements and perform a evenly data distribution.
引用
收藏
页码:896 / 900
页数:5
相关论文
共 50 条
  • [31] Adaptive Caching for Data-Intensive Scientific Workflows in the Cloud
    Heidsieck, Gaetan
    de Oliveira, Daniel
    Pacitti, Esther
    Pradal, Christophe
    Tardieu, Francois
    Valduriez, Patrick
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT II, 2019, 11707 : 452 - 466
  • [32] Scalable Data Placement of Data-intensive Services in Geo-distributed Clouds
    Atrey, Ankita
    Van Seghbroeck, Gregory
    Volckaert, Bruno
    De Turck, Filip
    CLOSER: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, 2018, : 497 - 508
  • [33] Maintaining Consistency in Data-Intensive Cloud Computing Environment
    Basu, Sruti
    Pattnaik, Prasant Kumar
    PROGRESS IN COMPUTING, ANALYTICS AND NETWORKING, ICCAN 2017, 2018, 710 : 257 - 264
  • [34] Nebula: Distributed Edge Cloud for Data-Intensive Computing
    Ryden, Mathew
    Oh, Kwangsung
    Chandra, Abhishek
    Weissman, Jon
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON COLLABORATION TECHNOLOGIES AND SYSTEMS (CTS), 2014, : 491 - 492
  • [35] Dynamic Scheduling Approach for Data-Intensive Cloud Environment
    Islam, Md. Rafiqul
    Habiba, Mansura
    2012 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGIES, APPLICATIONS AND MANAGEMENT (ICCCTAM), 2012, : 179 - 185
  • [36] Fair Resource Allocation for Data-Intensive Computing in the Cloud
    Tang, Shanjiang
    Lee, Bu-Sung
    He, Bingsheng
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2018, 11 (01) : 20 - 33
  • [37] Unifying Data and Replica Placement for Data-intensive Services in Geographically Distributed Clouds
    Atrey, Ankita
    Van Seghbroeck, Gregory
    Mora, Higinio
    De Turck, Filip
    Volckaert, Bruno
    CLOSER: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, 2019, : 25 - 36
  • [38] A Network Performance Based Data Placement Policy in Distributed Data-Intensive Applications
    Xu, Dawei
    Miao, Xianglin
    Hu, Peng
    Luan, Zhongzhi
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2014, : 795 - 800
  • [39] NSM: A distributed storage architecture for data-intensive applications
    Ali, Z
    Malluhi, Q
    20TH IEEE/11TH NASA GODDARD CONFERENCE ON MASS STORAGE AND TECHNOLOGIES (MSST 2003), PROCEEDINGS, 2003, : 87 - 91
  • [40] TADRP: Toward Thermal-Aware Data Replica Placement in Data-Intensive Data Centers
    Li, Jie
    Deng, Yuhui
    Zhou, Yi
    Wu, Zhaorui
    Pang, Shujie
    Min, Geyong
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (04): : 4397 - 4415