Data Prefetching for Scientific Workflow Based on Hadoop

Authors
Chen, Gaozhao [1]
Wu, Shaochun [1]
Gu, Rongrong [1]
Xu, Yongquan [1]
Xu, Lingyu [1]
Ge, Yunwen [1]
Song, Cuicui [1]
Affiliation
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200072, Peoples R China
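Source
COMPUTER AND INFORMATION SCIENCE 2012 | 2012 / Vol. 429
Keywords
Hadoop; data-intensive; scientific workflow; prefetching
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract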
Data-intensive scientific workflows based on Hadoop require large amounts of data transfer and storage. To address this problem on an execution cluster with limited computing resources, this paper adopts data prefetching to hide the overhead of data search and transfer and to reduce data access delays. A prefetching algorithm for data-intensive scientific workflows that takes the available computing resources into account is proposed. Experimental results indicate that the algorithm reduces response time and improves efficiency.
Pages: 81-92
Number of pages: 12