Case Study of Scientific Data Processing on a Cloud Using Hadoop

Authors
Zhang, Chen [1]
De Sterck, Hans [2]
Aboulnaga, Ashraf [1]
Djambazian, Haig [3]
Sladek, Rob
Affiliations
[1] Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
[2] Univ Waterloo, Dept Appl Math, Waterloo, ON N2L 3G1, Canada
[3] McGill Univ, Genome Quebec Innovat Ctr, Montreal, PQ H3A 1A4, Canada
Source
HIGH PERFORMANCE COMPUTING SYSTEMS AND APPLICATIONS | 2010, Vol. 5976
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
With the increasing popularity of cloud computing, Hadoop has become a widely used open-source cloud computing framework for large-scale data processing. However, few efforts have been made to demonstrate the applicability of Hadoop to real-world application scenarios in fields other than server-side computations such as web indexing. In this paper, we use the Hadoop cloud computing framework to develop a user application that allows processing of scientific data on clouds. We describe a simple extension to Hadoop's MapReduce that allows it to handle scientific data processing problems with arbitrary input formats and explicit control over how the input is split. We use this approach to develop a Hadoop-based cloud computing application that processes sequences of microscope images of live cells, and we test its performance. We also discuss how the approach can be generalized to more complicated scientific data processing problems.
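The idea of explicit control over how the input is split can be illustrated with a small sketch. This is not the authors' extension and does not use Hadoop's API; it is a plain-Python toy under stated assumptions. The names `make_splits`, `process_split`, `run_job`, and `frames_per_split`, as well as the file names, are hypothetical. It assumes the input is a list of image frames that must stay whole, so splits are formed by frame count rather than by byte offset.

```python
from typing import Callable, Dict, List


def make_splits(frames: List[str], frames_per_split: int) -> List[List[str]]:
    """Group whole frames into splits; a frame is never cut across two splits."""
    if frames_per_split < 1:
        raise ValueError("frames_per_split must be at least 1")
    return [frames[i:i + frames_per_split]
            for i in range(0, len(frames), frames_per_split)]


def process_split(split: List[str]) -> Dict[str, int]:
    """Stand-in for a map task (hypothetical): records the length of each name."""
    return {name: len(name) for name in split}


def run_job(frames: List[str],
            frames_per_split: int,
            mapper: Callable[[List[str]], Dict[str, int]]) -> Dict[str, int]:
    """Run the mapper over every split and merge the per-split results."""
    merged: Dict[str, int] = {}
    for split in make_splits(frames, frames_per_split):
        merged.update(mapper(split))
    return merged


if __name__ == "__main__":
    # Made-up frame names standing in for a microscope image sequence.
    frames = [f"cell_{i:03d}.tif" for i in range(5)]
    print(make_splits(frames, 2))
    print(run_job(frames, 2, process_split))
```

In a real Hadoop job the corresponding decisions would live in a custom input format and record reader; the sketch only shows the grouping logic, not how the paper implements it.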
Pages: 400 / +
Page count: 3