Code Refactoring from OpenMP to MapReduce Model for Big Data Processing

被引:1
作者
Zhao, Junfeng [1 ]
Zhang, Minjia [1 ]
Yang, Hongji [2 ]
机构
[1] Inner Mongolia Univ, Coll Comp Sci, Hohhot, Peoples R China
[2] Univ Leicester, Dept Informat, Leicester, Leics, England
来源
2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019) | 2019年
基金
中国国家自然科学基金;
关键词
big data; OpenMP; MapReduce; code refactoring; CLOUD; TRANSLATOR;
D O I
10.1109/SmartWorld-UIC-ATC-SCALCOM-IOP-SCI.2019.00186
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the multi-core era, much software has been developed using parallel programming technology, such as OpenMP, to take full advantage of the CPU cores. Nevertheless, in the era of big data, ubiquitous systems have enabled data collection on an unprecedented scale, the existing computing power and storage capacity can no longer effectively satisfy the needs of big data processing. MapReduce is a parallel programming model in cloud computing, which provides a new way to cope with the problem of OpenMP program's resource limitations for processing big data. In order to enable the legacy OpenMP-based program to take advantage of the virtue of cloud computing for processing big data, it is worth studying how to refactor it into MapReduce model. A detailed approach for refactoring OpenMP to MapReduce is proposed, and a prototype tool O2MR was developed in this paper. Two experiments show that the refactoring approach is efficient and the tool is helpful to refactoring process. In addition, the program execution times before and after refactoring were compared by five data sets, and the results demonstrate that the refactored program has better performance than the original in the face of big data, and the performance of the refactored program will be better as the amount of data increases.
引用
收藏
页码:930 / 935
页数:6
相关论文
共 15 条
[1]  
Brown C., 2012, P 5 WORKSHOP REFACTO, P54
[2]  
Dig D, 2010, PROC IEEE INT CONF S
[3]   A Refactoring Approach to Parallelism [J].
Dig, Danny .
IEEE SOFTWARE, 2011, 28 (01) :17-22
[4]   Challenges in migrating legacy software systems to the cloud an empirical study [J].
Gholami, Mahdi Fahmideh ;
Daneshgar, Farhad ;
Beydoun, Ghassan ;
Rabhi, Fethi .
INFORMATION SYSTEMS, 2017, 67 :100-113
[5]  
Kang S. J., 2015, ADV MULTIMED, P1
[6]   YSmart: Yet Another SQL-to-MapReduce Translator [J].
Lee, Rubao ;
Luo, Tian ;
Huai, Yin ;
Wang, Fusheng ;
He, Yongqiang ;
Zhang, Xiaodong .
31ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2011), 2011, :25-36
[7]  
Lei H, 2018, MULTICORE HETEROGENE, P30
[8]   J2M: a Java']Java to MapReduce translator for cloud computing [J].
Li, Bing ;
Zhang, Junbo ;
Yu, Ning ;
Pan, Yi .
JOURNAL OF SUPERCOMPUTING, 2016, 72 (05) :1928-1945
[9]  
Li Y., 2013, SPECIFYING DETECTING
[10]  
Liao CS, 2013, INT J COMPUT SCI ENG, V8, P219