CESMTuner: An Auto-Tuning Framework for the Community Earth System Model

被引:9
作者
Ding Nan [1 ,2 ,3 ,4 ]
Xue Wei [1 ,2 ,3 ,4 ]
Ji Xu [1 ,2 ,3 ,4 ]
Xu Haoyu [1 ,2 ]
Song Zhenya [5 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
[3] Tsinghua Univ, Key Lab Earth Syst Modeling, Minist Educ, Beijing 100084, Peoples R China
[4] Tsinghua Univ, Ctr Earth Syst Sci, Beijing 100084, Peoples R China
[5] SOA, Inst Oceanog 1, Qingdao 266061, Peoples R China
来源
2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS) | 2014年
关键词
auto-tuning; CESM; load balance; processor allocation; performance prediction; PERFORMANCE PORTABILITY; DYNAMICAL CORE; OCEAN MODEL;
D O I
10.1109/HPCC.2014.51
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The growing scientific demands of climate predication and climate projection have promoted to manage the computational resources of climate model rationally. The Community Earth System Model (CESM) is one of the state-of-the-art and the most widely-used coupled models for simulating the earth system. Although considerable effort has been put to improve the scalability of single component, CESM is still struggling with the poor performance due to load balance across components. To solve this problem, an easy-used and easy-ported auto-tuning framework named CESMTuner is proposed in this paper. It targets to reduce the time consumed of CESM as much as possible by looking for the optimal process configuration. In which, a novel process layout searching algorithm is presented that can look for the optimal process count of each component as well as the best process layout across components simultaneously. Moreover, a lightweight and accurate performance model is built to reduce searching overhead effectively. With the evaluation over TianHe-1A, CESMTuner can achieve 58.49% performance improvement compared to the widely-used sequential process layout and achieve 38.23% performance improvement compared to the heuristic branch and bound algorithm based on the performance model of simply fitting each component's runtime.
引用
收藏
页码:282 / 289
页数:8
相关论文
共 50 条
[31]   Tuning and auto-tuning of fractional order controllers for industry applications [J].
Monje, Concepcion A. ;
Vinagre, Blas M. ;
Feliu, Vicente ;
Chen, YangQuan .
CONTROL ENGINEERING PRACTICE, 2008, 16 (07) :798-812
[32]   A methodology to evaluate PID auto-tuning algorithms [J].
Romero, Julio A. ;
Sanchis, Roberto .
REVISTA IBEROAMERICANA DE AUTOMATICA E INFORMATICA INDUSTRIAL, 2011, 8 (01) :112-+
[33]   An improved auto-tuning scheme for PID controllers [J].
Dey, Chanchal ;
Mudi, Rajani K. .
ISA TRANSACTIONS, 2009, 48 (04) :396-409
[34]   A Robust Auto-tuning Scheme for PID Controllers [J].
Pandey, Sanjeev Kumar ;
Veeranna, Kuruva ;
Kumar, Bijender ;
Deshmukh, K. U. .
IECON 2020: THE 46TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2020, :47-52
[35]   Effect of Auto-Tuning on Serrated Flow Behavior [J].
Mohammed, S. M. A. K. ;
Chen, D. L. .
METALS, 2019, 9 (08)
[36]   MATOG: Array Layout Auto-Tuning for CUDA [J].
Weber, Nicolas ;
Goesele, Michael .
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2017, 14 (03)
[37]   Parallel GMRES Incomplete Orthogonalization Auto-Tuning [J].
Aquilanti, Pierre-Yves ;
Petiton, Serge ;
Calandra, Henri .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS), 2011, 4 :2246-2256
[38]   A methodology for comparing optimization algorithms for auto-tuning [J].
Willemsen, Floris-Jan ;
Schoonhoven, Richard ;
Filipovic, Jiri ;
Torring, Jacob O. ;
van Nieuwpoort, Rob ;
van Werkhoven, Ben .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 159 :489-504
[39]   Auto-tuning the Java']Java Virtual Machine [J].
Jayasena, Sanath ;
Fernando, Milinda ;
Rusira, Tharindu ;
Perera, Chalitha ;
Philips, Chamara .
2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, :1261-1270
[40]   Auto-Tuning Intermediate Representations for In Situ Visualization [J].
Frey, Steffen ;
Ertl, Thomas .
2016 NEW YORK SCIENTIFIC DATA SUMMIT (NYSDS), 2016,