Configuration of Parallel Real-Time Applications on Multi-Core Processors

被引:0
作者
Gharajeh, Mohammad Samadi [1 ]
Carvalho, Tiago [1 ]
Pinho, Luis Miguel [1 ]
机构
[1] Polytech Inst Porto, Sch Engn, Porto, Portugal
来源
2022 IEEE 20TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN) | 2022年
关键词
real-time systems; multi-core processors; timing analysis; parallel programming; OpenMP;
D O I
10.1109/INDIN51773.2022.9976163
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Parallel programming models (e.g., OpenMP) are more and more used to improve the performance of real-time applications in modern processors. Nevertheless, these processors have complex architectures, being very difficult to understand their timing behavior. The main challenge with most of existing works is that they apply static timing analysis for simpler models or measurement-based analysis using traditional platforms (e.g., single core) or considering only sequential algorithms. How to provide an efficient configuration for the allocation of the parallel program in the computing units of the processor is still an open challenge. This paper studies the problem of performing timing analysis on complex multi-core platforms, pointing out a methodology to understand the applications' timing behavior, and guide the configuration of the platform. As an example, the paper uses an OpenMP-based program of the Heat benchmark on a NVIDIA Jetson AGX Xavier. The main objectives are to analyze the execution time of OpenMP tasks, specify the best configuration of OpenMP directives, identify critical tasks, and discuss the predictability of the system/application. A Linux perf based measurement tool, which has been extended by our team, is applied to measure each task across multiple executions in terms of total CPU cycles, the number of cache accesses, and the number of cache misses at different cache levels, including L1, L2 and L3. The evaluation process is performed using the measurement of the performance metrics by our tool to study the predictability of the system/application.
引用
收藏
页码:67 / 73
页数:7
相关论文
共 21 条
[1]   HPCTOOLKIT: tools for performance analysis of optimized parallel programs [J].
Adhianto, L. ;
Banerjee, S. ;
Fagan, M. ;
Krentel, M. ;
Marin, G. ;
Mellor-Crummey, J. ;
Tallent, N. R. .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2010, 22 (06) :685-701
[2]   Practical parallelization of scientific applications with OpenMP, OpenACC and MPI [J].
Aldinucci, Marco ;
Cesare, Valentina ;
Colonnelli, Iacopo ;
Martinelli, Alberto Riccardo ;
Mittone, Gianluca ;
Cantalupo, Barbara ;
Cavazzoni, Carlo ;
Drocco, Maurizio .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 157 :13-29
[3]  
[Anonymous], 2018, NVPModel-NVIDIA jetson AGX xavier developer kit
[4]  
[Anonymous], 2019, HEAT BENCHMARK
[5]   Design-time performance modeling of compositional parallel programs [J].
Czappa, Fabian ;
Calotoiu, Alexandru ;
Hoehl, Thomas ;
Mantel, Heiko ;
Nguyen, Toni ;
Wolf, Felix .
PARALLEL COMPUTING, 2021, 108 (108)
[6]   Characterization of Power Usage and Performance in Data-Intensive Applications Using MapReduce over MPI [J].
Davis, Joshua ;
Gao, Tao ;
Chandrasekaran, Sunita ;
Jagode, Heike ;
Danalis, Anthony ;
Dongarra, Jack ;
Balaji, Pavan ;
Taufer, Michela .
PARALLEL COMPUTING: TECHNOLOGY TRENDS, 2020, 36 :287-298
[7]   MC2: Multicore and Cache Analysis via Deterministic and Probabilistic Jitter Bounding [J].
Diaz, Enrique ;
Fernandez, Mikel ;
Kosmidis, Leonidas ;
Mezzetti, Enrico ;
Hernandez, Carles ;
Abella, Jaume ;
Cazorla, Francisco J. .
RELIABLE SOFTWARE TECHNOLOGIES - ADA-EUROPE 2017, 2017, 10300 :102-118
[8]   On the Use of Probabilistic Worst-Case Execution Time Estimation for Parallel Applications in High Performance Systems [J].
Fusi, Matteo ;
Mazzocchetti, Fabio ;
Farres, Albert ;
Kosmidis, Leonidas ;
Canal, Ramon ;
Cazorla, Francisco J. ;
Abella, Jaume .
MATHEMATICS, 2020, 8 (03)
[9]  
Gonzalez A. M., 2020, PROC 10 EUROPEAN C E
[10]   Slow and Steady: Measuring and Tuning Multicore Interference [J].
Iorga, Dan ;
Sorensen, Tyler ;
Wickerson, John ;
Donaldson, Alastair F. .
2020 IEEE REAL-TIME AND EMBEDDED TECHNOLOGY AND APPLICATIONS SYMPOSIUM (RTAS 2020), 2020, :200-212