Towards Efficient Dynamic Binary Translation Optimizations Based on RISC Architectural Features

被引:0
作者
Xie, WenBing [1 ]
Tang, DaGuo [1 ]
Qi, FengBin [2 ]
Chai, ZhiLei [3 ]
Luo, QiaoLing [1 ]
Lin, Yuan [1 ]
机构
[1] Wuxi Inst Adv Technol, Wuxi, Peoples R China
[2] Natl Res Ctr Parallel Comp Engn & Technol, Beijing, Peoples R China
[3] Jiang Nan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi, Peoples R China
关键词
Dynamic binary translation; RISC architectural features; fast FP emulation; control lookup inlining; register mapping;
D O I
10.1142/S0218126624501044
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic binary translation (DBT) is a core technology that enables the migration of legacy software to different instruction set architectures while maintaining the original semantics. However, the development and maintenance of an efficient cross-DBT system are challenging. Key challenges include memory access overhead, inefficient instruction simulation, and frequent context switches. In this paper, we propose three novel optimization techniques. First, we formalize a register mapping cost model and investigate a hierarchical register mapping approach to bridge the memory access overhead. Second, we accelerate floating point (FP) emulation by surrounding the use of hardware FP unit with high-efficiency non-FP code. Third, we present a function inlining approach to alleviating the overhead associated with indirect control lookup. On the system side, we implement our approach on ARM64 and SW64 architectures based on QEMU and extensively evaluate the effectiveness with the SPEC2006 benchmark suite. The experimental results show that an average of 1.28x performance speedup and 13.41% code size reduction can be achieved on SW64. Similarly, on ARM64, we achieve an average of 1.15x performance speedup and 11.48% code size reduction.
引用
收藏
页数:24
相关论文
共 31 条
[1]  
[Anonymous], 2017, SWCPU
[2]  
[Anonymous], 2022, PRINCIPLE APPL DYNAM
[3]  
[Anonymous], 2022, AMD64 Technology
[4]  
[Anonymous], 2021, ROSETTA 2 MAC APPLE
[5]  
[Anonymous], 2021, INTEL CORE PROCESSOR
[6]  
Bellard F, 2005, USENIX Association Proceedings of the FREENIX/Open Source Track, P41
[7]  
Chen W, 2011, LECT NOTES COMPUT SC, V6908, P309, DOI 10.1007/978-3-642-23300-5_24
[8]   An approach to minimizing the interpretation overhead in Dynamic Binary Translation [J].
Chen, Wei ;
Chen, Dan ;
Wang, Zhiying .
JOURNAL OF SUPERCOMPUTING, 2012, 61 (03) :804-825
[9]  
Clark MD, 2017, CREATIVE WRITING INNOVATIONS: BREAKING BOUNDARIES IN THE CLASSROOM, P1
[10]  
Cota EmilioG., 2019, P 15 ACM SIGPLAN SIG, P74, DOI DOI 10.1145/3313808.3313811