Asynchronous memory access unit for general purpose processors

被引:0
作者
Wang L. [1 ,2 ]
Zhang X. [1 ,2 ]
Lu T. [1 ,2 ]
Chen M. [1 ,2 ]
机构
[1] Institute of Computing Technology, Chinese Academy of Sciences, Beijing
[2] School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing
来源
BenchCouncil Transactions on Benchmarks, Standards and Evaluations | 2022年 / 2卷 / 02期
关键词
Asynchronous memory access; Far memory; Micro-architecture;
D O I
10.1016/j.tbench.2022.100061
中图分类号
学科分类号
摘要
In future data centers, applications will make heavy use of far memory (including disaggregated memory pools and NVM). The access latency of far memory is more widely distributed than that of local memory accesses. This makes the efficiency of traditional out-of-order load/store mechanism in most general-purpose processors decrease in this scenario. Therefore, this work proposes an in-core asynchronous memory access unit to fully utilize the far memory resources. © 2022 The Authors
引用
收藏
相关论文
共 14 条
  • [1] Aguilera M.K., Amit N., Calciu I., Deguillard X., Gandhi J., Subrahmanyam P., Suresh L., Tati K., Venkatasubramanian R., Wei M., Remote memory in the age of fast networks, Proceedings of the 2017 Symposium on Cloud Computing, SoCC ’17, pp. 121-127, (2017)
  • [2] openCAPI specification, (2017)
  • [3] Gen-Z specification, (2018)
  • [4] Compute express link, (2022)
  • [5] Pinto C., Syrivelis D., Gazzetti M., Koutsovasilis P., Reale A., Katrinis K., Hofstee H.P., ThymesisFlow: A Software-Defined, HW/SW co-Designed Interconnect Stack for Rack-Scale Memory Disaggregation, pp. 868-880, (2020)
  • [6] Liu H.-K., Chen D., Jin H., Liao X.-F., He B., Hu K., Zhang Y., A survey of non-volatile main memory technologies: State-of-the-arts, practices, and future directions, J. Comput. Sci. Tech., 36, 1, pp. 4-32, (2021)
  • [7] Intel optane persistent memory, (2022)
  • [8] Lagar-Cavilla A., Ahn J., Souhlal S., Agarwal N., Burny R., Butt S., Chang J., Chaugule A., Deng N., Shahid J., Thelen G., Yurtsever K.A., Zhao Y., Ranganathan P., Software-defined far memory in warehouse-scale computers, Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS ’19, pp. 317-330, (2019)
  • [9] Tuck J., Ceze L., Torrellas J., Scalable cache miss handling for high memory-level parallelism, 2006 39th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO’06, pp. 409-422, (2006)
  • [10] Asiatici M., Ienne P., Stop crying over your cache miss rate: Handling efficiently thousands of outstanding misses in fpgas, pp. 310-319, (2019)