Latency Tails of Byte-Addressable Non-Volatile Memories in Systems

被引:0
作者
Sun, Chao [1 ]
Le Moal, Damien [1 ]
Wang, Qingbo [1 ]
Mateescu, Robert [1 ]
Blagojevic, Filip [1 ]
Lueker-Boden, Martin [1 ]
Guyot, Cyril [1 ]
Bandic, Zvonimir [1 ]
Vucinic, Dejan [1 ]
机构
[1] Western Digital Res Ctr, San Jose, CA 95138 USA
来源
2017 IEEE 9TH INTERNATIONAL MEMORY WORKSHOP (IMW) | 2017年
关键词
Non-volatile memory; latency tail; quality-of-service (QoS); byte-addressable;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Next generation non-volatile memories, like Resistive RAM [1], Spin-Transfer Torque Magnetic RAM [2] and Phase Change Memory [3][4], are byte-addressable with very low latency, bridging the large performance gap between DRAM memory and NAND flash storage. For this reason we think of them as Storage Class Memories (SCMs), meaning their main use could ideally be as main memory but the non-volatility and high density could also fill some of the needs for durable storage. The path to using SCMs as main memory will necessitate significant changes to prevailing CPU architectures, so at first our focus was on enabling their early market adoption as ultrafast storage in commodity systems. In stark contrast to NAND flash, whose read latency of a tenth of a millisecond dominates the total system response latency to a storage request, SCM-based devices are so fast that attach interface and host device driver latencies, which are in the microsecond domain, start to dominate the total response latency, hindering greatly the performance of SCMs in commodity systems. Moreover, the latency jitter introduced by host hardware and software and by controller firmware further affects the Quality of Service (QoS) of solid-state drives based on SCMs. In this paper we discuss various factors that degrade the QoS, including host software and machine configurations. A particular fine-tuning of an x86 host machine, a well-designed device driver and a low latency device controller result in an ultra-low latency system with excellent QoS. We measure less than 4 mu s latency for 99.999% of I/O requests at queue depth one, and less than 7 mu s at queue depth 32, from an SCM-based block device on PCI Express interface.
引用
收藏
页码:11 / 14
页数:4
相关论文
共 9 条
[1]   Resistive Random Access Memory (ReRAM) Based on Metal Oxides [J].
Akinaga, Hiroyuki ;
Shima, Hisashi .
PROCEEDINGS OF THE IEEE, 2010, 98 (12) :2237-2251
[2]  
[Anonymous], 2012, P 10 USENIX C FILE S
[3]   Current-driven magnetization reversal and spin-wave excitations in Co/Cu/Co pillars [J].
Katine, JA ;
Albert, FJ ;
Buhrman, RA ;
Myers, EB ;
Ralph, DC .
PHYSICAL REVIEW LETTERS, 2000, 84 (14) :3149-3152
[4]  
Le Moal D., 2015, P 2015 NONV MEM WORK
[5]   REVERSIBLE ELECTRICAL SWITCHING PHENOMENA IN DISORDERED STRUCTURES [J].
OVSHINSKY, SR .
PHYSICAL REVIEW LETTERS, 1968, 21 (20) :1450-+
[6]  
Qureshi Moinuddin K., 2009, Proceedings of the 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO 2009), P14, DOI 10.1145/1669112.1669117
[7]  
Vasudeva Anil, NVME NEXT GENERATION
[8]  
Vucinic Dejan., 2014, P 12 USENIX C FILE S, P309
[9]   Phase Change Memory [J].
Wong, H. -S. Philip ;
Raoux, Simone ;
Kim, SangBum ;
Liang, Jiale ;
Reifenberg, John P. ;
Rajendran, Bipin ;
Asheghi, Mehdi ;
Goodson, Kenneth E. .
PROCEEDINGS OF THE IEEE, 2010, 98 (12) :2201-2227