An Exploratory Study on Software-Defined Data Center Hard Disk Drives

被引：3

作者：

Li, Yin ^{[1
]}

Chen, Xubin ^{[1
]}

Zheng, Ning ^{[2
]}

Hao, Jingpeng ^{[1
]}

Zhang, Tong ^{[1
]}

机构：

[1] Rensselaer Polytech Inst, Dept Elect Comp 8t Syst Engn, 110 8th St, Troy, NY 12180 USA

[2] Scaleflux Inc, 97 East Brokaw Rd Suite 260, San Jose, CA 95112 USA

来源：

ACM TRANSACTIONS ON STORAGE | 2019年 / 15卷 / 03期

基金：

美国国家科学基金会;

关键词：

Reliability; error-tolerance; filesystem design; local erasure coding;

D O I：

10.1145/3319405

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article presents a design framework aiming to reduce mass data storage cost in data centers. Its underlying principle is simple: Assume one may noticeably reduce the HDD manufacturing cost by significantly (i.e., at least several orders of magnitude) relaxing raw HDD reliability, which ensures the eventual data storage integrity via low-cost system-level redundancy. This is called system-assisted HDD bit cost reduction. To better utilize both capacity and random IOPS of HDDs, it is desirable to mix data with complementary requirements on capacity and random IOPS in each HDD. Nevertheless, different capacity and random IOPS requirements may demand different raw HDD reliability vs. bit cost trade-offs and hence different forms of system-assisted bit cost reduction. This article presents a software-centric design framework to realize data-adaptive system-assisted bit cost reduction for data center HDDs. Implementation is solely handled by the filesystem and demands only minor change of the error correction coding (ECC) module inside HDDs. Hence, it is completely transparent to all the other components in the software stack (e.g., applications, OS kernel, and drivers) and keeps fundamental HIM) design practice (e.g., firmware, media, head, and servo) intact. We carried out analysis and experiments to evaluate its implementation feasibility and effectiveness. We integrated the design techniques into ext4 to further quantitatively measure its impact on system speed performance.

引用

页数：22

共 35 条

[1]

[Anonymous], 2011, SIGMOD 2011

[2]

Bairavasundaram LN, 2007, PERF E R SI, V35, P289

[3]

Bates S., 2013, P FLASH MEM SUMM AUG, P1

[4] EVENODD - AN EFFICIENT SCHEME FOR TOLERATING DOUBLE-DISK FAILURES IN RAID ARCHITECTURES [J].

BLAUM, M ;

BRADY, J ;

BRUCK, J ;

MENON, J .

IEEE TRANSACTIONS ON COMPUTERS, 1995, 44 (02) :192-202

[5]

Brewer E., 2016, Disks for data centers

[6] RAID - HIGH-PERFORMANCE, RELIABLE SECONDARY STORAGE [J].

CHEN, PM ;

LEE, EK ;

GIBSON, GA ;

KATZ, RH ;

PATTERSON, DA .

ACM COMPUTING SURVEYS, 1994, 26 (02) :145-185

[7]

Daniel EricD., 1999, MAGNETIC RECORDING 1

[8] Network Coding for Distributed Storage Systems [J].

Dimakis, Alexandros G. ;

Godfrey, P. Brighten ;

Wu, Yunnan ;

Wainwright, Martin J. ;

Ramchandran, Kannan .

IEEE TRANSACTIONS ON INFORMATION THEORY, 2010, 56 (09) :4539-4551

[9]

Ferris Timothy A., 2015, US Patent, Patent No. [9,093,105, 9093105]

[10]

Ganger Gregory R., 2003, TECHNICAL REPORT

← 1 2 3 4 →