Terabyte Sort on FPGA-Accelerated Flash Storage

被引:26
作者
Jun, Sang-Woo [1 ]
Xu, Shuotao [1 ]
Arvind [1 ]
机构
[1] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
来源
2017 IEEE 25TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2017) | 2017年
关键词
ALGORITHM;
D O I
10.1109/FCCM.2017.53
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Sorting is one of the most fundamental and useful applications in computer science, and continues to be an important tool in analyzing large datasets. An important and challenging subclass of sorting problems involves sorting terabyte scale datasets with hundreds of billions of records. The conventional method of sorting such large amounts of data is to distribute the data and computation over a cluster of machines. Such solutions can be fast but are often expensive and power-hungry. In this paper, we propose a solution based on flash storage connected to a collection of FPGA-based sorting accelerators that perform large-scale merge-sort in storage. The accelerators include highly efficient sorting networks and merge trees that use bitonic sorting to emit multiple sorted values every cycle. We show that by appropriate use of accelerators we can remove all the computation bottlenecks so that the endto- end sorting performance is limited only by the flash storage bandwidth. We demonstrate that our flash-based system matches the performance of existing distributed-cluster solutions of much larger scale. More importantly, our prototype is able to show almost twice the power efficiency compared to the existing Joulesort record holder. An optimized system with less wasteful components is projected to be four times more efficient compared to the current record holder, sorting over 200,000 records per joule of energy.
引用
收藏
页码:17 / 24
页数:8
相关论文
共 33 条
[1]  
Agrawal Nitin, 2008, P USENIX ANN TECHN C, P57
[2]  
[Anonymous], 2014, GRAYSORT APACHE SPAR
[3]  
[Anonymous], INT J CONTROL AUTOMA
[4]  
[Anonymous], 2015, FUXISORT
[5]  
[Anonymous], FPGA 14
[6]  
[Anonymous], SORT BENCHMARK HOME
[7]  
[Anonymous], 2011 IEEE 9 S APPL S
[8]  
[Anonymous], P 2008 ACM SIGMOD IN
[9]  
[Anonymous], GRAYSORT MINUTESORT
[10]  
[Anonymous], TERASORT BENCHMARK C