An FPGA Based Accelerator for Clustering Algorithms With Custom Instructions

被引:13
|
作者
Wang, Chao [1 ]
Gong, Lei [1 ]
Jia, Fahui [2 ]
Zhou, Xuehai [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Anhui, Peoples R China
[2] Univ Sci & Technol China, Suzhou Inst, Suzhou 215123, Peoples R China
基金
美国国家科学基金会;
关键词
Clustering algorithms; Hardware; Field programmable gate arrays; Machine learning algorithms; Arrays; Logic arrays; Acceleration; Accelerators; clustering; custom instructions; machine learning; FPGA;
D O I
10.1109/TC.2020.2995761
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering algorithms are becoming popular and widely applied in many academic fields, such as machine learning, pattern recognition, and artificial intelligence. It has posed significant challenges to accelerate the algorithms due to the explosive data scale and wide variety of applications. However, previous studies mainly focus on the raw speedup with insufficient attention to the flexibility of the accelerator to support various applications. In order to accelerate different clustering algorithms in one accelerator, in this article, we design an accelerating framework based on FPGA for four state-of-the-art clustering methods, including K-means, PAM, SLINK, and DBSCAN algorithms. Moreover, we provide both euclidean and Manhattan distances as similarity metrics in the accelerator design paradigm. Moreover, we provide a custom instruction set to operate the accelerators within each application. In order to evaluate the performance and hardware cost of the accelerator, we constructed a hardware prototype on the state-of-the-art Xilinx FPGA platform. Experimental results demonstrate that the accelerator framework is able to achieve up to 23x speedup than Intel Xeon processor, and is 9.46x more energy efficient than NVIDIA GTX 750 GPU accelerators.
引用
收藏
页码:725 / 732
页数:8
相关论文
共 50 条
  • [1] WooKong: A Ubiquitous Accelerator for Recommendation Algorithms With Custom Instruction Sets on FPGA
    Wang, Chao
    Gong, Lei
    Ma, Xiang
    Li, Xi
    Zhou, Xuehai
    IEEE TRANSACTIONS ON COMPUTERS, 2020, 69 (07) : 1071 - 1082
  • [2] FPGA Based Custom Accelerator Architecture Framework for Complex Event Processing
    Ekanayaka, Kavinga Upul Bandara
    Pasqual, Ajith
    TENCON 2014 - 2014 IEEE REGION 10 CONFERENCE, 2014,
  • [3] Exploiting FPGA-Aware Merging of Custom Instructions for Runtime Reconfiguration
    Lam, Siew-Kei
    Clarke, Christopher T.
    Srikanthan, Thambipillai
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2014, 7 (03)
  • [4] Exploiting FPGA-Aware Merging of Custom Instructions for Runtime Reconfiguration
    Lam, Siew-Kei
    Srikanthan, Thambipillai
    Clarke, Christopher T.
    2012 7TH INTERNATIONAL WORKSHOP ON RECONFIGURABLE AND COMMUNICATION-CENTRIC SYSTEMS-ON-CHIP (RECOSOC), 2012,
  • [5] A Ubiquitous Machine Learning Accelerator With Automatic Parallelization on FPGA
    Wang, Chao
    Gong, Lei
    Li, Xi
    Zhou, Xuehai
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (10) : 2346 - 2359
  • [6] A Power-Efficient Optimizing Framework FPGA Accelerator Based on Winograd for YOLO
    Bao, Chun
    Xie, Tao
    Feng, Wenbin
    Chang, Le
    Yu, Chongchong
    IEEE ACCESS, 2020, 8 : 94307 - 94317
  • [7] AlphaGo Policy Network: A DCNN Accelerator on FPGA
    Li, Zhenni
    Zhu, Can
    Gao, Yu-Liang
    Wang, Ze-Kun
    Wang, Jiao
    IEEE ACCESS, 2020, 8 : 203039 - 203047
  • [8] An FPGA-based Accelerator for Neighborhood-based Collaborative Filtering Recommendation Algorithms
    Ma, Xiang
    Wang, Chao
    Yu, Qi
    Li, Xi
    Zhou, Xuehai
    2015 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING - CLUSTER 2015, 2015, : 494 - 495
  • [9] Deep Neural Network Accelerator based on FPGA
    Thang Viet Huynh
    2017 4TH NAFOSTED CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2017, : 254 - 257
  • [10] An Efficient FPGA-based Accelerator for Deep Forest
    Zhu, Mingyu
    Luo, Jiapeng
    Mao, Wendong
    Wang, Zhongfeng
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 3334 - 3338