An FPGA Based Accelerator for Clustering Algorithms With Custom Instructions

Cited by: 13
Authors
Wang, Chao [1]
Gong, Lei [1]
Jia, Fahui [2]
Zhou, Xuehai [1]
Affiliations
[1] Univ Sci & Technol China, Hefei 230027, Anhui, Peoples R China
[2] Univ Sci & Technol China, Suzhou Inst, Suzhou 215123, Peoples R China
Funding
National Science Foundation (USA);
Keywords
Clustering algorithms; Hardware; Field programmable gate arrays; Machine learning algorithms; Arrays; Logic arrays; Acceleration; Accelerators; clustering; custom instructions; machine learning; FPGA;
DOI
10.1109/TC.2020.2995761
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Clustering algorithms are increasingly popular and widely applied in fields such as machine learning, pattern recognition, and artificial intelligence. Accelerating them is challenging because of the explosive data scale and the wide variety of applications. Previous studies, however, mainly focus on raw speedup and pay insufficient attention to the flexibility of the accelerator to support various applications. To accelerate different clustering algorithms in one accelerator, in this article we design an FPGA-based accelerating framework for four state-of-the-art clustering methods: K-means, PAM, SLINK, and DBSCAN. We provide both Euclidean and Manhattan distances as similarity metrics in the accelerator design paradigm, as well as a custom instruction set to operate the accelerators within each application. To evaluate the performance and hardware cost of the accelerator, we constructed a hardware prototype on a state-of-the-art Xilinx FPGA platform. Experimental results demonstrate that the accelerator framework achieves up to a 23x speedup over an Intel Xeon processor and is 9.46x more energy efficient than an NVIDIA GTX 750 GPU accelerator.
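The two similarity metrics named in the abstract can be illustrated with a minimal software sketch. It is not the authors' hardware design or instruction set; the function names `euclidean`, `manhattan`, and `nearest_center` are hypothetical and chosen only for illustration. The sketch shows how the choice of metric can change which cluster center a point is assigned to, as in the assignment step of K-means.

```python
from math import sqrt


def euclidean(a, b):
    """Euclidean (L2) distance between two equal-length points."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Manhattan (L1) distance between two equal-length points."""
    return sum(abs(x - y) for x, y in zip(a, b))


def nearest_center(point, centers, metric=euclidean):
    """Return the index of the center closest to `point` under `metric`.

    This mirrors the assignment step of K-means in plain software;
    it is an illustrative assumption, not the accelerator's datapath.
    """
    return min(range(len(centers)), key=lambda i: metric(point, centers[i]))


if __name__ == "__main__":
    point = (0, 0)
    centers = [(3, 3), (0, 5)]
    # The two metrics can disagree on which center is nearest.
    print(nearest_center(point, centers, euclidean))
    print(nearest_center(point, centers, manhattan))
```

With the point (0, 0) and centers (3, 3) and (0, 5), the Euclidean metric prefers the first center while the Manhattan metric prefers the second, which is one reason a flexible accelerator might expose both.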
Pages: 725 - 732
Number of pages: 8
Related papers
50 in total
  • [31] Architecture of an FPGA Accelerator for LDA-Based Inference
    Ono, Taisuke
    Waidyasooriya, Hasitha Muthumala
    Hariyama, Masanori
    Ishigaki, Tsukasa
    2017 18TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2017), 2017, : 357 - 362
  • [32] An FPGA-based Hardware Accelerator for Iris Segmentation
    Avey, Joe
    Jones, Phillip
    Zambreno, Joseph
    2018 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2018,
  • [33] Custom Instructions for Networked Processor Templates
    Todman, Tim
    Luk, Wayne
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (07) : 3096 - 3100
  • [34] An Efficient FPGA-Based Accelerator Design for Convolution
    Song, Peng-Fei
    Pan, Jeng-Shyang
    Yang, Chun-Sheng
    Lee, Chiou-Yng
    2017 IEEE 8TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST), 2017, : 494 - 500
  • [35] A Deep Learning prediction process accelerator based FPGA
    Yu, Qi
    Wang, Chao
    Ma, Xiang
    Li, Xi
    Zhou, Xuehai
    2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 1159 - 1162
  • [36] Automatic Compiler Based FPGA Accelerator for CNN Training
    Venkataramanaiah, Shreyas Kolala
    Ma, Yufei
    Yin, Shihui
    Nurvitadhi, Eriko
    Dasu, Aravind
    Cao, Yu
    Seo, Jae-sun
    2019 29TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2019, : 166 - 172
  • [37] An FPGA Based Compression Accelerator for Forex Trading System
    Jang, Ji Hoon
    Lee, Seong Mo
    Gwon, Oh Seong
    Lee, Seung Eun
    INFORMATION TECHNOLOGY: NEW GENERATIONS, 2016, 448 : 711 - 720
  • [38] HPTA: A High Performance Transformer Accelerator Based on FPGA
    Han, Yuntao
    Liu, Qiang
    2023 33RD INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL, 2023, : 27 - 33
  • [39] FPGA based accelerator for simulated annealing with greedy perturbations
    Lukowiak, M.
    Cody, B.
    MIXDES 2007: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON MIXED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2007, : 274 - 277
  • [40] SWAT: An Efficient Swin Transformer Accelerator Based on FPGA
    Dong, Qiwei
    Xie, Xiaoru
    Wang, Zhongfeng
    29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 515 - 520