An FPGA Based Accelerator for Clustering Algorithms With Custom Instructions

被引:13
|
作者
Wang, Chao [1 ]
Gong, Lei [1 ]
Jia, Fahui [2 ]
Zhou, Xuehai [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Anhui, Peoples R China
[2] Univ Sci & Technol China, Suzhou Inst, Suzhou 215123, Peoples R China
基金
美国国家科学基金会;
关键词
Clustering algorithms; Hardware; Field programmable gate arrays; Machine learning algorithms; Arrays; Logic arrays; Acceleration; Accelerators; clustering; custom instructions; machine learning; FPGA;
D O I
10.1109/TC.2020.2995761
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering algorithms are becoming popular and widely applied in many academic fields, such as machine learning, pattern recognition, and artificial intelligence. It has posed significant challenges to accelerate the algorithms due to the explosive data scale and wide variety of applications. However, previous studies mainly focus on the raw speedup with insufficient attention to the flexibility of the accelerator to support various applications. In order to accelerate different clustering algorithms in one accelerator, in this article, we design an accelerating framework based on FPGA for four state-of-the-art clustering methods, including K-means, PAM, SLINK, and DBSCAN algorithms. Moreover, we provide both euclidean and Manhattan distances as similarity metrics in the accelerator design paradigm. Moreover, we provide a custom instruction set to operate the accelerators within each application. In order to evaluate the performance and hardware cost of the accelerator, we constructed a hardware prototype on the state-of-the-art Xilinx FPGA platform. Experimental results demonstrate that the accelerator framework is able to achieve up to 23x speedup than Intel Xeon processor, and is 9.46x more energy efficient than NVIDIA GTX 750 GPU accelerators.
引用
收藏
页码:725 / 732
页数:8
相关论文
共 50 条
  • [21] An FPGA-Based accelerator for multiphysics modeling
    Huang, XM
    Ma, J
    ERSA '04: THE 2004 INTERNATIONAL CONFERENCE ON ENGINEERING OF RECONFIGURABLE SYSTEMS AND ALGORITHMS, 2004, : 209 - 212
  • [22] A Spiking LSTM Accelerator for Automatic Speech Recognition Application Based on FPGA
    Yin, Tingting
    Dong, Feihong
    Chen, Chao
    Ouyang, Chenghao
    Wang, Zheng
    Yang, Yongkui
    ELECTRONICS, 2024, 13 (05)
  • [23] Contribution of Custom Instructions on SoPC for iris recognition application
    Ea, Thomas
    Amiel, Frederic
    Michalowska, Alicja
    Rossant, Florence
    Amara, Amara
    2006 13TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS, VOLS 1-3, 2006, : 455 - 458
  • [24] Falic: An FPGA-Based Multi-Scalar Multiplication Accelerator for Zero-Knowledge Proof
    Yang, Yongkui
    Lu, Zhenyan
    Zeng, Jingwei
    Liu, Xingguo
    Qian, Xuehai
    Yu, Zhibin
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (12) : 2791 - 2804
  • [25] A Real-Time Naive Bayes Classifier Accelerator on FPGA
    Xue, Zhen
    Wei, Jizeng
    Guo, Wei
    IEEE ACCESS, 2020, 8 (08): : 40755 - 40766
  • [26] Comparison of Machine Learning Classification and Clustering Algorithms for TV Commercials Detection
    Abdelfattah, Eman
    Joshi, Shreehar
    IEEE ACCESS, 2023, 11 : 116741 - 116751
  • [27] Minitaur, an Event-Driven FPGA-Based Spiking Network Accelerator
    Neil, Daniel
    Liu, Shih-Chii
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2014, 22 (12) : 2621 - 2628
  • [28] FitNN: A Low-Resource FPGA-Based CNN Accelerator for Drones
    Zhang, Zhichao
    Mahmud, M. A. Parvez
    Kouzani, Abbas Z.
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (21) : 21357 - 21369
  • [29] Custom Instructions for Networked Processor Templates
    Todman, Tim
    Luk, Wayne
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (07) : 3096 - 3100
  • [30] Unified Accelerator for Attention and Convolution in Inference Based on FPGA
    Li, Tianyang
    Zhang, Fan
    Fan, Xitian
    Shen, Jianliang
    Guo, Wei
    Cao, Wei
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,