An FPGA Based Accelerator for Clustering Algorithms With Custom Instructions

被引：13

作者：

Wang, Chao ^{[1
]}

Gong, Lei ^{[1
]}

Jia, Fahui ^{[2
]}

Zhou, Xuehai ^{[1
]}

机构：

[1] Univ Sci & Technol China, Hefei 230027, Anhui, Peoples R China

[2] Univ Sci & Technol China, Suzhou Inst, Suzhou 215123, Peoples R China

来源：

IEEE TRANSACTIONS ON COMPUTERS | 2021年 / 70卷 / 05期

基金：

美国国家科学基金会;

关键词：

Clustering algorithms; Hardware; Field programmable gate arrays; Machine learning algorithms; Arrays; Logic arrays; Acceleration; Accelerators; clustering; custom instructions; machine learning; FPGA;

D O I：

10.1109/TC.2020.2995761

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Clustering algorithms are becoming popular and widely applied in many academic fields, such as machine learning, pattern recognition, and artificial intelligence. It has posed significant challenges to accelerate the algorithms due to the explosive data scale and wide variety of applications. However, previous studies mainly focus on the raw speedup with insufficient attention to the flexibility of the accelerator to support various applications. In order to accelerate different clustering algorithms in one accelerator, in this article, we design an accelerating framework based on FPGA for four state-of-the-art clustering methods, including K-means, PAM, SLINK, and DBSCAN algorithms. Moreover, we provide both euclidean and Manhattan distances as similarity metrics in the accelerator design paradigm. Moreover, we provide a custom instruction set to operate the accelerators within each application. In order to evaluate the performance and hardware cost of the accelerator, we constructed a hardware prototype on the state-of-the-art Xilinx FPGA platform. Experimental results demonstrate that the accelerator framework is able to achieve up to 23x speedup than Intel Xeon processor, and is 9.46x more energy efficient than NVIDIA GTX 750 GPU accelerators.

引用

页码：725 / 732

页数：8

共 50 条

[21] An FPGA-Based accelerator for multiphysics modeling
Huang, XM
Ma, J
ERSA '04: THE 2004 INTERNATIONAL CONFERENCE ON ENGINEERING OF RECONFIGURABLE SYSTEMS AND ALGORITHMS, 2004, : 209 - 212
[22] A Spiking LSTM Accelerator for Automatic Speech Recognition Application Based on FPGA
Yin, Tingting
Dong, Feihong
Chen, Chao
Ouyang, Chenghao
Wang, Zheng
Yang, Yongkui
ELECTRONICS, 2024, 13 (05)
[23] Contribution of Custom Instructions on SoPC for iris recognition application
Ea, Thomas
Amiel, Frederic
Michalowska, Alicja
Rossant, Florence
Amara, Amara
2006 13TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS, VOLS 1-3, 2006, : 455 - 458
[24] Falic: An FPGA-Based Multi-Scalar Multiplication Accelerator for Zero-Knowledge Proof
Yang, Yongkui
Lu, Zhenyan
Zeng, Jingwei
Liu, Xingguo
Qian, Xuehai
Yu, Zhibin
IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (12) : 2791 - 2804
[25] A Real-Time Naive Bayes Classifier Accelerator on FPGA
Xue, Zhen
Wei, Jizeng
Guo, Wei
IEEE ACCESS, 2020, 8 (08): : 40755 - 40766
[26] Comparison of Machine Learning Classification and Clustering Algorithms for TV Commercials Detection
Abdelfattah, Eman
Joshi, Shreehar
IEEE ACCESS, 2023, 11 : 116741 - 116751
[27] Minitaur, an Event-Driven FPGA-Based Spiking Network Accelerator
Neil, Daniel
Liu, Shih-Chii
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2014, 22 (12) : 2621 - 2628
[28] FitNN: A Low-Resource FPGA-Based CNN Accelerator for Drones
Zhang, Zhichao
Mahmud, M. A. Parvez
Kouzani, Abbas Z.
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (21) : 21357 - 21369
[29] Custom Instructions for Networked Processor Templates
Todman, Tim
Luk, Wayne
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (07) : 3096 - 3100
[30] Unified Accelerator for Attention and Convolution in Inference Based on FPGA
Li, Tianyang
Zhang, Fan
Fan, Xitian
Shen, Jianliang
Guo, Wei
Cao, Wei
2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,

← 1 2 3 4 5 →