Tsetlin Machine-Based Image Classification FPGA Accelerator With On-Device Training

被引：0

作者：

Tunheim, Svein Anders ^{[1
]}

Jiao, Lei ^{[1
]}

Shafik, Rishad ^{[2
]}

Yakovlev, Alex ^{[2
]}

Granmo, Ole-Christoffer ^{[1
]}

机构：

[1] Univ Agder, Ctr Artificial Intelligence Res CAIR, N-4879 Grimstad, Norway

[2] Newcastle Univ, Sch Engn, Microsyst Grp, Newcastle Upon Tyne NE1 7RU, England

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS | 2025年 / 72卷 / 02期

关键词：

Training; Field programmable gate arrays; Accuracy; Power demand; Image classification; Convolution; Energy efficiency; CMOS technology; Transformers; Learning automata; Machine learning; Tsetlin machine; accelerator; image classification; FPGA; NEURAL-NETWORKS; BINARY;

D O I：

10.1109/TCSI.2024.3519191

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The Tsetlin Machine (TM) is a novel machine learning algorithm that uses Tsetlin automata (TAs) to define propositional logic expressions (clauses) for classification. This paper describes a field-programmable gate array (FPGA) accelerator for image classification based on the Convolutional Coalesced Tsetlin Machine. The accelerator classifies booleanized images of $28\times 28$ pixels into 10 classes, and is configured with 128 clauses in a highly parallel architecture. To achieve fast clause evaluation and class prediction, the TA action signals and the clause weights per class are available from registers. Full on-device training is included, and the TAs are implemented with 34 Block RAM (BRAM) instances which operate in parallel. Each BRAM is addressed by the clause number and has a 72-bit word width that supports 8 TAs. The design is implemented in a Xilinx Zynq Ultrascale $+$ XCZU7 FPGA. Running at 50 MHz, the accelerator core achieves 134k image classifications per second, with an energy consumption per classification of 13.3 $\mu$ J. A single training epoch of 60k samples requires a processing time of 1.5 seconds. The accelerator obtains a test accuracy of 97.6% on MNIST, 84.1% on Fashion-MNIST and 82.8% on Kuzushiji-MNIST.

引用

页码：830 / 843

页数：14

共 50 条

[11] Automatic Compiler Based FPGA Accelerator for CNN Training
Venkataramanaiah, Shreyas Kolala
Ma, Yufei
Yin, Shihui
Nurvithadhi, Eriko
Dasu, Aravind
Cao, Yu
Seo, Jae-sun
2019 29TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2019, : 166 - 172
[12] Enabling On-Device Smartphone GPU based Training: Lessons Learned
Das, Anish
Kwon, Young D.
Chauhan, Jagmohan
Mascolo, Cecilia
2022 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS (PERCOM WORKSHOPS), 2022,
[13] SVMnet: Non-Parametric Image Classification Based on Convolutional Ensembles of Support Vector Machines for Small Training Sets
Goddard, Hunter
Shamir, Lior
IEEE ACCESS, 2022, 10 : 24029 - 24038
[14] An FPGA-Based Reconfigurable CNN Training Accelerator Using Decomposable Winograd
Wang, Hui
Lu, Jinming
Lin, Jun
Wang, Zhongfeng
2023 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI, 2023, : 175 - 180
[15] Cyclic Learning Rate-Based Co-Training for Image Classification With Noisy Labels
Zheng, Ying
Gu, Yu
Bai, Pingping
Yuan, Dong
Zhou, Siqi
Lyu, Xin
Chen, Ang
IEEE ACCESS, 2025, 13 : 6292 - 6305
[16] An Efficient Deep Network in Network Architecture for Image Classification on FPGA Accelerator.
Alaeddine, Hmidi
Jihene, Malek
Khemaja, Maha
2021 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW 2021), 2021, : 72 - 77
[17] Improving Crowdsourcing-Based Image Classification Through Expanded Input Elicitation and Machine Learning
Yasmin, Romena
Hassan, Md Mahmudulla
Grassel, Joshua. T. T.
Bhogaraju, Harika
Escobedo, Adolfo. R. R.
Fuentes, Olac
FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
[18] A Reconfigurable Accelerator for Generative Adversarial Network Training Based on FPGA
Yin, Tongtong
Mao, Wendong
Lu, Jinming
Wang, Zhongfeng
2021 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2021), 2021, : 144 - 149
[19] FPGA-Based Network Traffic Classification Using Machine Learning
Elnawawy, Mohammed
Sagahyroon, Assim
Shanableh, Tamer
IEEE ACCESS, 2020, 8 : 175637 - 175650
[20] GPU-based Private Information Retrieval for On-Device Machine Learning Inference
Lam, Maximilian
Johnson, Jeff
Xiong, Wenjie
Maeng, Kiwan
Gupta, Udit
Li, Yang
Lai, Liangzhen
Leontiadis, Ilias
Rhu, Minsoo
Lee, Hsien-Hsin S.
Reddi, Vijay Janapa
Wei, Gu-Yeon
Brooks, David
Suh, G. Edward
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, ASPLOS 2024, VOL 1, 2024, : 197 - 214

← 1 2 3 4 5 →