Tsetlin Machine-Based Image Classification FPGA Accelerator With On-Device Training

被引:0
|
作者
Tunheim, Svein Anders [1 ]
Jiao, Lei [1 ]
Shafik, Rishad [2 ]
Yakovlev, Alex [2 ]
Granmo, Ole-Christoffer [1 ]
机构
[1] Univ Agder, Ctr Artificial Intelligence Res CAIR, N-4879 Grimstad, Norway
[2] Newcastle Univ, Sch Engn, Microsyst Grp, Newcastle Upon Tyne NE1 7RU, England
关键词
Training; Field programmable gate arrays; Accuracy; Power demand; Image classification; Convolution; Energy efficiency; CMOS technology; Transformers; Learning automata; Machine learning; Tsetlin machine; accelerator; image classification; FPGA; NEURAL-NETWORKS; BINARY;
D O I
10.1109/TCSI.2024.3519191
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The Tsetlin Machine (TM) is a novel machine learning algorithm that uses Tsetlin automata (TAs) to define propositional logic expressions (clauses) for classification. This paper describes a field-programmable gate array (FPGA) accelerator for image classification based on the Convolutional Coalesced Tsetlin Machine. The accelerator classifies booleanized images of $28\times 28$ pixels into 10 classes, and is configured with 128 clauses in a highly parallel architecture. To achieve fast clause evaluation and class prediction, the TA action signals and the clause weights per class are available from registers. Full on-device training is included, and the TAs are implemented with 34 Block RAM (BRAM) instances which operate in parallel. Each BRAM is addressed by the clause number and has a 72-bit word width that supports 8 TAs. The design is implemented in a Xilinx Zynq Ultrascale $+$ XCZU7 FPGA. Running at 50 MHz, the accelerator core achieves 134k image classifications per second, with an energy consumption per classification of 13.3 $\mu$ J. A single training epoch of 60k samples requires a processing time of 1.5 seconds. The accelerator obtains a test accuracy of 97.6% on MNIST, 84.1% on Fashion-MNIST and 82.8% on Kuzushiji-MNIST.
引用
收藏
页码:830 / 843
页数:14
相关论文
共 50 条
  • [11] Automatic Compiler Based FPGA Accelerator for CNN Training
    Venkataramanaiah, Shreyas Kolala
    Ma, Yufei
    Yin, Shihui
    Nurvithadhi, Eriko
    Dasu, Aravind
    Cao, Yu
    Seo, Jae-sun
    2019 29TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2019, : 166 - 172
  • [12] Enabling On-Device Smartphone GPU based Training: Lessons Learned
    Das, Anish
    Kwon, Young D.
    Chauhan, Jagmohan
    Mascolo, Cecilia
    2022 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS (PERCOM WORKSHOPS), 2022,
  • [13] SVMnet: Non-Parametric Image Classification Based on Convolutional Ensembles of Support Vector Machines for Small Training Sets
    Goddard, Hunter
    Shamir, Lior
    IEEE ACCESS, 2022, 10 : 24029 - 24038
  • [14] An FPGA-Based Reconfigurable CNN Training Accelerator Using Decomposable Winograd
    Wang, Hui
    Lu, Jinming
    Lin, Jun
    Wang, Zhongfeng
    2023 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI, 2023, : 175 - 180
  • [15] Cyclic Learning Rate-Based Co-Training for Image Classification With Noisy Labels
    Zheng, Ying
    Gu, Yu
    Bai, Pingping
    Yuan, Dong
    Zhou, Siqi
    Lyu, Xin
    Chen, Ang
    IEEE ACCESS, 2025, 13 : 6292 - 6305
  • [16] An Efficient Deep Network in Network Architecture for Image Classification on FPGA Accelerator.
    Alaeddine, Hmidi
    Jihene, Malek
    Khemaja, Maha
    2021 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW 2021), 2021, : 72 - 77
  • [17] Improving Crowdsourcing-Based Image Classification Through Expanded Input Elicitation and Machine Learning
    Yasmin, Romena
    Hassan, Md Mahmudulla
    Grassel, Joshua. T. T.
    Bhogaraju, Harika
    Escobedo, Adolfo. R. R.
    Fuentes, Olac
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [18] A Reconfigurable Accelerator for Generative Adversarial Network Training Based on FPGA
    Yin, Tongtong
    Mao, Wendong
    Lu, Jinming
    Wang, Zhongfeng
    2021 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2021), 2021, : 144 - 149
  • [19] FPGA-Based Network Traffic Classification Using Machine Learning
    Elnawawy, Mohammed
    Sagahyroon, Assim
    Shanableh, Tamer
    IEEE ACCESS, 2020, 8 : 175637 - 175650
  • [20] GPU-based Private Information Retrieval for On-Device Machine Learning Inference
    Lam, Maximilian
    Johnson, Jeff
    Xiong, Wenjie
    Maeng, Kiwan
    Gupta, Udit
    Li, Yang
    Lai, Liangzhen
    Leontiadis, Ilias
    Rhu, Minsoo
    Lee, Hsien-Hsin S.
    Reddi, Vijay Janapa
    Wei, Gu-Yeon
    Brooks, David
    Suh, G. Edward
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, ASPLOS 2024, VOL 1, 2024, : 197 - 214