Hardware acceleration of Tiny YOLO deep neural networks for sign language recognition: A comprehensive performance analysis

被引:2
作者
Jaiswal, Mohita [1 ]
Sharma, Abhishek [1 ]
Saini, Sandeep [1 ]
机构
[1] LNM Inst Informat Technol, Dept Elect & Commun Engn, Jaipur 302031, Rajasthan, India
关键词
Deep neural networks; Sign language recognition; Tiny YOLOv2; Hardware architecture; FPGA; FPGA;
D O I
10.1016/j.vlsi.2024.102287
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we benchmark two automation frameworks, Vitis AI and FINN, for sign language recognition on a Field Programmable Gate Array (FPGA). We conducted an in-depth exploration of both frameworks using Tiny YOLOv2 networks by varying design parameters such as precision, parallelism ratio, etc. Further, a fair baseline comparison is made based on accuracy, speed, and hardware resources. Experimental findings demonstrate that the Vitis AI outperforms the FINN framework and traditional GPU and CPU platforms by achieving significant improvements of 1.08x, 1.7x, and 2.9x in terms of latency. Leveraging Vitis AI, our system achieved a detection speed of 32.7 frames per second (FPS) on the Kria KV260 FPGA with a power consumption rate of 5.6 W and an impressive mean Average Precision (mAP) score of 61.2% on the Hindi Indian Sign Language (ISL) dataset.
引用
收藏
页数:12
相关论文
共 50 条
[1]   Multimodal Neural Network Acceleration on a Hybrid CPU-FPGA Architecture: A Case Study [J].
Ajili, Mehdi Trabelsi ;
Hara-Azumi, Yuko .
IEEE ACCESS, 2022, 10 :9603-9617
[2]  
[Anonymous], 2022, Dpuczdx8g for zynq ultrascale+ mpsocs product guide (pg338) v4.1
[3]   Helping Hearing-Impaired in Emergency Situations: A Deep Learning-Based Approach [J].
Areeb, Qazi Mohammad ;
Maryam ;
Nadeem, Mohammad ;
Alroobaea, Roobaea ;
Anwer, Faisal .
IEEE ACCESS, 2022, 10 :8502-8517
[4]   YOLACT Real-time Instance Segmentation [J].
Bolya, Daniel ;
Zhou, Chong ;
Xiao, Fanyi ;
Lee, Yong Jae .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9156-9165
[5]   A review of hand gesture and sign language recognition techniques [J].
Cheok, Ming Jin ;
Omar, Zaid ;
Jaward, Mohamed Hisham .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (01) :131-153
[6]   Object detection using YOLO: challenges, architectural successors, datasets and applications [J].
Diwan, Tausif ;
Anirudh, G. ;
Tembhurne, Jitendra, V .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (06) :9243-9275
[7]  
Ducasse Q, 2021, Arxiv, DOI arXiv:2102.01341
[8]   A High-Throughput and Power-Efficient FPGA Implementation of YOLO CNN for Object Detection [J].
Duy Thanh Nguyen ;
Tuan Nghia Nguyen ;
Kim, Hyun ;
Lee, Hyuk-Jae .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2019, 27 (08) :1861-1873
[9]   Tinier-YOLO: A Real-Time Object Detection Method for Constrained Environments [J].
Fang, Wei ;
Wang, Lin ;
Ren, Peiming .
IEEE ACCESS, 2020, 8 :1935-1944
[10]  
Fasfous N, 2021, Arxiv, DOI arXiv:2102.03456