At-Scale Assessment of Weight Clustering for Energy-Efficient Object Detection Accelerators

Cited by: 0
Authors
Caro, Marti [1, 2]
Tabani, Hamid [1]
Abella, Jaume [1]
Affiliations
[1] Barcelona Supercomputing Center (BSC), Barcelona, Spain
[2] Universitat Politècnica de Catalunya (UPC), Barcelona, Spain
Source
37th Annual ACM Symposium on Applied Computing | 2022
DOI
10.1145/3477314.3507161
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
DNN-based object detection operates on large data volumes to fetch images and DNN weights, which leads to high power and bandwidth demands. Solutions to mitigate those demands, such as weight clustering, are normally studied in limited examples of a much smaller scale than target applications, which makes it difficult to determine the best tradeoff to implement. This paper performs an at-scale (using a real-life application) assessment of weight clustering for a DNN-based object detection system, You Only Look Once (YOLO), considering real driving videos. Our case study shows that, for an Output Stationary accelerator (e.g., a systolic array), restricting weights to between 32 (5-bit) and 256 (8-bit) distinct values preserves the accuracy of the original 32-bit weights of YOLO while decreasing bandwidth requirements to around 30%-40% of the original bandwidth, and overall energy consumption to around 45% of the original consumption. Overall, our case study provides key insights on which to base design decisions for an accelerator for camera-based object detection.
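As a rough illustration of the weight clustering described in the abstract, the sketch below groups 32-bit weights into a small set of shared values and stores a short index per weight. The abstract does not say which clustering algorithm the authors use, so plain 1-D k-means is an assumption here, and the names `cluster_weights`, `index_bits`, and `weight_traffic_ratio` are hypothetical helpers rather than anything from the paper. The ratio computed covers weight indices only and ignores the small codebook.

```python
import math
import numpy as np


def cluster_weights(weights, n_clusters, n_iters=20):
    """Cluster weights into n_clusters shared values with plain 1-D k-means.

    Assumption: k-means stands in for whatever clustering the paper uses.
    Returns (centroids, idx) so that centroids[idx] approximates the weights.
    """
    flat = np.asarray(weights, dtype=np.float32).ravel()
    # Illustrative initialisation: centroids spread evenly over the weight range.
    centroids = np.linspace(flat.min(), flat.max(), n_clusters, dtype=np.float32)
    for _ in range(n_iters):
        # Assign every weight to its nearest centroid.
        idx = np.abs(flat[:, None] - centroids[None, :]).argmin(axis=1)
        # Move each non-empty centroid to the mean of its members.
        for k in range(n_clusters):
            members = flat[idx == k]
            if members.size:
                centroids[k] = members.mean()
    # Final assignment against the updated centroids.
    idx = np.abs(flat[:, None] - centroids[None, :]).argmin(axis=1)
    return centroids, idx


def index_bits(n_clusters):
    """Bits needed to address one of n_clusters shared values."""
    return max(1, math.ceil(math.log2(n_clusters)))


def weight_traffic_ratio(n_clusters, original_bits=32):
    """Fraction of the original per-weight bits still fetched (codebook ignored)."""
    return index_bits(n_clusters) / original_bits


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.normal(0.0, 0.05, size=10_000).astype(np.float32)  # synthetic weights
    for k in (32, 256):
        centroids, idx = cluster_weights(w, k)
        err = float(np.abs(centroids[idx] - w).mean())
        print(k, index_bits(k), weight_traffic_ratio(k), err)
```

Under these assumptions, 32 clusters need 5-bit indices and 256 clusters need 8-bit indices, matching the bit widths quoted in the abstract. The per-weight ratios this gives (5/32 and 8/32) are lower than the 30%-40% bandwidth figure reported, which plausibly also accounts for non-weight traffic such as images; that reading is an inference from the abstract, not something it states.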
Pages: 530-533
Page count: 4