Designed Dithering Sign Activation for Binary Neural Networks

被引:0
作者
Monroy, Brayan [1 ]
Estupinan, Juan [1 ]
Gelvez-Barrera, Tatiana [1 ]
Bacca, Jorge [1 ]
Arguello, Henry [1 ]
机构
[1] Univ Ind Santander, Dept Comp Sci, Bucaramanga 680002, Colombia
关键词
Kernel; Convolution; Neural networks; Correlation; Quantization (signal); Batch normalization; Optimization; Binary neural networks; binary activations; quantization; dithering; classification tasks;
D O I
10.1109/JSTSP.2024.3467926
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Binary Neural Networks emerged as a cost-effective and energy-efficient solution for computer vision tasks by binarizing either network weights or activations. However, common binary activations, such as the Sign activation function, abruptly binarize the values with a single threshold, losing fine-grained details in the feature outputs. This work proposes an activation that applies multiple thresholds following dithering principles, shifting the Sign activation function for each pixel according to a spatially periodic threshold kernel. Unlike literature methods, the shifting is defined jointly for a set of adjacent pixels, taking advantage of spatial correlations. Experiments over the classification task using both grayscale and RGB datasets demonstrate the effectiveness of the designed dithering Sign activation function as an alternative activation for binary neural networks, without increasing the computational cost. Further, DeSign balances the preservation of details with the efficiency of binary operations.
引用
收藏
页码:1100 / 1107
页数:8
相关论文
共 26 条
  • [1] Deep Coded Aperture Design: An End-to-End Approach for Computational Imaging Tasks
    Bacca, Jorge
    Gelvez-Barrera, Tatiana
    Arguello, Henry
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2021, 7 : 1148 - 1160
  • [2] Deep Learning with Low Precision by Half-wave Gaussian Quantization
    Cai, Zhaowei
    He, Xiaodong
    Sun, Jian
    Vasconcelos, Nuno
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5406 - 5414
  • [3] Courbariaux M, 2016, Arxiv, DOI [arXiv:1602.02830, DOI 10.48550/ARXIV.1602.02830]
  • [4] Courbariaux M, 2015, ADV NEUR IN, V28
  • [5] Regularizing Activation Distribution for Training Binarized Deep Networks
    Ding, Ruizhou
    Chin, Ting-Wu
    Liu, Zeye
    Marculescu, Diana
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11400 - 11409
  • [6] TELEVISION BY PULSE CODE MODULATION
    GOODALL, WM
    [J]. BELL SYSTEM TECHNICAL JOURNAL, 1951, 30 (01): : 33 - 49
  • [7] A review of semantic segmentation using deep neural networks
    Guo, Yanming
    Liu, Yu
    Georgiou, Theodoros
    Lew, Michael S.
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2018, 7 (02) : 87 - 93
  • [8] Holesovsky O., 2018, P 23 COMP VIS WINT W
  • [9] Ioffe Sergey, 2015, Proceedings of Machine Learning Research, V37, P448
  • [10] Kim H., 2020, P INT C LEARN REPR