Biased ReLU neural networks

被引:32
|
作者
Liang, XingLong [1 ]
Xu, Jun [1 ]
机构
[1] Harbin Inst Technol, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Biased ReLU; Neural network; PWL network flexibility; ADAPTIVE HINGING HYPERPLANES;
D O I
10.1016/j.neucom.2020.09.050
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks (NN) with rectified linear units (ReLU) have been widely implemented since 2012. In this paper, we describe an activation function called the biased ReLU neuron (BReLU), which is similar to the ReLU. Based on this activation function, we propose the BReLU NN (BRNN). The structure of the BRNN is similar to that of the ReLU network. However, the difference between the two is that the BReLU introduces several biases for each input variable. This allows the BRNN to divide the input space into a greater number of linear regions and improve network flexibility. The BRNN parameters to be estimated are the weight matrices and the bias parameters of the BReLU neurons. The weights are obtained using the backpropagation method. Moreover, we propose a method to compute the bias parameters of the BReLU neurons. In this method, batch normalization is applied to the BRNN, and the variance and mean of the input variables are obtained. Based on these two parameters, the bias parameters are estimated. In addition, we investigate the flexibility of the BRNN. Specifically, we study the number of linear regions and provide the upper bound for the maximum number of linear regions. The results indicate that for the same input dimension, the BRNN divides the input space into a greater number of linear regions than the ReLU network. This explains to a certain extent why the BRNN has the superior approximation ability. Experiments are carried out using five datasets, and the results verify the effectiveness of the proposed method. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:71 / 79
页数:9
相关论文
共 50 条
  • [11] Rule extraction: Using neural networks or for neural networks?
    Zhou, ZH
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2004, 19 (02) : 249 - 253
  • [12] Model Predictive Control of Deep Neural Network Model with ReLU Structure using Mixed Integer Programming
    Mukai, Masakazu
    Degawa, Takuma
    Ogawa, Masatoshi
    Takei, Takayuki
    Akimichi, Toshikado
    Kurita, Shigeaki
    IFAC PAPERSONLINE, 2024, 58 (18): : 65 - 70
  • [13] Adaptive two-layer ReLU neural network: II. Ritz approximation to elliptic PDEs *
    Liu, Min
    Cai, Zhiqiang
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2022, 113 : 103 - 116
  • [14] NEURAL NETWORKS
    Magenreuter, Reinhard
    MATHEMATICS AND INFORMATICS, 2016, 59 (05): : 526 - 538
  • [15] Are analog neural networks better than binary neural networks?
    M. Vidyasagar
    Circuits, Systems and Signal Processing, 1998, 17 : 243 - 270
  • [16] Are analog neural networks better than binary neural networks?
    Vidyasagar, M
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 1998, 17 (02) : 243 - 270
  • [17] Learning Two-Layer ReLU Networks Is Nearly as Easy as Learning Linear Classifiers on Separable Data
    Yang, Qiuling
    Sadeghi, Alireza
    Wang, Gang
    Sun, Jian
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 : 4416 - 4427
  • [18] Mechanical Strength Estimation of Self-Compacting Geopolymer Concrete using ReLU based Deep Neural Network
    Mazumder, Endow Ayar
    Prasad, M. L., V
    ADVANCES IN MATERIALS AND PROCESSING TECHNOLOGIES, 2024, 10 (03) : 2168 - 2185
  • [19] Survey on Robustness Verification of Feedforward Neural Networks and Recurrent Neural Networks
    Liu Y.
    Yang P.-F.
    Zhang L.-J.
    Wu Z.-L.
    Feng Y.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (07): : 1 - 33
  • [20] Adaptive two-layer ReLU neural network: I. Best least-squares approximation
    Liu, Min
    Cai, Zhiqiang
    Chen, Jingshuang
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2022, 113 : 34 - 44