Biased ReLU neural networks

Cited by: 32

Authors:

Liang, XingLong [1]

Xu, Jun [1]

Affiliations:

[1] Harbin Inst Technol, Shenzhen 518055, Peoples R China

Source:

NEUROCOMPUTING | 2021 / Vol. 423

Funding:

National Natural Science Foundation of China;

Keywords:

Biased ReLU; Neural network; PWL network flexibility; ADAPTIVE HINGING HYPERPLANES;

DOI:

10.1016/j.neucom.2020.09.050

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Neural networks (NN) with rectified linear units (ReLU) have been widely implemented since 2012. In this paper, we describe an activation function called the biased ReLU neuron (BReLU), which is similar to the ReLU. Based on this activation function, we propose the BReLU NN (BRNN). The structure of the BRNN is similar to that of the ReLU network. The difference is that the BReLU introduces several biases for each input variable. This allows the BRNN to divide the input space into a greater number of linear regions and improves network flexibility. The BRNN parameters to be estimated are the weight matrices and the bias parameters of the BReLU neurons. The weights are obtained using the backpropagation method. Moreover, we propose a method to compute the bias parameters of the BReLU neurons. In this method, batch normalization is applied to the BRNN, and the variance and mean of the input variables are obtained. Based on these two parameters, the bias parameters are estimated. In addition, we investigate the flexibility of the BRNN. Specifically, we study the number of linear regions and provide an upper bound for the maximum number of linear regions. The results indicate that for the same input dimension, the BRNN divides the input space into a greater number of linear regions than the ReLU network. This explains to a certain extent why the BRNN has superior approximation ability. Experiments are carried out using five datasets, and the results verify the effectiveness of the proposed method. © 2020 Elsevier B.V. All rights reserved.
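
The idea of several biases per input variable, with the biases derived from the mean and variance of the inputs, can be illustrated with a minimal NumPy sketch. The abstract gives neither the exact form of the BReLU nor the bias rule, so everything specific here is an assumption: the unit is taken to be max(0, x_i - b_ij), the biases are spaced evenly within one standard deviation of the mean, and the names `estimate_biases`, `brelu`, `num_biases`, and `spread` are hypothetical. It should not be read as the authors' implementation.

```python
import numpy as np

def estimate_biases(x, num_biases=3, spread=1.0):
    """Hypothetical bias rule (an assumption, not taken from the paper):
    place num_biases values evenly in [mean - spread*std, mean + spread*std]
    for each input variable."""
    mean = x.mean(axis=0)                                # shape (d,)
    std = x.std(axis=0)                                  # shape (d,)
    offsets = np.linspace(-spread, spread, num_biases)   # shape (q,)
    return mean[:, None] + std[:, None] * offsets[None, :]   # shape (d, q)

def brelu(x, biases):
    """Assumed BReLU form: max(0, x_i - b_ij) for every input i and bias j.
    x has shape (n, d), biases has shape (d, q); the result has shape (n, d*q)."""
    shifted = x[:, :, None] - biases[None, :, :]         # shape (n, d, q)
    return np.maximum(shifted, 0.0).reshape(x.shape[0], -1)

# Synthetic data: 256 samples of 4 input variables.
rng = np.random.default_rng(0)
x = rng.normal(loc=2.0, scale=0.5, size=(256, 4))

b = estimate_biases(x, num_biases=3)
h = brelu(x, b)
print(b.shape, h.shape)
```

Under these assumptions, each input variable contributes one hinge per bias, so a single variable with q distinct biases already yields up to q + 1 linear pieces along its axis, which is the intuition behind the larger number of linear regions claimed in the abstract.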


Pages: 71-79

Number of pages: 9

