Biased ReLU neural networks

Cited by: 32
Authors
Liang, XingLong [1]
Xu, Jun [1]
Affiliations
[1] Harbin Inst Technol, Shenzhen 518055, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Biased ReLU; Neural network; PWL network flexibility; Adaptive hinging hyperplanes;
DOI
10.1016/j.neucom.2020.09.050
CLC classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Neural networks (NN) with rectified linear units (ReLU) have been widely implemented since 2012. In this paper, we describe an activation function called the biased ReLU (BReLU) neuron, which is similar to the ReLU. Based on this activation function, we propose the BReLU NN (BRNN). The structure of the BRNN is similar to that of the ReLU network; the difference between the two is that the BReLU introduces several biases for each input variable. This allows the BRNN to divide the input space into a greater number of linear regions and improves network flexibility. The BRNN parameters to be estimated are the weight matrices and the bias parameters of the BReLU neurons. The weights are obtained using the backpropagation method. Moreover, we propose a method to compute the bias parameters of the BReLU neurons: batch normalization is applied to the BRNN, the variance and mean of the input variables are obtained, and the bias parameters are estimated from these two statistics. In addition, we investigate the flexibility of the BRNN. Specifically, we study the number of linear regions and provide an upper bound on the maximum number of linear regions. The results indicate that, for the same input dimension, the BRNN divides the input space into a greater number of linear regions than the ReLU network, which explains to a certain extent why the BRNN has superior approximation ability. Experiments are carried out on five datasets, and the results verify the effectiveness of the proposed method. (c) 2020 Elsevier B.V. All rights reserved.
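As a rough illustration of the ideas summarized above, the NumPy sketch below builds one BReLU layer under two assumptions that are not spelled out in this record: the BReLU neuron is taken to be max(0, x - b) with several bias offsets b per input variable, and the batch-normalization-based rule for choosing the biases is approximated by spreading each variable's biases evenly over its batch mean plus or minus two standard deviations. The function names (brelu_layer, batch_stat_biases) and the two-sigma range are illustrative choices, not the paper's exact method.

import numpy as np


def batch_stat_biases(x, m):
    """Place m bias offsets per input variable from batch statistics.

    A rough stand-in for the batch-normalization-based rule described in
    the abstract: per-feature mean and standard deviation are used to
    spread the biases evenly over mean +/- 2 standard deviations
    (the +/- 2-sigma range is an assumption, not taken from the paper).
    """
    mu = x.mean(axis=0)                       # (d,)
    sigma = x.std(axis=0) + 1e-8              # (d,)
    offsets = np.linspace(-2.0, 2.0, m)       # (m,) evenly spaced offsets
    return mu[:, None] + sigma[:, None] * offsets[None, :]   # (d, m)


def brelu_layer(x, weights, biases):
    """One hypothetical BReLU layer.

    Each input variable x_i is expanded into m shifted ReLU features
    max(0, x_i - b_ij), one per bias b_ij, and the expanded features are
    combined by a linear map. The extra breakpoints per variable are what
    let the BRNN cut the input space into more linear regions than a
    plain ReLU layer of comparable size.
    """
    batch, d = x.shape
    expanded = np.maximum(0.0, x[:, :, None] - biases[None, :, :])  # (batch, d, m)
    expanded = expanded.reshape(batch, -1)                          # (batch, d*m)
    return expanded @ weights                                       # (batch, h)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=(128, 4))             # 128 samples, 4 input variables
    m, h = 3, 8                               # 3 biases per input, 8 hidden units
    biases = batch_stat_biases(x, m)          # (4, 3)
    weights = 0.1 * rng.normal(size=(4 * m, h))
    print(brelu_layer(x, weights, biases).shape)   # -> (128, 8)

In this sketch the bias offsets are fixed from the batch statistics, while the weight matrix would be trained by backpropagation, mirroring the split described in the abstract.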
Pages: 71-79
Number of pages: 9