Self-Adaptive Layer: An Application of Function Approximation Theory to Enhance Convergence Efficiency in Neural Networks

被引：0

作者：

Chan, Ka-Hou ^{[1
]}

Im, Sio-Kei ^{[2
]}

Ke, Wei ^{[2
]}

机构：

[1] Macao Polytech Inst, Sch Appl Sci, Macau, Peoples R China

[2] Macao Polytech Inst, Macau, Peoples R China

来源：

2020 34TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2020) | 2020年

关键词：

Function Approximation; Orthogonal Polynomial; Self-Adaptive; Neural Network;

D O I：

10.1109/icoin48656.2020.9016534

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Neural networks provide a general architecture to model complex nonlinear systems, but the source data are often mixed with a lot of noise and interference information. One way to offer a smoother alternative for addressing this issue in training is to increase the neural or layer size. In this paper, a new self-adaptive layer is developed to overcome the problems of neural networks so as to achieve faster convergence and avoid local minimum. We incorporate function approximation theory into the layer element arrangement, so that the training process and the network approximation properties can be investigated via linear algebra, where the precision of adaptation can be controlled by the order of polynomials being used. Experimental results show that our proposed layer leads to significantly faster performance in convergence. As a result, this new layer greatly enhances the training accuracy. Moreover, the design and implementation can be easily deployed in most current systems.

引用

页码：447 / 452

页数：6

共 29 条

[1]

Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)

[2] SinP[N]: A Fast Convergence Activation Function for Convolutional Neural Networks [J].

Chan, Ka-Hou ;

Im, Sio-Kei ;

Ke, Wei ;

Lei, Ngan-Lin .

2018 IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING COMPANION (UCC COMPANION), 2018, :365-369

[3]

Chernodub A., 2016, ARXIV160402313

[4]

Cho K., 2014, EMNLP 2014, DOI DOI 10.3115/V1/D14-1179

[5]

Duchi J, 2011, J MACH LEARN RES, V12, P2121

[6]

Fan J., 2008, NONLINEAR TIME SERIE, DOI DOI 10.1007/978-0-387-69395-8

[7] Smooth function approximation using neural networks [J].

Ferrari, S ;

Stengel, RF .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (01) :24-38

[8]

Glorot X., 2010, P INT C ART INT STAT, P249

[9]

Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]

[10] NEURAL NETWORKS AND PHYSICAL SYSTEMS WITH EMERGENT COLLECTIVE COMPUTATIONAL ABILITIES [J].

HOPFIELD, JJ .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1982, 79 (08) :2554-2558

← 1 2 3 →