Data-driven performance metrics for neural network learning

被引:1
作者
Alessandri, Angelo [1 ,4 ]
Gaggero, Mauro [2 ]
Sanguineti, Marcello [2 ,3 ]
机构
[1] Univ Genoa, DIME, Genoa, Italy
[2] Natl Res Council Italy, INM, Genoa, Italy
[3] Univ Genoa, DIBRIS, Genoa, Italy
[4] Univ Genoa, DIME, Via Opera Pia 15, I-16145 Genoa, Italy
关键词
Cramer-Rao bound; extended Kalman filter; feedforward neural networks; local minima; neural learning; performance metrics; APPROXIMATE MINIMIZATION; MODELS; BOUNDS; ALGORITHMS;
D O I
10.1002/acs.3701
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Effectiveness of data-driven neural learning in terms of both local mimima trapping and convergence rate is addressed. Such issues are investigated in a case study involving the training of one-hidden-layer feedforward neural networks with the extended Kalman filter, which reduces the search for the optimal network parameters to a state estimation problem, as compared to descent-based methods. In this respect, the performances of the training are assessed by using the Cramer-Rao bound, along with a novel metric based on an empirical criterion to evaluate robustness with respect to local minima trapping. Numerical results are provided to illustrate the performances of the training based on the extended Kalman filter in comparison with gradient-based learning.
引用
收藏
页数:12
相关论文
共 34 条
[1]   A recursive algorithm for nonlinear least-squares problems [J].
Alessandri, A. ;
Cuneo, M. ;
Pagnan, S. ;
Sanguineti, M. .
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2007, 38 (02) :195-216
[2]  
Alessandri A., 2003, On the convergence of EKF-based parameters optimization for neural networks. Proc. IEEE Conf. On Decision and Control, Maui, Hawaii
[3]   Black-Box Modeling and Optimal Control of a Two-Phase Flow Using Level Set Methods [J].
Alessandri, Angelo ;
Bagnerini, Patrizia ;
Gaggero, Mauro ;
Mantelli, Luca ;
Santamaria, Vincenzo ;
Traverso, Alberto .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2022, 30 (02) :520-534
[4]   Optimal Control of Propagating Fronts by Using Level Set Methods and Neural Approximations [J].
Alessandri, Angelo ;
Bagnerini, Patrizia ;
Gaggero, Mauro .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (03) :902-912
[5]   Predictive Control of Container Flows in Maritime Intermodal Terminals [J].
Alessandri, Angelo ;
Cervellera, Cristiano ;
Gaggero, Mauro .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2013, 21 (04) :1423-1431
[6]   Two highly efficient second-order algorithms for training feedforward networks [J].
Ampazis, N ;
Perantonis, SJ .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (05) :1064-1074
[7]  
[Anonymous], 2016, Learning Real and Boolean Functions: when Is Deep Better than Shallow
[8]  
[Anonymous], 1987, PRACTICAL METHODS OP
[9]   UNIVERSAL APPROXIMATION BOUNDS FOR SUPERPOSITIONS OF A SIGMOIDAL FUNCTION [J].
BARRON, AR .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1993, 39 (03) :930-945
[10]   Low-discrepancy sampling for approximate dynamic programming with local approximators [J].
Cervellera, C. ;
Gaggero, M. ;
Maccio, D. .
COMPUTERS & OPERATIONS RESEARCH, 2014, 43 :108-115