A New Correntropy-Based Conjugate Gradient Backpropagation Algorithm for Improving Training in Neural Networks

被引:69
作者
Heravi, Ahmad Reza [1 ]
Hodtani, Ghosheh Abed [1 ]
机构
[1] Ferdowsi Univ Mashhad, Dept Elect Engn, Mashhad 9177948974, Iran
关键词
Artificial neural networks; conjugate gradient (CG) descent; convergence; correntropy; mean square error (MSE) methods; optimization methods; GLOBAL CONVERGENCE; MINIMIZATION; CRITERION; DESCENT;
D O I
10.1109/TNNLS.2018.2827778
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mean square error (MSE) is the most prominent criterion in training neural networks and has been employed in numerous learning problems. In this paper, we suggest a group of novel robust information theoretic backpropagation (BP) methods, as correntropy-based conjugate gradient BP (CCG-BP). CCG-BP algorithms converge faster than the common correntropy-based BP algorithms and have better performance than the common CG-BP algorithms based on MSE, especially in nonGaussian environments and in cases with impulsive noise or heavy-tailed distributions noise. In addition, a convergence analysis of this new type of method is particularly considered. Numerical results for several samples of function approximation, synthetic function estimation, and chaotic time series prediction illustrate that our new BP method is more robust than the MSE-based method in the sense of impulsive noise, especially when SNR is low.
引用
收藏
页码:6252 / 6263
页数:12
相关论文
共 60 条
[1]   DESCENT PROPERTY AND GLOBAL CONVERGENCE OF THE FLETCHER REEVES METHOD WITH INEXACT LINE SEARCH [J].
ALBAALI, M .
IMA JOURNAL OF NUMERICAL ANALYSIS, 1985, 5 (01) :121-124
[2]  
[Anonymous], 2006, Sequential quadratic programming, DOI DOI 10.1007/0-387-22742-3_18
[3]  
[Anonymous], 2005, Lectures on Lipschitz Analysis, University of Jyva"\skyla
[4]  
[Anonymous], 2013, Neural networks: a systematic introductionM
[5]  
[Anonymous], 1952, METHODS CONJUGATE GR
[6]  
[Anonymous], 2001, NEURAL NETWORKS COMP
[7]   OPTIMIZATION FOR TRAINING NEURAL NETS [J].
BARNARD, E .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (02) :232-240
[8]   Entropy and Correntropy Against Minimum Square Error in Offline and Online Three-Day Ahead Wind Power Forecasting [J].
Bessa, Ricardo J. ;
Miranda, Vladimiro ;
Gama, Joao .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2009, 24 (04) :1657-1666
[9]  
CHAMBERS JA, 1994, ELECTRON LETT, V30, P1574, DOI 10.1049/el:19941060
[10]   CONJUGATE-GRADIENT ALGORITHM FOR EFFICIENT TRAINING OF ARTIFICIAL NEURAL NETWORKS [J].
CHARALAMBOUS, C .
IEE PROCEEDINGS-G CIRCUITS DEVICES AND SYSTEMS, 1992, 139 (03) :301-310