On different facets of regularization theory

被引:101
作者
Chen, Z [1 ]
Haykin, S [1 ]
机构
[1] McMaster Univ, Commun Res Lab, Adapt Syst Lab, Hamilton, ON L8S 4K1, Canada
关键词
D O I
10.1162/089976602760805296
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This review provides a comprehensive understanding of regularization theory from different perspectives, emphasizing smoothness and simplicity principles. Using the tools of operator theory and Fourier analysis, it is shown that the solution of the classical Tikhonov regularization problem can be derived from the regularized functional defined by a linear differential (integral) operator in the spatial (Fourier) domain. State-of-the-art research relevant to the regularization theory is reviewed, covering Occam's razor, minimum length description, Bayesian theory, pruning algorithms, informational (entropy) theory, statistical learning theory, and equivalent regularization. The universal principle of regularization in terms of Kolmogorov complexity is discussed. Finally, some prospective studies on regularization theory and beyond are suggested.
引用
收藏
页码:2791 / 2846
页数:56
相关论文
共 160 条
[42]   Regularization networks: Fast weight calculation via Kalman filtering [J].
De Nicolao, G ;
Ferrari-Trecate, G .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2001, 12 (02) :228-235
[43]  
DEBNATH L, 1999, INTRO HILBRT SPACES
[44]   UNSUPERVISED MUTUAL INFORMATION CRITERION FOR ELIMINATION OF OVERTRAINING IN SUPERVISED MULTILAYER NETWORKS [J].
DECO, G ;
FINNOFF, W ;
ZIMMERMANN, HG .
NEURAL COMPUTATION, 1995, 7 (01) :86-107
[45]  
Dontchev A.L., 1993, WELL POSED OPTIMIZAT, V1543, P176, DOI DOI 10.1007/BFB0084195
[46]  
Duchon J., 1977, LECT NOTES MATH, P85
[47]   Regularization networks and support vector machines [J].
Evgeniou, T ;
Pontil, M ;
Poggio, T .
ADVANCES IN COMPUTATIONAL MATHEMATICS, 2000, 13 (01) :1-50
[48]   LEARNING AND GENERALIZATION IN RADIAL BASIS FUNCTION NETWORKS [J].
FREEMAN, JAS ;
SAAD, D .
NEURAL COMPUTATION, 1995, 7 (05) :1000-1020
[49]   On a class of support vector kernels based on frames in function Hilbert spaces [J].
Gao, JB ;
Harris, CJ ;
Gunn, SR .
NEURAL COMPUTATION, 2001, 13 (09) :1975-1994
[50]   NEURAL NETWORKS AND THE BIAS VARIANCE DILEMMA [J].
GEMAN, S ;
BIENENSTOCK, E ;
DOURSAT, R .
NEURAL COMPUTATION, 1992, 4 (01) :1-58