Statistical inference: learning in artificial neural networks

Cited by: 15
Authors
Yang, HH
Murata, N
Amari, S
Affiliations
[1] Oregon Grad Inst, Dept Comp Sci, Portland, OR 97291 USA
[2] RIKEN, BSI, Lab Informat Synth, Wako, Saitama 35101, Japan
DOI
10.1016/S1364-6613(97)01114-5
Chinese Library Classification (CLC) number
B84 [Psychology]; C [Social Sciences, General]; Q98 [Anthropology]
Discipline classification code
03; 0303; 030303; 04; 0402
Abstract
Artificial neural networks (ANNs) are widely used to model low-level neural activities and high-level cognitive functions. In this article, we review the application of statistical inference to learning in ANNs. Statistical inference provides an objective way to derive learning algorithms, both for training ANNs and for evaluating the performance of trained ANNs. Solutions to the over-fitting problem by model-selection methods, based on either conventional statistical approaches or a Bayesian approach, are discussed. The use of supervised and unsupervised learning algorithms for ANNs is reviewed. Training a multilayer ANN by supervised learning is equivalent to nonlinear regression. The ensemble methods, bagging and arching, described here, can be applied to combine ANNs to form a new predictor with improved performance. Unsupervised learning algorithms that are derived either from the Hebbian law for bottom-up self-organization or from global objective functions for top-down self-organization are also discussed.
Pages: 4-10
Page count: 7