Generalization and selection of examples in feedforward neural networks

Cited by: 26

Authors:

Franco, L [1]

Cannas, SA [1]

Affiliations:

[1] Univ Nacl Cordoba, Fac Matemat Astron & Fis, RA-5000 Cordoba, Argentina

Source:

NEURAL COMPUTATION | 2000 / Volume 12 / Issue 10

DOI:

10.1162/089976600300014999

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In this work, we study how the selection of examples affects the learning procedure in a Boolean neural network, and how this effect relates to the complexity of the function under study and to the network architecture. We analyze the generalization capacity for different target functions with particular architectures through an analytical calculation of the minimum number of examples needed to obtain full generalization (i.e., zero generalization error). The analysis of the training sets associated with this parameter leads us to propose a general architecture-independent criterion for the selection of training examples. The criterion was checked through numerical simulations for various particular target functions with particular architectures, as well as for random target functions in a nonoverlapping receptive field perceptron. In all cases, the selection sampling criterion led to an improvement in the generalization capacity compared with pure random sampling. We also show that for the parity problem, one of the most widely used problems for testing learning algorithms, only the use of the whole set of examples ensures global learning in a depth-two architecture. We show that this difficulty can be overcome by considering a tree-structured network of depth 2 log_2(N) - 1.

Pages: 2405-2426

Page count: 22

References (18 in total):

[1] What Size Net Gives Valid Generalization? [J]. Baum, Eric B.; Haussler, David. NEURAL COMPUTATION, 1989, 1 (01): 151-160

[2] NEURAL NET ALGORITHMS THAT LEARN IN POLYNOMIAL-TIME FROM EXAMPLES AND QUERIES [J]. BAUM, EB. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1991, 2 (01): 5-19

[3] Boers E., 1993, P COMP SCI NETH, P87

[4] ARITHMETIC PERCEPTRONS [J]. CANNAS, SA. NEURAL COMPUTATION, 1995, 7 (01): 173-181

[5] COHN D, 1994, MACH LEARN, V15, P201, DOI 10.1007/BF00993277

[6] Neural network exploration using optimal experiment design [J]. Cohn, DA. NEURAL NETWORKS, 1996, 9 (06): 1071-1083

[7] ONLINE LEARNING IN THE COMMITTEE MACHINE [J]. COPELLI, M; CATICHA, N. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (06): 1615-1625

[8] Solving arithmetic problems using feed-forward neural networks [J]. Franco, L; Cannas, SA. NEUROCOMPUTING, 1998, 18 (1-3): 61-79

[9] HANCOCK TR, 1994, MACH LEARN, V16, P161, DOI 10.1023/A:1022637108202

[10] Haykin S., 1994, NEURAL NETWORKS COMP
