Hidden neuron pruning for multilayer perceptrons using a sensitivity measure

被引:0
作者
Yeung, DS [1 ]
Zeng, XQ [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
来源
2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS | 2002年
关键词
multilayer perceptron; neural networks; pruning; sensitivity;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the design of neural networks, how to choose the proper size of a network for a given task is an important and difficult problem that still deserves further exploration. One popular approach to tackling this problem is that of starting with an oversized network and then pruning it to a smaller size so as to achieve less computational complexity and better performance in generalization. This paper presents a pruning technique, via a quantified sensitivity measure, to remove as many neurons as possible, those with the least relevance, from the hidden layers of a Multilayer Perceptron (MLP). The sensitivity of an individual neuron is defined as the expectation of its output deviation due to expected input deviation with respect to the overall inputs from a continuous interval. The relevance of a neuron is defined as the multiplication of its sensitivity value by the summation of its outgoing weights. The basic idea is to iteratively train the network according to a certain performance criterion and then remove the neurons with the lowest relevance values. The pruning technique is novel in its quantified sensitivity measure. Computer simulations demonstrate that it works well.
引用
收藏
页码:1751 / 1757
页数:7
相关论文
共 14 条
[1]  
BURRASCANO P, 1993, P 1993 INT JOINT C N, V1, P347
[2]   A new pruning heuristic based on variance analysis of sensitivity information [J].
Engelbrecht, AP .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2001, 12 (06) :1386-1399
[3]  
ENGELBRECHT AP, 1996, IEEE INT C NEUR NETW, V2, P1274
[4]   A SIMPLE AND EFFECTIVE METHOD FOR REMOVAL OF HIDDEN UNITS AND WEIGHTS [J].
HAGIWARA, M .
NEUROCOMPUTING, 1994, 6 (02) :207-218
[5]  
Hassibi B., 1993, ADV NEURAL INFORM PR, P164, DOI DOI 10.5555/645753.668069
[6]  
Karnin E D, 1990, IEEE Trans Neural Netw, V1, P239, DOI 10.1109/72.80236
[7]   Constructive algorithms for structure learning in feedforward neural networks for regression problems [J].
Kwok, TY ;
Yeung, DY .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (03) :630-645
[8]  
LeCun Y., 1990, Advances in neural information processing systems, P598
[9]  
Mozer M. C., 1989, ADV NEURAL INFORMATI, P107
[10]  
Pedersen MW, 1996, ADV NEUR IN, V8, P521