Finding a succinct multi-layer perceptron having shared weights

被引:0
作者
Tanahashi, Y [1 ]
Chin, XF [1 ]
Saito, K [1 ]
Nakano, R [1 ]
机构
[1] Nagoya Inst Technol, Showa Ku, Nagoya, Aichi 4668555, Japan
来源
PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5 | 2005年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a method to find a succinct neural network having shared weights. We focus on weight sharing. Weight sharing constrains the freedom of weight values and weights are allowed to have one of common weights. A near-zero common weight can be eliminated, called weight pruning. Recently, a weight sharing method called BCW has been proposed. The BCW employs merge and split operations based on 2nd-order optimal criteria, and can escape local optima through bidirectional clustering. However, the BCW assumes a vital network parameter J, the number of hidden units, is given. This paper modifies the BCW to make the procedure faster so that the selection of J based on cross-validation can be done in reasonable cpu time. Our experiments showed that the proposed method can restore the original model for an artificial data set, and finds a small number of common weights and an interesting tendency for a real data set.
引用
收藏
页码:1418 / 1423
页数:6
相关论文
共 50 条
  • [31] Multi-layer Perceptron Based Video Surveillance System
    Harihar, Vijai Kumar
    Sukumaran, Sandeep
    Sirajuddin, Samar
    Sali, Aswathy
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2017, : 490 - 495
  • [32] Seismic data denoising based on multi-layer perceptron
    Wang Q.
    Tang J.
    Zhang L.
    Liu X.
    Xu Z.
    Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2020, 55 (02): : 272 - 281
  • [33] Geno-mathematical identification of the multi-layer perceptron
    Ostermark, Ralf
    NEURAL COMPUTING & APPLICATIONS, 2009, 18 (04) : 331 - 344
  • [34] Battle royale optimizer for training multi-layer perceptron
    Saeid Agahian
    Taymaz Akan
    Evolving Systems, 2022, 13 : 563 - 575
  • [35] Multi-layer perceptron learning in the domain of attributed graphs
    Jain, BJ
    Wysotzki, F
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 1003 - 1008
  • [36] Many-objective training of a multi-layer perceptron
    Koeppen, Mario
    Yoshida, Kaori
    NEURAL NETWORK WORLD, 2007, 17 (06) : 627 - 637
  • [37] A Stochastic Computational Multi-Layer Perceptron with Backward Propagation
    Liu, Yidong
    Liu, Siting
    Wang, Yanzhi
    Lombardi, Fabrizio
    Han, Jie
    IEEE TRANSACTIONS ON COMPUTERS, 2018, 67 (09) : 1273 - 1286
  • [38] Multi-layer perceptron based modelling of nonlinear systems
    Lightbody, G
    Irwin, GW
    FUZZY SETS AND SYSTEMS, 1996, 79 (01) : 93 - 112
  • [39] A novel scheme to determine the architecture of a Multi-layer Perceptron
    Chintalapudi, KK
    Pal, NR
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 2297 - 2302
  • [40] Prediction of zenith tropospheric delay by multi-layer perceptron
    Katsougiannopoulos, S.
    Pikridas, C.
    JOURNAL OF APPLIED GEODESY, 2009, 3 (04) : 223 - 229