Information geometry on extendable hierarchical large scale neural network model

被引:3
作者
Liu, YH [1 ]
Luo, SW [1 ]
Li, AJ [1 ]
Huang, H [1 ]
机构
[1] No Jiaotong Univ, Dept Comp Sci, Beijing 100044, Peoples R China
来源
2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS | 2003年
关键词
information geometry; extendable hierarchical large scale neural network; dually flat manifold; knowledge increase;
D O I
10.1109/ICMLC.2003.1259707
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, an extendable hierarchical large scale neural network model is developed based on the theoretical analysis of information geometry. In a hierarchical set of systems, a lower order system is included in the parameter space of a larger one as a subset. Such a parameter space has rich geometrical structures that are responsible for the dynamic behaviors of learning. Extendable hierarchical large scale neural network divides a task into small tasks, and each task is fulfilled by a small network under the principle of divide and conquers to improve the performance of a single network. By studying the dual manifold architecture for a family of neural networks and analyzing the hierarchical expansion of this model based on information geometry, the paper proposes a new method to construct the extendable hierarchical large scale neural network model that has knowledge-increasable and structure-extendible ability. The method helps to provide explanation of the transformation mechanism of human recognition system and understand the theory of global architecture of neural network.
引用
收藏
页码:1380 / 1384
页数:5
相关论文
共 10 条
  • [1] Natural gradient works efficiently in learning
    Amari, S
    [J]. NEURAL COMPUTATION, 1998, 10 (02) : 251 - 276
  • [2] AMARI S, 1999, UNPUB IEEE T INF THE
  • [3] AMARI S, 1995, NEURAL NETWORK, V95, P1379
  • [4] Amari S-I., 1985, Lecture Notes in Statistics, P28
  • [5] Adaptive Mixtures of Local Experts
    Jacobs, Robert A.
    Jordan, Michael I.
    Nowlan, Steven J.
    Hinton, Geoffrey E.
    [J]. NEURAL COMPUTATION, 1991, 3 (01) : 79 - 87
  • [6] HIERARCHICAL MIXTURES OF EXPERTS AND THE EM ALGORITHM
    JORDAN, MI
    JACOBS, RA
    [J]. NEURAL COMPUTATION, 1994, 6 (02) : 181 - 214
  • [7] Luo SW, 1998, ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, P1273, DOI 10.1109/ICOSP.1998.770851
  • [8] LUO SW, 1997, 2 SIN GERM WORKSH AD, pP83
  • [9] LUO SW, 2002, ICMLC 02
  • [10] WEN JW, 2002, INFORMATION GEOMETRY