Distributed learning strategy based on chips for classification with large-scale dataset

被引:2
作者
Yang, Bo
Su, Xiaohong
Wang, Yadong
机构
[1] IBM Corp, China Res Lab, Beijing 100094, Peoples R China
[2] Sch Comp Sci & Technol, Harbin Inst Technol, Harbin 150001, Peoples R China
关键词
distributed learning strategy; artificial neural network; classification; multiagent; protein secondary structure prediction;
D O I
10.1142/S0218001407005739
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning with very large-scale datasets is always necessary when handling real problems using artificial neural networks. However, it is still an open question how to balance computing efficiency and learning stability, when traditional neural networks spend a large amount of running time and memory to solve a problem with large-scale learning dataset. In this paper, we report the. rst evaluation of neural network distributed-learning strategies in large-scale classification over protein secondary structure. Our accomplishments include: (1) an architecture analysis on distributed-learning, (2) the development of scalable distributed system for large-scale dataset classification, (3) the description of a novel distributed-learning strategy based on chips, (4) a theoretical analysis of distributed-learning strategies for structure-distributed and data-distributed, (5) an investigation and experimental evaluation of distributed-learning strategy based-on chips with respect to time complexity and their effect on the classification accuracy of artificial neural networks. It is demonstrated that the novel distributed-learning strategy is better-balanced in parallel computing efficiency and stability as compared with the previous algorithms. The application of the protein secondary structure prediction demonstrates that this method is feasible and effective in practical applications.
引用
收藏
页码:899 / 920
页数:22
相关论文
共 24 条
  • [1] Two-stage neural network for volume segmentation of medical images
    Ahmed, MN
    Farag, AA
    [J]. PATTERN RECOGNITION LETTERS, 1997, 18 (11-13) : 1143 - 1151
  • [2] ANDREA M, 2005, IEEE T GEOSCI REMOTE, V43, P2084
  • [3] TRAINING A 3-NODE NEURAL NETWORK IS NP-COMPLETE
    BLUM, AL
    RIVEST, RL
    [J]. NEURAL NETWORKS, 1992, 5 (01) : 117 - 127
  • [4] Bouchut Y., 2003, Proceedings of the Fifteenth IASTED Internation Conference on Parallel and Distributed Computing and Systems, P588
  • [5] BRENT F, 2005, INT J PATTERN RECOGN, V19, P249
  • [6] Cardot H., 1994, International Journal of Pattern Recognition and Artificial Intelligence, V8, P679, DOI 10.1142/S021800149400036X
  • [7] ERENSHTEYN R, 1996, P WORKSH INT GEST LA, P185
  • [8] EVRICLEA VM, 1997, IEEE T NEURAL NETWOR, V8, P288
  • [9] Geist A, 1994, PVM PARALLEL VIRTUAL
  • [10] ISA NAM, 2002, P INT C ART INT ENG, P458