Local minima found in the subparameter space can be effective for ensembles of deep convolutional neural networks

被引:10
|
作者
Yang, Yongquan [1 ]
Lv, Haijun [2 ]
Chen, Ning [3 ]
Wu, Yang [4 ]
Zheng, Jiayi [5 ]
Zheng, Zhongxi [1 ]
机构
[1] Sichuan Univ, West China Hosp, Lab Pathol, 37 Guo Xue Rd, Chengdu 610041, Peoples R China
[2] Baidu Co Ltd, AIPE, 701 Na Xian Rd, Shanghai 201210, Peoples R China
[3] Xian Polytech Univ, Sch Elect & Informat, 19 Jin Hua Rd, Xian 710048, Peoples R China
[4] Nara Inst Sci & Technol, Int Collaborat Lab Robot Vis, 8916-5 Takayamacho Ikoma, Nara 6300192, Japan
[5] Univ San Francisco, Premed Sch, San Francisco, CA 94117 USA
关键词
Ensemble learning; Ensemble selection; Ensemble fusion; Deep convolutional neural network; CLASSIFIER; SELECTION; STRATEGY;
D O I
10.1016/j.patcog.2020.107582
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensembles of deep convolutional neural networks (CNNs), which integrate multiple deep CNN models to achieve better generalization for an artificial intelligence application, now play an important role in ensemble learning due to the dominant position of deep learning. However, the usage of ensembles of deep CNNs is still not adequate because the increasing complexity of deep CNN architectures and the emerging data with large dimensionality have made the training stage and testing stage of ensembles of deep CNNs inevitably expensive. To alleviate this situation, we propose a new approach that finds multiple models converging to local minima in subparameter space for ensembles of deep CNNs. The subparameter space here refers to the space constructed by a partial selection of parameters, instead of the entire set of parameters, of a deep CNN architecture. We show that local minima found in the subparameter space of a deep CNN architecture can in fact be effective for ensembles of deep CNNs to achieve better generalization. Moreover, finding local minima in the subparameter space of a deep CNN architecture is more affordable at the training stage, and the multiple models at the found local minima can also be selectively fused to achieve better ensemble generalization while limiting the expense to a single deep CNN model at the testing stage. Demonstrations of MobilenetV2, Resnet50 and InceptionV4 (deep CNN architectures from lightweight to complex) on ImageNet, CIFAR-10 and CIFAR-10 0, respectively, lead us to believe that finding local minima in the subparameter space of a deep CNN architecture could be leveraged to broaden the usage of ensembles of deep CNNs. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:21
相关论文
共 50 条
  • [2] Ensembles of Deep Convolutional Neural Networks for Detecting Melanoma in Dermoscopy Images
    Tziomaka, Melina
    Maglogiannis, Ilias
    COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2021), 2021, 12876 : 523 - 535
  • [3] On learning effective ensembles of deep neural networks for intrusion detection
    Folino, F.
    Folino, G.
    Guarascio, M.
    Pisani, F. S.
    Pontieri, L.
    INFORMATION FUSION, 2021, 72 : 48 - 69
  • [4] Effective Version Space Reduction for Convolutional Neural Networks
    Liu, Jiayu
    Chiotellis, Ioannis
    Triebel, Rudolph
    Cremers, Daniel
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT II, 2021, 12458 : 85 - 100
  • [5] Space Efficient Quantization for Deep Convolutional Neural Networks
    Zhao, Dong-Di
    Li, Fan
    Sharif, Kashif
    Xia, Guang-Min
    Wang, Yu
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2019, 34 (02) : 305 - 317
  • [6] Face Space Representations in Deep Convolutional Neural Networks
    O'Toole, Alice J.
    Castillo, Carlos D.
    Parde, Connor J.
    Hill, Matthew Q.
    Chellappa, Rama
    TRENDS IN COGNITIVE SCIENCES, 2018, 22 (09) : 794 - 809
  • [7] Space Efficient Quantization for Deep Convolutional Neural Networks
    Dong-Di Zhao
    Fan Li
    Kashif Sharif
    Guang-Min Xia
    Yu Wang
    Journal of Computer Science and Technology, 2019, 34 : 305 - 317
  • [8] LEARNING IN NEURAL NETWORKS WITH LOCAL MINIMA
    HESKES, TM
    SLIJPEN, ETP
    KAPPEN, B
    PHYSICAL REVIEW A, 1992, 46 (08): : 5221 - 5231
  • [9] Relay Backpropagation for Effective Learning of Deep Convolutional Neural Networks
    Shen, Li
    Lin, Zhouchen
    Huang, Qingming
    COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 467 - 482
  • [10] Understanding the Effective Receptive Field in Deep Convolutional Neural Networks
    Luo, Wenjie
    Li, Yujia
    Urtasun, Raquel
    Zemel, Richard
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29