Multi-layer dynamic and asymmetric convolutions

被引:0
|
作者
罗纯杰 [1 ]
ZHAN Jianfeng [2 ]
机构
[1] Institute of Computing Technology,Chinese Academy of Sciences
[2] University of Chinese Academy of Sciences
关键词
D O I
暂无
中图分类号
TP183 [人工神经网络与计算];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dynamic networks have become popular to enhance the model capacity while maintaining efficient inference by dynamically generating the weight based on over-parameters.They bring much more parameters and increase the difficulty of the training.In this paper,a multi-layer dynamic convolution(MDConv) is proposed,which scatters the over-parameters over multi-layers with fewer parameters but stronger model capacity compared with scattering horizontally;it uses the expanding form where the attention is applied to the features to facilitate the training;it uses the compact form where the attention is applied to the weights to maintain efficient inference.Moreover,a multi-layer asymmetric convolution(MAConv) is proposed,which has no extra parameters and computation cost at inference time compared with static convolution.Experimental results show that MDConv achieves better accuracy with fewer parameters and significantly facilitates the training;MAConv enhances the accuracy without any extra cost of storage or computation at inference time compared with static convolution.
引用
收藏
页码:227 / 236
页数:10
相关论文
共 50 条
  • [1] Multi-layer dynamic and asymmetric convolutions
    Luo C.
    Zhan J.
    High Technology Letters, 2022, 28 (03) : 227 - 236
  • [2] Dependence Research on Multi-Layer Convolutions of Images
    Liao, Zhiwu
    Yu, Yong
    Hu, Shaoxiang
    Frontiers in Physics, 2022, 10
  • [3] Dependence Research on Multi-Layer Convolutions of Images
    Liao, Zhiwu
    Yu, Yong
    Hu, Shaoxiang
    FRONTIERS IN PHYSICS, 2022, 10
  • [4] Toward asymmetric multi-layer for distributed monitoring
    Chaiprapa, P
    Uatrongjit, S
    Kantapanit, K
    TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : B541 - B544
  • [5] On the mathematical modeling of symmetric/asymmetric multi-layer orthotropic shells
    Krysko, V. A., Jr.
    Awrejcewicz, J.
    Zhigalov, M., V
    Krysko, V. A.
    INTERNATIONAL JOURNAL OF NON-LINEAR MECHANICS, 2020, 120 (120)
  • [6] Static and dynamic characteristics of a multi-layer electroelastic solid
    S. M. Afonin
    Mechanics of Solids, 2009, 44 : 935 - 950
  • [7] Static and dynamic characteristics of a multi-layer electroelastic solid
    Afonin, S. M.
    MECHANICS OF SOLIDS, 2009, 44 (06) : 935 - 950
  • [8] Multi-Layer Graph Analysis for Dynamic Social Networks
    Oselio, Brandon
    Kulesza, Alex
    Hero, Alfred O., III
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2014, 8 (04) : 514 - 523
  • [9] A Multi-Layer Dynamic Model for Customer Experience Analytics
    Chen, Sining
    Ho, Tin Kam
    Vyas, Avinash
    Cao, Jin
    Spiess, Jeffrey
    BELL LABS TECHNICAL JOURNAL, 2014, 18 (04) : 19 - 32
  • [10] Behavior to Dynamic Loads of Multi-layer Composite Structures
    Geanta, Victor
    Voiculescu, Ionelia
    Chereches, Tudor
    Zecheru, Teodora
    Matache, Liviu
    Rotariu, Adrian
    MATERIALE PLASTICE, 2019, 56 (02) : 460 - 465