Discretization-aware architecture search

被引:16
作者
Tian, Yunjie [1 ]
Liu, Chang [1 ]
Xie, Lingxi [2 ]
Jiao, Jianbin [1 ]
Ye, Qixiang [1 ]
机构
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] Huawei Inc, Noahs Ark Lab, Beijing, Peoples R China
关键词
Neural architecture search; Weight-sharing; Discretization-aware; Imbalanced network configuration;
D O I
10.1016/j.patcog.2021.108186
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The search cost of neural architecture search (NAS) has been largely reduced by differentiable architecture search and weight-sharing methods. Such methods optimize a super-network with all possible edges and operations, and determine the optimal sub-network by discretization, i.e., pruning off operations/edges of small weights. However, the discretization process performed on either operations or edges incurs significant inaccuracy and thus the quality of the architecture is not guaranteed. In this paper, we propose discretization-aware architecture search (DA(2)S), and target at pushing the super-network towards the configuration of desired topology. DA(2)S is implemented with an entropy-based loss term, which can be regularized to differentiable architecture search in a plug-and-play fashion. The regularization is controlled by elaborated continuation functions, so that discretization is adaptive to the dynamic change of edges and operations. Experiments on standard image classification benchmarks demonstrate the effectiveness of our approach, in particular, under imbalanced network configurations that were not studied before. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 56 条
  • [31] Li L., 2019, P 35 C UNC ART INT U
  • [32] Microsoft COCO: Common Objects in Context
    Lin, Tsung-Yi
    Maire, Michael
    Belongie, Serge
    Hays, James
    Perona, Pietro
    Ramanan, Deva
    Dollar, Piotr
    Zitnick, C. Lawrence
    [J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 740 - 755
  • [33] Adaptive Linear Span Network for Object Skeleton Detection
    Liu, Chang
    Tian, Yunjie
    Chen, Zhiwen
    Jiao, Jianbin
    Ye, Qixiang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5096 - 5108
  • [34] Liu H., 2019, P INT C LEARN REPR N, P1
  • [35] Liu HaiJing Liu HaiJing, 2018, The Proceedings of the Fifteenth Congress of China Sheep Industry Development Sponsored by the China Animal Husbandry Association in 2018, Henan, China, 10-11 October, 2018, P13
  • [36] Block Proposal Neural Architecture Search
    Liu, Jiaheng
    Zhou, Shunfeng
    Wu, Yichao
    Chen, Ken
    Ouyang, Wanli
    Xu, Dong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 15 - 25
  • [37] SSD: Single Shot MultiBox Detector
    Liu, Wei
    Anguelov, Dragomir
    Erhan, Dumitru
    Szegedy, Christian
    Reed, Scott
    Fu, Cheng-Yang
    Berg, Alexander C.
    [J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 21 - 37
  • [38] Pham H, 2018, PR MACH LEARN RES, V80
  • [39] Real E, 2019, AAAI CONF ARTIF INTE, P4780
  • [40] Siems J., 2020, ABS2008097