TAS: Ternarized Neural Architecture Search for Resource-Constrained Edge Devices

Cited by: 0
Authors
Loni, Mohammad [1]
Mousavi, Hamid [1]
Riazati, Mohammad [1]
Daneshtalab, Masoud [1]
Sjodin, Mikael [1]
Affiliations
[1] Malardalen Univ, Sch Innovat Design & Engn, Vasteras, Sweden
Keywords
Quantization; Ternary Neural Network; Neural Architecture Search; Embedded Systems
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Ternary Neural Networks (TNNs) compress network weights and activations into a 2-bit representation, resulting in remarkable network compression and energy efficiency. However, there remains a significant gap in accuracy between TNNs and their full-precision counterparts. Recent advances in Neural Architecture Search (NAS) promise opportunities for automated optimization in various deep learning tasks. Unfortunately, this area is unexplored for optimizing TNNs. This paper proposes TAS, a framework that drastically reduces the accuracy gap between TNNs and their full-precision counterparts by integrating quantization into the network design. We observed that directly applying NAS to the ternary domain degrades accuracy because the search settings are customized for full-precision networks. To address this problem, we propose (i) a new cell template for ternary networks with maximum gradient propagation; and (ii) a novel learnable quantizer that adaptively relaxes the ternarization mechanism based on the distribution of the weights and activations. Experimental results reveal that TAS delivers 2.64% higher accuracy and approximately 2.8x memory saving over competing methods with the same bit-width resolution on the CIFAR-10 dataset. These results suggest that TAS is an effective method that paves the way for the efficient design of the next generation of quantized neural networks.
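To make the notion of ternarization in the abstract concrete, the following is a minimal sketch of fixed-threshold ternary quantization. It is not the learnable quantizer proposed in TAS, whose details are not given in this record. The function names `ternarize` and `packed_bytes`, the threshold rule `delta = 0.7 * mean(|w|)`, and the sample weights are all assumptions chosen for illustration.

```python
from typing import List, Tuple


def ternarize(weights: List[float], delta_scale: float = 0.7) -> Tuple[List[int], float]:
    """Map real-valued weights to codes in {-1, 0, +1} plus one scaling factor.

    Assumption: a fixed threshold delta = delta_scale * mean(|w|), a common
    heuristic. TAS learns its quantizer instead; that is not reproduced here.
    """
    if not weights:
        return [], 0.0
    mean_abs = sum(abs(w) for w in weights) / len(weights)
    delta = delta_scale * mean_abs
    # Weights inside [-delta, delta] become 0; the rest keep only their sign.
    codes = [1 if w > delta else -1 if w < -delta else 0 for w in weights]
    # The scaling factor is the mean magnitude of the weights that were kept.
    kept = [abs(w) for w, c in zip(weights, codes) if c != 0]
    alpha = sum(kept) / len(kept) if kept else 0.0
    return codes, alpha


def packed_bytes(num_weights: int, bits_per_weight: int = 2) -> int:
    """Bytes needed to store the codes at the given bit-width, rounded up."""
    return (num_weights * bits_per_weight + 7) // 8


# Hypothetical sample weights, for illustration only.
weights = [0.9, -0.05, -0.8, 0.1, 0.4, -0.6]
codes, alpha = ternarize(weights)
print(codes, alpha)
print(packed_bytes(len(weights)), "bytes at 2 bits vs", 4 * len(weights), "bytes at 32 bits")
```

Each weight is approximated as `alpha * code`, so three values fit in 2 bits per weight. Storage alone gives a 16x reduction relative to 32-bit floats in this toy setting; the 2.8x figure in the abstract is a comparison against other methods at the same bit-width, not against full precision.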
Pages: 1115-1118
Number of pages: 4
Related papers
50 in total
  • [21] Model reduction of feed forward neural networks for resource-constrained devices
    Fragkou, Evangelia
    Koultouki, Marianna
    Katsaros, Dimitrios
    APPLIED INTELLIGENCE, 2023, 53 (11) : 14102 - 14127
  • [22] Tabu search for resource-constrained scheduling
    Verhoeven, MGA
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1998, 106 (2-3) : 266 - 276
  • [23] Tabu search for resource-constrained scheduling
    Eindhoven Univ of Technology, Eindhoven, Netherlands
    Eur J Oper Res, 2-3 (266-276):
  • [24] Efficient federated learning on resource-constrained edge devices based on model pruning
    Wu, Tingting
    Song, Chunhe
    Zeng, Peng
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (06) : 6999 - 7013
  • [25] Wireless Channel Adaptive DNN Split Inference for Resource-Constrained Edge Devices
    Lee, Jaeduk
    Lee, Hojung
    Choi, Wan
    IEEE COMMUNICATIONS LETTERS, 2023, 27 (06) : 1520 - 1524
  • [26] Post-Quantum Cryptoprocessors Optimized for Edge and Resource-Constrained Devices in IoT
    Ebrahimi, Shahriar
    Bayat-Sarmadi, Siavash
    Mosanaei-Boorani, Hatameh
    IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (03) : 5500 - 5507
  • [27] Efficient Privacy-Preserving Federated Learning for Resource-Constrained Edge Devices
    Wu, Jindi
    Xia, Qi
    Li, Qun
    2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 191 - 198
  • [28] FedComp: A Federated Learning Compression Framework for Resource-Constrained Edge Computing Devices
    Wu, Donglei
    Yang, Weihao
    Jin, Haoyu
    Zou, Xiangyu
    Xia, Wen
    Fang, Binxing
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (01) : 230 - 243
  • [29] Efficient federated learning on resource-constrained edge devices based on model pruning
    Tingting Wu
    Chunhe Song
    Peng Zeng
    Complex & Intelligent Systems, 2023, 9 : 6999 - 7013
  • [30] Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices: A Review
    Shuvo, Md. Maruf Hossain
    Islam, Syed Kamrul
    Cheng, Jianlin
    Morshed, Bashir I.
    PROCEEDINGS OF THE IEEE, 2023, 111 (01) : 42 - 91