TAS: Ternarized Neural Architecture Search for Resource-Constrained Edge Devices

被引：0

作者：

Loni, Mohammad ^{[1
]}

Mousavi, Hamid ^{[1
]}

Riazati, Mohammad ^{[1
]}

Daneshtalab, Masoud ^{[1
]}

Sjodin, Mikael ^{[1
]}

机构：

[1] Malardalen Univ, Sch Innovat Design & Engn, Vasteras, Sweden

来源：

PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022) | 2022年

关键词：

Quantization; Ternary Neural Network; Neural Architecture Search; Embedded Systems;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Ternary Neural Networks (TNNs) compress network weights and activation functions into 2-bit representation resulting in remarkable network compression and energy efficiency. However, there remains a significant gap in accuracy between TNNs and full-precision counterparts. Recent advances in Neural Architectures Search (NAS) promise opportunities in automated optimization for various deep learning tasks. Unfortunately, this area is unexplored for optimizing TNNs. This paper proposes TAS, a framework that drastically reduces the accuracy gap between TNNs and their full-precision counterparts by integrating quantization into the network design. We experienced that directly applying NAS to the ternary domain provides accuracy degradation as the search settings are customized for full-precision networks. To address this problem, we propose (i) a new cell template for ternary networks with maximum gradient propagation; and (ii) a novel learnable quantizer that adaptively relaxes the ternarization mechanism from the distribution of the weights and activation functions. Experimental results reveal that TAS delivers 2.64% higher accuracy and approximate to 2.8x memory saving over competing methods with the same bit-width resolution on the CIFAR-10 dataset. These results suggest that TAS is an effective method that paves the way for the efficient design of the next generation of quantized neural networks.

引用

页码：1115 / 1118

页数：4

共 50 条

[21] Model reduction of feed forward neural networks for resource-constrained devices
Fragkou, Evangelia
Koultouki, Marianna
Katsaros, Dimitrios
APPLIED INTELLIGENCE, 2023, 53 (11) : 14102 - 14127
[22] Tabu search for resource-constrained scheduling
Verhoeven, MGA
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1998, 106 (2-3) : 266 - 276
[23] Tabu search for resource-constrained scheduling
Eindhoven Univ of Technology, Eindhoven, Netherlands
Eur J Oper Res, 2-3 (266-276):
[24] Efficient federated learning on resource-constrained edge devices based on model pruning
Wu, Tingting
Song, Chunhe
Zeng, Peng
COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (06) : 6999 - 7013
[25] Wireless Channel Adaptive DNN Split Inference for Resource-Constrained Edge Devices
Lee, Jaeduk
Lee, Hojung
Choi, Wan
IEEE COMMUNICATIONS LETTERS, 2023, 27 (06) : 1520 - 1524
[26] Post-Quantum Cryptoprocessors Optimized for Edge and Resource-Constrained Devices in IoT
Ebrahimi, Shahriar
Bayat-Sarmadi, Siavash
Mosanaei-Boorani, Hatameh
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (03) : 5500 - 5507
[27] Efficient Privacy-Preserving Federated Learning for Resource-Constrained Edge Devices
Wu, Jindi
Xia, Qi
Li, Qun
2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 191 - 198
[28] FedComp: A Federated Learning Compression Framework for Resource-Constrained Edge Computing Devices
Wu, Donglei
Yang, Weihao
Jin, Haoyu
Zou, Xiangyu
Xia, Wen
Fang, Binxing
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (01) : 230 - 243
[29] Efficient federated learning on resource-constrained edge devices based on model pruning
Tingting Wu
Chunhe Song
Peng Zeng
Complex & Intelligent Systems, 2023, 9 : 6999 - 7013
[30] Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices: A Review
Shuvo, Md. Maruf Hossain
Islam, Syed Kamrul
Cheng, Jianlin
Morshed, Bashir I.
PROCEEDINGS OF THE IEEE, 2023, 111 (01) : 42 - 91

← 1 2 3 4 5 →