Universal Binary Neural Networks Design by Improved Differentiable Neural Architecture Search

被引:5
作者
Tan, Menghao [1 ]
Gao, Weifeng [1 ]
Li, Hong [1 ]
Xie, Jin [1 ]
Gong, Maoguo [2 ]
机构
[1] Xidian Univ, Sch Math & Stat, Xian 710126, Peoples R China
[2] Xidian Univ, Int Res Ctr Intelligent Percept & Computat, Key Lab Intelligent Percept & Image Understanding, Minist Educ, Xian 710071, Peoples R China
关键词
Computer architecture; Quantization (signal); Neural networks; Training; Microprocessors; Convolution; Circuits and systems; Binary neural networks; neural architecture search; search optimization;
D O I
10.1109/TCSVT.2024.3398691
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Binary Neural Networks (BNNs) using 1-bit weights and activations are emerging as a promising approach for mobile devices and edge computing platforms. Concurrently, traditional Neural Architecture Search (NAS) has gained widespread usage in automatically designing network architectures. However, the computation involved in binary NAS is more complex than in NAS due to the substantial information loss incurred by binary modules, and different binary spaces are required for different tasks. To address these challenges, a universal binary neural architecture search (UBNAS) algorithm is proposed. In this paper, the ApproxSign function is used to reduce the gradient error and accelerate the convergence in binary network searching and training. Moreover, UBNAS adopts a novel search space consisting of operations appropriate for the binary methods. To improve the original space operation module, we explore the effect of diverse structures for various modules and ultimately obtain a universal binary network structure. Additionally, the channel sampling ratio is adjusted to balance the advantages of different operations and an early stopping strategy is implemented to significantly reduce the computational burden associated with searching. We perform extensive experiments on CIFAR10, and ImageNet datasets and the results demonstrate the effectiveness of the proposed method.
引用
收藏
页码:9153 / 9165
页数:13
相关论文
共 70 条
[1]   Low-cost low-power in-vehicle occupant detection with mm-wave FMCW radar [J].
Alizadeh, Mostafa ;
Abedi, Hajar ;
Shaker, George .
2019 IEEE SENSORS, 2019,
[2]  
Bethge J, 2020, Arxiv, DOI [arXiv:2001.05936, DOI 10.48550/ARXIV.2001.05936]
[3]  
Bulat Adrian, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12368), P309, DOI 10.1007/978-3-030-58592-1_19
[4]  
Bulat A, 2019, Arxiv, DOI arXiv:1909.13863
[5]  
Cai H, 2019, Arxiv, DOI [arXiv:1812.00332, DOI 10.48550/ARXIV.1812.00332]
[6]   Multiscale Attentive Image De-Raining Networks via Neural Architecture Search [J].
Cai, Lei ;
Fu, Yuli ;
Huo, Wanliang ;
Xiang, Youjun ;
Zhu, Tao ;
Zhang, Ying ;
Zeng, Huanqiang ;
Zeng, Delu .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (02) :618-633
[7]  
Chen H., 2019, arXiv
[8]   Binarized Neural Architecture Search for Efficient Object Recognition [J].
Chen, Hanlin ;
Zhuo, Li'an ;
Zhang, Baochang ;
Zheng, Xiawu ;
Liu, Jianzhuang ;
Ji, Rongrong ;
Doermann, David ;
Guo, Guodong .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (02) :501-516
[9]   Progressive Differentiable Architecture Search: Bridging the Depth Gap between Search and Evaluation [J].
Chen, Xin ;
Xie, Lingxi ;
Wu, Jun ;
Tian, Qi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1294-1303
[10]  
Courbariaux M, 2016, Arxiv, DOI [arXiv:1602.02830, DOI 10.48550/ARXIV.1602.02830]