Convergence of Deep Learning and Edge Computing using Model Optimization

Cited by: 0
Authors
Babaei, Peyman [1]
Affiliation
[1] Islamic Azad Univ, Comp Dept, West Tehran Branch, Tehran, Iran
Source
PROCEEDINGS OF THE 13TH IRANIAN/3RD INTERNATIONAL MACHINE VISION AND IMAGE PROCESSING CONFERENCE, MVIP | 2024
Keywords
Model optimization; Quantization; Weight pruning; Weight clustering; Edge computing; Deep learning;
DOI
10.1109/MVIP62238.2024.10491145
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Edge systems are undergoing a major shift in computing capability to support artificial intelligence, deep learning, and other complex algorithms. Running deep learning inference on cloud servers introduces response delays, higher communication costs, and data privacy concerns. Significant effort has therefore gone into moving deep learning inference onto edge systems, giving rise to edge intelligence, the intersection of learning and edge computing. Learning models, especially deep convolutional neural networks, have achieved strong results in machine vision, but their high predictive accuracy comes at the cost of considerable computing power and memory. Optimizing these models for deployment on edge systems would enable a new range of real-time edge applications. This paper investigates whether a typical convolutional neural network can be deployed on edge systems with limited computing resources and memory by applying three optimization techniques: quantization, weight pruning, and weight clustering. The results show that combining these techniques in a collaborative algorithm yields, at the cost of a slight decrease in accuracy, a model small enough to be deployed even on microcontrollers.
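The three techniques named in the abstract can be illustrated with a minimal pure-Python sketch operating on a flat list of weights. This is not the paper's implementation: the abstract does not describe its collaborative algorithm, so the ordering used in the test (prune, then cluster, then quantize) is an assumption, and every function name below is hypothetical.

```python
# Illustrative sketch only; names and the processing order are assumptions,
# not taken from the paper.

def prune_by_magnitude(weights, sparsity):
    """Weight pruning: zero the `sparsity` fraction of weights with smallest |w|."""
    n_prune = int(len(weights) * sparsity)
    # Indices sorted by ascending magnitude; the first n_prune are zeroed.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:n_prune]:
        pruned[i] = 0.0
    return pruned


def _nearest(centroids, w):
    """Index of the centroid closest to w."""
    return min(range(len(centroids)), key=lambda k: abs(w - centroids[k]))


def cluster_weights(weights, n_clusters, n_iters=10):
    """Weight clustering: 1-D k-means, then share one centroid per cluster.

    Assumes n_clusters >= 2.
    """
    lo, hi = min(weights), max(weights)
    # Spread the initial centroids linearly over the weight range.
    centroids = [lo + (hi - lo) * k / (n_clusters - 1) for k in range(n_clusters)]
    for _ in range(n_iters):
        assign = [_nearest(centroids, w) for w in weights]
        for k in range(n_clusters):
            members = [w for w, a in zip(weights, assign) if a == k]
            if members:  # leave empty clusters where they are
                centroids[k] = sum(members) / len(members)
    return [centroids[_nearest(centroids, w)] for w in weights]


def quantize_int8(weights):
    """Quantization: affine mapping of floats to integers in [-128, 127]."""
    # Include 0.0 in the range so that zero is exactly representable.
    lo, hi = min(min(weights), 0.0), max(max(weights), 0.0)
    scale = (hi - lo) / 255 or 1.0  # fall back to 1.0 if all weights are zero
    zero_point = round(-128 - lo / scale)
    q = [max(-128, min(127, round(w / scale) + zero_point)) for w in weights]
    return q, scale, zero_point


def dequantize(q, scale, zero_point):
    """Recover approximate float weights from the int8 representation."""
    return [(v - zero_point) * scale for v in q]
```

After such a pipeline, each weight fits in one byte instead of four, and the zeros and repeated values left by pruning and clustering make the weight array more compressible, which is the general reason these techniques reduce model size.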
Pages: 130-135
Number of pages: 6