Optimized Deep Learning Classification Model for Intelligent Edge devices

Cited by: 0
Authors
Naveen, Soumyalatha [1]
Kounte, Manjunath R [2]
Affiliations
[1] Department of Computer Science and Engineering, Manipal Institute of Technology Bengaluru, Manipal Academy of Higher Education, Karnataka, Manipal
[2] School of Electronics and Communication Engineering, REVA University, Karnataka, Bangalore
Keywords
Deep Learning; Edge clusters; heterogeneous; Pruning; Quantization
DOI
10.25103/jestr.173.11
Abstract
Deep learning models enable state-of-the-art accuracy in computer vision applications. However, the deep, computationally expensive, and densely connected architecture of deep neural networks (DNNs) limits their deployment on resource-constrained embedded IoT devices. We propose an efficient neural network compression framework that performs filter pruning, fine-tuning, and 8-bit quantization to reduce computational complexity, inference time, and memory footprint. Furthermore, reducing the bit widths of activations and weights helps produce a compact model for deployment on resource-limited IoT devices such as smartphones. The proposed system is evaluated extensively on the CIFAR-10 dataset for the ResNet34 and VGG16 models. In addition, we examine the efficacy of a larger model. The results show that pruning followed by quantization compresses the neural network while achieving an accuracy of 78.01% for ResNet34 and 82.34% for VGG16, a marginal accuracy loss of less than 1% compared to the baseline model. Further, the number of unique parameters in the model's weight matrix is reduced 80x using k-means clustering along with 8-bit quantization. The study demonstrates that pruning had a minimal impact on ResNet34's accuracy, while VGG16 maintained its accuracy even after pruning. Both models showed a reduced memory footprint after applying k-means clustering and 8-bit quantization, making them more efficient for inference without a significant loss in performance. Applications such as smart traffic management and autonomous vehicles involve deploying edge devices with cameras and sensors at intersections and roadsides to monitor and analyze real-time traffic conditions. The proposed optimized model can be employed for efficient object recognition and classification of vehicles, pedestrians, and traffic signs. © 2024 School of Science, DUTH. All rights reserved.
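The two compression steps named in the abstract, filter pruning and 8-bit quantization, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the pruning criterion, so ranking filters by L1 norm is an assumption here, as are the min/max affine quantization scheme and all function names (`prune_filters`, `quantize_uint8`, `dequantize`). Fine-tuning and k-means clustering are not covered, and each filter is represented as a flat list of floats for simplicity.

```python
def prune_filters(filters, keep_ratio):
    """Keep the filters with the largest L1 norms (assumed criterion).

    `filters` is a list of flat weight lists. Returns the surviving
    filters in their original order, plus their original indices.
    """
    n_keep = max(1, round(len(filters) * keep_ratio))
    norms = [sum(abs(w) for w in f) for f in filters]
    ranked = sorted(range(len(filters)), key=lambda i: norms[i], reverse=True)
    kept = sorted(ranked[:n_keep])
    return [filters[i] for i in kept], kept


def quantize_uint8(values):
    """Affine quantization of floats to integers in 0..255 (assumed scheme)."""
    lo, hi = min(values), max(values)
    scale = (hi - lo) / 255.0 or 1.0  # fall back to 1.0 for constant input
    q = [min(255, max(0, round((v - lo) / scale))) for v in values]
    return q, scale, lo


def dequantize(q, scale, lo):
    """Map 8-bit codes back to approximate float weights."""
    return [lo + scale * x for x in q]


# Toy layer with four filters of three weights each (made-up values).
layer = [
    [0.9, -0.8, 0.7],
    [0.01, -0.02, 0.0],
    [0.5, 0.4, -0.6],
    [0.05, 0.03, -0.01],
]
pruned, kept_idx = prune_filters(layer, keep_ratio=0.5)
flat = [w for f in pruned for w in f]
codes, scale, lo = quantize_uint8(flat)
restored = dequantize(codes, scale, lo)
print("kept filters:", kept_idx)
print("8-bit codes:", codes)
```

In this sketch each surviving weight is stored as one byte plus a shared `scale` and offset per tensor, and the reconstruction error per weight is bounded by about half of `scale`.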
Pages: 88-94
Page count: 6