Profiling Power Consumption for Deep Learning on Resource Limited Devices

被引:1
作者
Duggan, Aidan [1 ]
Scully, Ted [1 ]
Smith, Niall [1 ]
Giltinan, Alan [1 ]
机构
[1] Munster Technol Univ, Cork, Ireland
来源
ARTIFICIAL INTELLIGENCE XL, AI 2023 | 2023年 / 14381卷
关键词
Deep learning; Energy optimisation; Satellite data analysis; Edge computing;
D O I
10.1007/978-3-031-47994-6_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The introduction of convolutional neural networks (CNN) has had a significant impact on various computer vision tasks. The process of inference, where a CNN takes images as input and produces corresponding predictions, is a complex and resource hungry task that consumes significant power. Originally much of the processing was done on well resourced machines locally or hosted on various cloud platforms but there has been a recent trend towards moving the processing of data closer to where it is produced within resource limited 'edge' devices. It is important to understand the implications and limitations for deployment of a CNN to a device such as an earth observation satellite, which does not have a constant power source. Quantisation is a model optimising technique, where the precision of weights, biases, and activations are reduced, such that they consume less memory and power, and is a common approach to facilitate such deployments. This paper investigates the power consumption behaviour of CNN models from the DenseNet, EfficientNet, MobileNet, ResNet, ConvNeXt & RegNet architecture families, processing imagery on board a Nvidia Jetson Orin Nano platform. It was found that energy consumption varied from 6 mJ to 26 mJ per image for different base (non-quantised) models. Accuracy varied from 69% to 82% and latency varied from 1.3 ms to 7.5 ms per image. The effectiveness of quantisation in reducing the power requirements of CNNs during inference was also investigated, focusing on the use case of deployment to an earth observation satellite. A large difference was found between architectures with some reducing the energy consumption by up to 87% while others achieve less than 10%. The metrics "accuracy-per-joule" and "latency-by-joule" were introduced and used to benchmark and more effectively compare models energy-effectiveness and the impact of quantisation. After quantisation, an improvement in accuracy-per-joule of up to 700% and a latency-by-joule reduction of 99% was achieved.
引用
收藏
页码:129 / 141
页数:13
相关论文
共 24 条
[1]  
Arechiga AP, 2018, PROC NAECON IEEE NAT, P234, DOI 10.1109/NAECON.2018.8556744
[2]  
Buonaiuto N., 2017, Satellite identification imaging for small satellites using NVIDIA
[3]  
Courbariaux M, 2015, Arxiv, DOI arXiv:1412.7024
[4]  
Duggan A., 2023, Appendix Table 3
[5]  
Duggan A., 2023, Appendix Table 2
[6]   Towards the Use of Artificial Intelligence on the Edge in Space Systems: Challenges and Opportunities [J].
Furano, Gianluca ;
Meoni, Gabriele ;
Dunne, Aubrey ;
Moloney, David ;
Ferlet-Cavrois, Veronique ;
Tavoularis, Antonis ;
Byrne, Jonathan ;
Buckley, Leonie ;
Psarakis, Mihalis ;
Voss, Kay-Obbe ;
Fanucci, Luca .
IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2020, 35 (12) :44-56
[7]  
Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, 10.48550/arXiv.1704.04861, DOI 10.48550/ARXIV.1704.04861]
[8]   Onboard Processing With Hybrid and Reconfigurable Computing on Small Satellites [J].
George, Alan D. ;
Wilson, Christopher M. .
PROCEEDINGS OF THE IEEE, 2018, 106 (03) :458-470
[9]   CloudScout: A Deep Neural Network for On-Board Cloud Detection on Hyperspectral Images [J].
Giuffrida, Gianluca ;
Diana, Lorenzo ;
de Gioia, Francesco ;
Benelli, Gionata ;
Meoni, Gabriele ;
Donati, Massimiliano ;
Fanucci, Luca .
REMOTE SENSING, 2020, 12 (14)
[10]  
Han S, 2016, Arxiv, DOI arXiv:1510.00149