Increased Leverage of Transprecision Computing for Machine Vision Applications at the Edge

被引：0

作者：

Umar Ibrahim Minhas

JunKyu Lee

Lev Mukhanov

Georgios Karakonstantis

Hans Vandierendonck

Roger Woods

机构：

[1] Queen’s University Belfast,

来源：

Journal of Signal Processing Systems | 2022年 / 94卷

关键词：

Edge Computing; Approximate Computing; Transprecision Computing; Machine Vision;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

The practical deployment of machine vision presents particular challenges for resource constrained edge devices. With a clear need to execute multiple tasks with variable workloads, there is a need for a robust approach that can dynamically adapt at runtime and which can maintain the maximum quality of service (QoS) within the available resource constraints. A lightweight approach that monitors the runtime workload constraints and leverages accuracy-throughput trade-offs on a graphics processing unit (GPU), is presented. It includes optimisation techniques that identify the configurations for each task in terms of optimal accuracy, energy and memory and management of the transparent switching between configurations. Using a neural network architecture search that statically generates a range of implementations that target a resource-precision trade-off, we explore the detection of the optimal parameters for the required QoS under specific memory and energy constraints. For an accuracy loss of 1%, we demonstrate that a 1.6×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.6\times$$\end{document} higher frame processing rate can be achieved on GPU with further improvements possible at further relaxed accuracy. In order to further improve the switching between configurations, we enhance the proposed mechanism by employing central processing units (CPUs) for offloading some of the executed frames, which helps to improve the frame rate by further 0.9%.

引用

页码：1101 / 1118

页数：17

共 23 条

[1]

Kang W(2019)Dms: Dynamic model scaling for quality-aware deep learning inference in mobile and embedded devices IEEE Access 7 168048-168059

[2]

Kim D(2012)Logic and memory design based on unequal error protection for voltage-scalable, robust and adaptive DSP systems Journal of Signal Processing System 68 415-431

[3]

Park J(2018)Energy-efficient iterative refinement using dynamic precision IEEE Journal on Emerging and Selected Topics in Circuits and Systems 8 722-735

[4]

Karakonstantis G(2019)Green fog planning for optimal internet-of-thing task scheduling IEEE Access 8 1224-1234

[5]

Mohapatra D(2018)A scalable and adaptable ilp-based approach for task mapping on mpsoc considering load balance and communication optimization IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 38 1744-1757

[6]

Roy K(undefined)undefined undefined undefined undefined-undefined

[7]

Lee J(undefined)undefined undefined undefined undefined-undefined

[8]

Vandierendonck H(undefined)undefined undefined undefined undefined-undefined

[9]

Arif M(undefined)undefined undefined undefined undefined-undefined

[10]

Peterson GD(undefined)undefined undefined undefined undefined-undefined

← 1 2 3 →