An Edge-Side Real-Time Video Analytics System With Dual Computing Resource Control

被引：1

作者：

Hu, Chuang ^{[1
]}

Lu, Rui ^{[2
]}

Sang, Qianlong ^{[1
]}

Liang, Huanghuang ^{[1
]}

Wang, Dan ^{[2
]}

Cheng, Dazhao ^{[1
]}

Zhang, Jin ^{[3
]}

Li, Qing ^{[4
]}

Peng, Junkun ^{[5
]}

机构：

[1] Wuhan Univ, Sch Comp Sci, Wuhan 430000, Peoples R China

[2] Hong Kong Polytech Univ, Hong Kong, Peoples R China

[3] Southern Univ Sci & Technol, Shenzhen 518000, Peoples R China

[4] Peng Cheng Lab, Shenzhen 518000, Peoples R China

[5] Tsinghua Univ, Beijing 100190, Peoples R China

来源：

IEEE TRANSACTIONS ON COMPUTERS | 2023年 / 72卷 / 12期

基金：

国家重点研发计划;

关键词：

Field programmable gate arrays; Visual analytics; Real-time systems; Analytical models; Streaming media; Graphics processing units; Computational modeling; Bandit learning; dual-image FPGA; video analytics; accelerators; middleware; DESIGN;

D O I：

10.1109/TC.2023.3301136

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Video analytics systems conduct video preprocessing to filter out unnecessary frames and model inference using appropriately selected neural networks for high analytics speed. Video preprocessing is instruction-intensive computing (IIC) executed by CPU, and model inference is data-intensive computing (DIC) executed by GPU. In this paper, we show the analytics accuracy of existing systems can largely vary in fields, caused by the dynamic IIC and DIC workloads of different contents in applications. Unfortunately, cameras have fixed CPU/GPU resources and cannot effectively adapt to workload dynamics. We develop Gemini, a new edge-side real-time video analytics system enhanced by a dual-image FPGA. We take the advantage of negligible image switching time of dual-image FPGAs, pre-configure one CPU image and one GPU image and elastically multiplex the dual CPU-GPU resources in time dimension. Gemini requires both hardware and software revisions. In hardware, we overcome challenges of hardware-dependent application development, low communication efficiency between the microprocessor and FPGA, and high programming complexity by hardware abstraction, asynchronous data transfer mechanism and stub-skeleton middleware. In software, we overcome the challenge of adapting to the dynamic workloads by a bandit learning approach. We implement Gemini and show that Gemini can improve the analytics accuracy to 90.35%.

引用

页码：3399 / 3415

页数：17

共 42 条

[1] DECORE: Deep Compression with Reinforcement Learning [J].

Alwani, Manoj ;

Wang, Yang ;

Madhavan, Vashisht .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :12339-12349

[2]

Amazon Web Services, AWS DeepLens-Deep Learning Enabled Video Camera for Developers AWS

[3]

[Anonymous], 2021, Machine learning image and video analysis-Amazon rekognition

[4]

[Anonymous], 2023, Banana Pi M2+

[5]

[Anonymous], 2020, Hikvision Intrusion Detectors

[6]

[Anonymous], 2020, AcuSense Technology

[7]

[Anonymous], 2023, Raspberry Pi 4 Model B

[8]

[Anonymous], 2023, Orange Pi 3 LTS

[9] Toward Multi-FPGA Acceleration of the Neural Networks [J].

Biookaghazadeh, Saman ;

Ravi, Pravin Kumar ;

Zhao, Ming .

ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2021, 17 (02)

[10]

C. of Auburn AL, 2019, City of Auburn Toomer's Corner webcam2

← 1 2 3 4 5 →