Content-Aware Adaptive Device-Cloud Collaborative Inference for Object Detection

被引:8
作者
Hu, Youbing [1 ]
Li, Zhijun [1 ]
Chen, Yongrui [2 ]
Cheng, Yun [3 ]
Cao, Zhiqiang [1 ]
Liu, Jie [1 ]
机构
[1] Harbin Inst Technol, Fac Comp, Harbin 150000, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
[3] Swiss Fed Inst Technol, Comp Engn & Networks Lab, CH-8000 Zurich, Switzerland
关键词
Internet of Things; Cloud computing; Collaboration; Computational modeling; Object detection; Task analysis; Adaptation models; Adaptive inference; deep neural network (DNN); device-cloud collaborative; object detection; HUMAN ACTION RECOGNITION; NETWORKING; INTELLIGENCE;
D O I
10.1109/JIOT.2023.3279579
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many intelligent applications based on deep neural networks (DNNs) are increasingly running on Internet of Things (IoT) devices. Unfortunately, the computing resources of these IoT devices are limited, which will seriously hinder the widespread deployment of various smart applications. A popular solution is to offload part of computation tasks from the IoT device to cloud by way of device-cloud collaboration. However, existing collaboration approaches may suffer from long network transmission delay or degraded accuracy due to the large amount of intermediate results, bring enormous challenges to the tasks, such as object detection, that require massive computing resources. In this article, we propose an efficient device-cloud collaborative inference (DCCI) object detection framework, which dynamically adjusts the amount of transferred data according to the content of input images. Specifically, a content-aware hard-case discriminator is proposed to automatically classify the input images as hard-cases or simple-cases, the hard-cases are uploaded to the cloud to be processed by a deployed heavyweight model, and the simple cases are processed by a lightweight model deployed to the IoT device, where the lightweight model is automatically compressed based on reinforcement learning according to the resource constraints of the IoT device. Furthermore, a collaborative scheduler based on the runtime load and network transmission capability of IoT devices is proposed to optimize the collaborative computation between IoT devices and the cloud. Extensive experimental evaluations show that compared to the Device-only approach, DCCI can reduce the memory footprint and compute resources of IoT devices by more than 90.0% and 30.87%, respectively. Compared to Cloud-centric, DCCI can save 2.0x of network bandwidth. In addition, compared with the state-of-the-art DNN partitioning method, DCCI can save 1.2x of inference latency, and 1.3x of IoT device energy consumption with the same accuracy constraint.
引用
收藏
页码:19087 / 19101
页数:15
相关论文
共 59 条
[1]  
Ahmad T, 2021, INT J ADV COMPUT SC, V12, P18
[2]  
Ashok A., 2018, P INT C LEARN REPR
[3]   COLLABORATIVE OBJECT DETECTORS ADAPTIVE TO BANDWIDTH AND COMPUTATION [J].
Assine, Juliano S. ;
Santos Filho, J. C. S. ;
Valle, Eduardo .
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, :2839-2843
[4]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]
[5]  
Bonghi R., 2021, Jetson stats
[6]   Reading Wikipedia to Answer Open-Domain Questions [J].
Chen, Danqi ;
Fisch, Adam ;
Weston, Jason ;
Bordes, Antoine .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :1870-1879
[7]   Joint FrontEdgeCloud IoVT Analytics: Resource-Effective Design and Scheduling [J].
Chen, Youjia ;
Zhao, Tiesong ;
Cheng, Peng ;
Ding, Ming ;
Chen, Chang Wen .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (23) :23941-23953
[8]  
Cho KYHY, 2014, Arxiv, DOI [arXiv:1406.1078, DOI 10.3115/V1/D14-1179]
[9]   Serving DNNs in Real Time at Datacenter Scale with Project Brainwave [J].
Chung, Eric ;
Fowers, Jeremy ;
Ovtcharov, Kalin ;
Papamichael, Michael ;
Caulfield, Adrian ;
Massengill, Todd ;
Liu, Ming ;
Lo, Daniel ;
Alkalay, Shlomi ;
Haselman, Michael ;
Abeydeera, Maleen ;
Adams, Logan ;
Angepat, Hari ;
Boehn, Christian ;
Chiou, Derek ;
Firestein, Oren ;
Forin, Alessandro ;
Gatlin, Kang Su ;
Ghandi, Mahdi ;
Heil, Stephen ;
Holohan, Kyle ;
El Husseini, Ahmad ;
Juhasz, Tamas ;
Kagi, Kara ;
Kovvuri, Ratna K. ;
Lanka, Sitaram ;
van Megen, Friedel ;
Mukhortov, Dima ;
Patel, Prerak ;
Perez, Brandon ;
Rapsang, Amanda Grace ;
Reinhardt, Steven K. ;
Rouhani, Bita Darvish ;
Sapek, Adam ;
Seera, Raja ;
Shekar, Sangeetha ;
Sridharan, Balaji ;
Weisz, Gabriel ;
Woods, Lisa ;
Xiao, Phillip Yi ;
Zhang, Dan ;
Zhao, Ritchie ;
Burger, Doug .
IEEE MICRO, 2018, 38 (02) :8-20
[10]  
Elsken T, 2019, J MACH LEARN RES, V20