Split Computing With Scalable Feature Compression for Visual Analytics on the Edge

被引:0
作者
Yuan, Zhongzheng [1 ]
Rawlekar, Samyak [2 ]
Garg, Siddharth [1 ]
Erkip, Elza [1 ]
Wang, Yao [1 ]
机构
[1] NYU, Elect & Comp Engn Dept, Tandon Sch Engn, Brooklyn, NY 11201 USA
[2] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
关键词
Image coding; Computational modeling; Task analysis; Servers; Performance evaluation; Bit rate; Analytical models; Computer vision; feature compression; object detection; split computing; COLLABORATIVE INTELLIGENCE;
D O I
10.1109/TMM.2024.3406165
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Running deep visual analytics models for real-time applications is challenging for mobile devices. Offloading the computation to edge server can mitigate computation bottleneck at the mobile device, but may decrease the analytics performance due to the necessity of compressing the image data. We consider a "split computing" system to offload a part of the deep learning model's computation and introduce a novel learned feature compression approach with lightweight computation. We demonstrate the effectiveness of the split computing pipeline in performing computation offloading for the problems of object detection and image classification. Compared to compressing the raw images at the mobile, and running the analytics model on the decompressed images at the server, the proposed feature-compression approach can achieve significantly higher analytics performance at the same bit rate, while reducing the complexity at the mobile. We further propose a scalable feature compression approach, which facilitates adaptation to network bandwidth dynamics, while having comparable performance to the non-scalable approach.
引用
收藏
页码:10121 / 10133
页数:13
相关论文
共 50 条
[31]   Efficient Machine Learning on Edge Computing Through Data Compression Techniques [J].
Larrakoetxea, Nerea Gomez ;
Astobiza, Joseba Eskubi ;
Lopez, Iker Pastor ;
Urquijo, Borja Sanz ;
Barruetabena, Jon Garcia ;
Rego, Agustin Zubillaga .
IEEE ACCESS, 2023, 11 :31676-31685
[32]   Dynamic Distributed Model Compression for Efficient Decentralized Federated Learning and Incentive Provisioning in Edge Computing Networks [J].
Asheralieva, Alia ;
Niyato, Dusit ;
Wei, Xuetao .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (07) :6293-6314
[33]   END-TO-END LEARNED SCALABLE MULTILAYER FEATURE COMPRESSION FOR MACHINE VISION TASKS [J].
Chen, Qiaoxi ;
Gao, Changsheng ;
Liu, Dong .
2024 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2024, :1781-1787
[34]   Learning-Based Scalable Image Compression With Latent-Feature Reuse and Prediction [J].
Mei, Yixin ;
Li, Li ;
Li, Zhu ;
Li, Fan .
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 :4143-4157
[35]   3D Point Cloud Object Detection on Edge Devices for Split Computing [J].
Noguchi, Taisuke ;
Azumi, Takuya .
2024 IEEE 3RD REAL-TIME AND INTELLIGENT EDGE COMPUTING WORKSHOP, RAGE 2024, 2024, :6-11
[36]   Analysis of Visual Sensory System based Edge Computing Approach in IOT [J].
Simsek, Emrah ;
Ozyer, Baris .
29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
[37]   Visual information perception system of coal mine comprehensive excavation working face for edge computing terminal [J].
Zhao, Dongyang ;
Su, Guoyong ;
Wang, Pengyu .
IET IMAGE PROCESSING, 2024, 18 (12) :3681-3698
[38]   Lightweight Online Profiling-Based Configuration Adaptation for Video Analytics System in Edge Computing [J].
Kim, Woo-Joong ;
Youn, Chan-Hyun .
IEEE ACCESS, 2020, 8 :116881-116899
[39]   Edge Computing Resource Management for Cross-Camera Video Analytics: Workload and Model Adaptation [J].
Chen, Huan-Ting ;
Chiang, Yao ;
Wei, Hung-Yu .
IEEE ACCESS, 2024, 12 :12098-12109
[40]   FedComp: A Federated Learning Compression Framework for Resource-Constrained Edge Computing Devices [J].
Wu, Donglei ;
Yang, Weihao ;
Jin, Haoyu ;
Zou, Xiangyu ;
Xia, Wen ;
Fang, Binxing .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (01) :230-243