AdaEvo: Edge-Assisted Continuous and Timely DNN Model Evolution for Mobile Devices

Cited by: 0
Authors
Wang, Lehao [1]
Yu, Zhiwen [1]
Yu, Haoyi [1]
Liu, Sicong [1]
Xie, Yaxiong [2]
Guo, Bin [1]
Liu, Yunxin [3]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci & Engn, Xian 710071, Shaanxi, Peoples R China
[2] Univ Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
[3] Tsinghua Univ, Inst AI Ind Res AIR, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Task analysis; Servers; Artificial neural networks; Quality of experience; Computational modeling; Mobile computing; Processor scheduling; Edge-assisted computing; mobile applications; DNN evolution; task scheduling;
DOI
10.1109/TMC.2023.3316388
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Mobile video applications today have attracted significant attention. Deep learning model (e.g., deep neural network, DNN) compression is widely used to enable on-device inference for facilitating robust and private mobile video applications. The compressed DNN, however, is vulnerable to the agnostic data drift of the live video captured from the dynamically changing mobile scenarios. To combat the data drift, mobile ends rely on edge servers to continuously evolve and re-compress the DNN with freshly collected data. We design a framework, AdaEvo, that efficiently supports the resource-limited edge server handling mobile DNN evolution tasks from multiple mobile ends. The key goal of AdaEvo is to maximize the average quality of experience (QoE), i.e., the proportion of high-quality DNN service time to the entire life cycle, for all mobile ends. Specifically, it estimates the DNN accuracy drops at the mobile end without labels and performs a dedicated video frame sampling strategy to control the size of retraining data. In addition, it balances the limited computing and memory resources on the edge server and the competition between asynchronous tasks initiated by different mobile users. With an extensive evaluation of real-world videos from mobile scenarios and across four diverse mobile tasks, experimental results show that AdaEvo enables up to 34% accuracy improvement and 32% average QoE improvement.
Pages: 2485-2503
Page count: 19
Related papers
21 in total
  • [1] Semantic Extraction Model Selection for IoT Devices in Edge-Assisted Semantic Communications
    Chen, Hong
    Fang, Fang
    Wang, Xianbin
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (07) : 1733 - 1737
  • [2] Edge-Assisted Distributed DNN Collaborative Computing Approach for Mobile Web Augmented Reality in 5G Networks
    Ren, Pei
    Qiao, Xiuquan
    Huang, Yakun
    Liu, Ling
    Dustdar, Schahram
    Chen, Junliang
    IEEE NETWORK, 2020, 34 (02) : 254 - 261
  • [3] Cutting-Edge Inference: Dynamic DNN Model Partitioning and Resource Scaling for Mobile AI
    Lim, Jeong-A
    Lee, Joohyun
    Kwak, Jeongho
    Kim, Yeongjin
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (06) : 3300 - 3316
  • [4] A Lightweight Certificateless Edge-Assisted Encryption for IoT Devices: Enhancing Security and Performance
    Cui, Xinhua
    Tian, Youliang
    Zhang, Xinyu
    Lin, Hongwei
    Li, Mengqian
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (03) : 2930 - 2942
  • [5] Secure Data Deduplication Protocol for Edge-Assisted Mobile CrowdSensing Services
    Li, Jiliang
    Su, Zhou
    Guo, Deke
    Choo, Kim-Kwang Raymond
    Ji, Yusheng
    Pu, Huayan
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (01) : 742 - 753
  • [6] A Co-Scheduling Framework for DNN Models on Mobile and Edge Devices With Heterogeneous Hardware
    Xu, Zhiyuan
    Yang, Dejun
    Yin, Chengxiang
    Tang, Jian
    Wang, Yanzhi
    Xue, Guoliang
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (03) : 1275 - 1288
  • [7] EdgeSaver: Edge-Assisted Energy-Aware Mobile Video Streaming for User Retention Enhancement
    Liao, Hanlong
    Tang, Guoming
    Guo, Deke
    Wu, Kui
    Wu, Yangjing
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (09) : 6550 - 6562
  • [8] Edge-Assisted Real-Time Instance Segmentation for Resource-Limited IoT Devices
    Xie, Yuanyan
    Guo, Yu
    Mi, Zhenqiang
    Yang, Yang
    Obaidat, Mohammad S.
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (01) : 473 - 485
  • [9] Distributed DNN Inference With Fine-Grained Model Partitioning in Mobile Edge Computing Networks
    Li, Hui
    Li, Xiuhua
    Fan, Qilin
    He, Qiang
    Wang, Xiaofei
    Leung, Victor C. M.
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (10) : 9060 - 9074
  • [10] Cooperative Computational Offloading in Mobile Edge Computing for Vehicles: A Model-Based DNN Approach
    Munawar, Suleman
    Ali, Zaiwar
    Waqas, Muhammad
    Tu, Shanshan
    Hassan, Syed Ali
    Abbas, Ghulam
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (03) : 3376 - 3391