Task-Oriented Video Compressive Streaming for Real-Time Semantic Segmentation

被引:1
|
作者
Xiao, Xuedou [1 ]
Zuo, Yingying [2 ]
Yan, Mingxuan [2 ]
Wang, Wei [2 ]
He, Jianhua [3 ]
Zhang, Qian [4 ]
机构
[1] Wuhan Univ Technol, Sch Nav, Wuhan 430062, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
[3] Essex Univ, Sch Comp Sci & Elect Engn, Colchester CO4 3SQ, England
[4] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Clear Water Bay, Hong Kong, Peoples R China
基金
英国工程与自然科学研究理事会; 中国国家自然科学基金; 欧盟地平线“2020”;
关键词
Image coding; Bandwidth; Streaming media; Semantic segmentation; Accuracy; Servers; Predictive coding; Adaptive streaming; DNN-driven compression; edge computing; semantic segmentation;
D O I
10.1109/TMC.2024.3446185
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-time semantic segmentation (SS) is a major task for various vision-based applications such as self-driving. Due to the limited computing resources and stringent performance requirements, streaming videos from camera-embedded mobile devices to edge servers for SS is a promising approach. While there are increasing efforts on task-oriented video compression, most SS-applicable algorithms apply more uniform compression, as the sensitive regions are less obvious and concentrated. Such processing results in low compression performance and significantly limits the capacity of edge servers supporting real-time SS. In this paper, we propose STAC, a novel task-oriented DNN-driven video compressive streaming algorithm tailed for SS, to strike accuracy-bitrate balance and adapt to time-varying bandwidth. It exploits DNN's gradients as sensitivity metrics for fine-grained spatial adaptive compression and includes a temporal adaptive scheme that integrates spatial adaptation with predictive coding. Furthermore, we design a new bandwidth-aware neural network, serving as a compatible configuration tuner to fit time-varying bandwidth and content. STAC is evaluated in a system with a commodity mobile device and an edge server with real-world network traces. Experiments show that STAC can save up to 63.7-75.2% of bandwidth or improve accuracy by 3.1-9.5% compared to state-of-the-art algorithms, while capable of adapting to time-varying bandwidth.
引用
收藏
页码:14396 / 14413
页数:18
相关论文
共 50 条
  • [31] A lightweight network with attention decoder for real-time semantic segmentation
    Wang, Kang
    Yang, Jinfu
    Yuan, Shuai
    Li, Mingai
    VISUAL COMPUTER, 2022, 38 (07): : 2329 - 2339
  • [32] A Lightweight and Dynamic Convolutional Network for Real-time Semantic Segmentation
    Zhang, Chunyu
    Xu, Fang
    Wu, Chengdong
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4062 - 4067
  • [33] Real-time semantic segmentation with local spatial pixel adjustment
    Xiao, Cunjun
    Hao, Xingjun
    Li, Haibin
    Li, Yaqian
    Zhang, Wenming
    IMAGE AND VISION COMPUTING, 2022, 123
  • [34] Real-time semantic segmentation via sequential knowledge distillation
    Wu, Jipeng
    Ji, Rongrong
    Liu, Jianzhuang
    Xu, Mingliang
    Zheng, Jiawen
    Shao, Ling
    Tian, Qi
    NEUROCOMPUTING, 2021, 439 : 134 - 145
  • [35] Efficient use of recent progresses for Real-time Semantic segmentation
    Safae El Houfi
    Aicha Majda
    Machine Vision and Applications, 2020, 31
  • [36] A Real-Time Road Scene Semantic Segmentation Model Based on Spatial Context Learning
    Xiao, Xiaomei
    Tang, Jialiang
    Lu, Xiaoyan
    Feng, Zhengyong
    Li, Yi
    IEEE ACCESS, 2024, 12 : 178495 - 178506
  • [37] Real-Time Semantic Segmentation via Spatial-Detail Guided Context Propagation
    Hao, Shijie
    Zhou, Yuan
    Guo, Yanrong
    Hong, Richang
    Cheng, Jun
    Wang, Meng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022,
  • [38] Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation
    Peng, Chengli
    Tian, Tian
    Chen, Chen
    Guo, Xiaojie
    Ma, Jiayi
    NEURAL NETWORKS, 2021, 137 : 188 - 199
  • [39] Semantic Segmentation of Panoramic Images for Real-Time Parking Slot Detection
    Lai, Cong
    Yang, Qingyu
    Guo, Yixin
    Bai, Fujun
    Sun, Hongbin
    REMOTE SENSING, 2022, 14 (16)
  • [40] ADSCNet: asymmetric depthwise separable convolution for semantic segmentation in real-time
    Wang, Jiawei
    Xiong, Hongyun
    Wang, Haibo
    Nian, Xiaohong
    APPLIED INTELLIGENCE, 2020, 50 (04) : 1045 - 1056