Imitation Learning for Adaptive Video Streaming With Future Adversarial Information Bottleneck Principle

Citations: 0
Authors
Wang, Shuoyao [1]
Lin, Jiawei [1]
Ye, Fangwei [2]
Affiliations
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210095, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Adaptive video streaming; imitation learning; information bottleneck; mixed-integer non-linear programming;
DOI
10.1109/TMC.2024.3437455
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Adaptive video streaming plays a crucial role in ensuring high-quality video streaming services. Despite extensive research on Adaptive BitRate (ABR) techniques, current reinforcement learning (RL)-based ABR algorithms may improve the average Quality of Experience (QoE) but suffer from fluctuating performance in individual video sessions. In this paper, we present a novel approach that combines imitation learning with the information bottleneck technique to learn from the complex offline optimal scenario rather than from inefficient exploration. In particular, we use the deterministic offline bitrate optimization problem, given the future throughput realization, as the expert and formulate it as a mixed-integer non-linear programming (MINLP) problem. To enable large-scale training for improved performance, we propose an alternative optimization algorithm that efficiently solves the formulated MINLP problem. To address the overfitting caused by future information leakage in the MINLP expert, we incorporate an adversarial information bottleneck framework. By compressing the video streaming state into a latent space, we retain only action-relevant information. Additionally, we introduce a future adversarial term to mitigate the influence of future information leakage, where a Model Predictive Control (MPC) policy without any future information is employed as the adverse expert. Experimental results demonstrate the effectiveness of the proposed approach in significantly enhancing the quality of adaptive video streaming, providing a 7.30% average QoE improvement and a 30.01% average ranking reduction.
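To make the idea of an offline expert with full future throughput knowledge concrete, the following is a minimal sketch and not the authors' method. It replaces the MINLP formulation and its optimization algorithm with a brute-force search over a tiny horizon. The bitrate ladder, chunk duration, penalty weights, the linear QoE form, and the function names `session_qoe` and `offline_expert` are all assumptions introduced here for illustration.

```python
from itertools import product

# Every constant below is a hypothetical value chosen for illustration,
# not a setting reported in the paper.
BITRATES_MBPS = [1.0, 2.5, 5.0]   # assumed bitrate ladder
CHUNK_SECONDS = 4.0               # assumed chunk duration
REBUF_PENALTY = 5.0               # assumed weight on rebuffering seconds
SMOOTH_PENALTY = 1.0              # assumed weight on bitrate switches


def session_qoe(levels, throughputs_mbps, start_buffer_s=0.0):
    """Assumed linear QoE: bitrate reward minus rebuffer and switch penalties."""
    buffer_s, qoe, prev_rate = start_buffer_s, 0.0, None
    for level, tput in zip(levels, throughputs_mbps):
        rate = BITRATES_MBPS[level]
        download_s = rate * CHUNK_SECONDS / tput
        # Playback stalls for however long the download outlasts the buffer.
        rebuffer_s = max(0.0, download_s - buffer_s)
        buffer_s = max(0.0, buffer_s - download_s) + CHUNK_SECONDS
        qoe += rate - REBUF_PENALTY * rebuffer_s
        if prev_rate is not None:
            qoe -= SMOOTH_PENALTY * abs(rate - prev_rate)
        prev_rate = rate
    return qoe


def offline_expert(throughputs_mbps):
    """Exhaustive search over bitrate sequences given the whole future trace."""
    horizon = len(throughputs_mbps)
    candidates = product(range(len(BITRATES_MBPS)), repeat=horizon)
    return max(candidates, key=lambda lv: session_qoe(lv, throughputs_mbps))
```

Calling `offline_expert` on a short throughput trace returns one bitrate index per chunk. The search is exponential in the number of chunks, which illustrates why an efficient solver is needed before such expert labels can be generated at training scale.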
Pages: 13670-13683
Page count: 14