Imitation Learning for Adaptive Video Streaming With Future Adversarial Information Bottleneck Principle

被引:0
作者
Wang, Shuoyao [1 ]
Lin, Jiawei [1 ]
Ye, Fangwei [2 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210095, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive video streaming; imitation learning; information bottleneck; mixed-integer non-linear programming;
D O I
10.1109/TMC.2024.3437455
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adaptive video streaming plays a crucial role in ensuring high-quality video streaming services. Despite extensive research efforts devoted to Adaptive BitRate (ABR) techniques, the current reinforcement learning (RL)-based ABR algorithms may benefit the average Quality of Experience (QoE) but suffers from fluctuating performance in individual video sessions. In this paper, we present a novel approach that combines imitation learning with the information bottleneck technique, to learn from the complex offline optimal scenario rather than inefficient exploration. In particular, we leverage the deterministic offline bitrate optimization problem with the future throughput realization as the expert and formulate it as a mixed-integer non-linear programming (MINLP) problem. To enable large-scale training for improved performance, we propose an alternative optimization algorithm that efficiently solves the formulated MINLP problem. To address the overfitting issues due to the future information leakage in MINLP, we incorporate an adversarial information bottleneck framework. By compressing the video streaming state into a latent space, we retain only action-relevant information. Additionally, we introduce a future adversarial term to mitigate the influence of future information leakage, where Model Prediction Control (MPC) policy without any future information is employed as the adverse expert. Experimental results demonstrate the effectiveness of our proposed approach in significantly enhancing the quality of adaptive video streaming, providing a 7.30% average QoE improvement and a 30.01% average ranking reduction.
引用
收藏
页码:13670 / 13683
页数:14
相关论文
共 40 条
  • [1] Oboe: Auto-tuning Video ABR Algorithms to Network Conditions
    Akhtar, Zahaib
    Nam, Yun Seong
    Govindan, Ramesh
    Rao, Sanjay
    Chen, Jessica
    Katz-Bassett, Ethan
    Ribeiro, Bruno
    Zhan, Jibin
    Zhang, Hui
    [J]. PROCEEDINGS OF THE 2018 CONFERENCE OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION (SIGCOMM '18), 2018, : 44 - 58
  • [2] Alt B, 2019, IEEE INFOCOM SER, P1000, DOI [10.1109/infocom.2019.8737418, 10.1109/INFOCOM.2019.8737418]
  • [3] Recent Advancements in End-to-End Autonomous Driving Using Deep Learning: A Survey
    Chib, Pranav Singh
    Singh, Pravendra
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 103 - 118
  • [4] F. C. Commission "Federal communications commission, 2016, Raw data-Measuring broadband america
  • [5] Garg D., 2021, P INT C NEUR INF PRO, P4028
  • [6] Rate-Splitting for Intelligent Reflecting Surface-Aided Multiuser VR Streaming
    Huang, Rui
    Wong, Vincent W. S.
    Schober, Robert
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (05) : 1516 - 1535
  • [7] Huang T., 2023, P IEEE C COMP COMM, P1
  • [8] Learning Tailored Adaptive Bitrate Algorithms to Heterogeneous Network Conditions: A Domain-Specific Priors and Meta-Reinforcement Learning Approach
    Huang, Tianchi
    Zhou, Chao
    Zhang, Rui-Xiao
    Wu, Chenglei
    Sun, Lifeng
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2022, 40 (08) : 2485 - 2503
  • [9] Zwei: A Self-Play Reinforcement Learning Framework for Video Transmission Services
    Huang, Tianchi
    Zhang, Rui-Xiao
    Sun, Lifeng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1350 - 1365
  • [10] Quality-Aware Neural Adaptive Video Streaming With Lifelong Imitation Learning
    Huang, Tianchi
    Zhou, Chao
    Yao, Xin
    Zhang, Rui-Xiao
    Wu, Chenglei
    Yu, Bing
    Sun, Lifeng
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (10) : 2324 - 2342