Imitation Learning for Adaptive Video Streaming With Future Adversarial Information Bottleneck Principle

被引：0

作者：

Wang, Shuoyao ^{[1
]}

Lin, Jiawei ^{[1
]}

Ye, Fangwei ^{[2
]}

机构：

[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China

[2] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210095, Peoples R China

来源：

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024年 / 23卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Adaptive video streaming; imitation learning; information bottleneck; mixed-integer non-linear programming;

D O I：

10.1109/TMC.2024.3437455

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Adaptive video streaming plays a crucial role in ensuring high-quality video streaming services. Despite extensive research efforts devoted to Adaptive BitRate (ABR) techniques, the current reinforcement learning (RL)-based ABR algorithms may benefit the average Quality of Experience (QoE) but suffers from fluctuating performance in individual video sessions. In this paper, we present a novel approach that combines imitation learning with the information bottleneck technique, to learn from the complex offline optimal scenario rather than inefficient exploration. In particular, we leverage the deterministic offline bitrate optimization problem with the future throughput realization as the expert and formulate it as a mixed-integer non-linear programming (MINLP) problem. To enable large-scale training for improved performance, we propose an alternative optimization algorithm that efficiently solves the formulated MINLP problem. To address the overfitting issues due to the future information leakage in MINLP, we incorporate an adversarial information bottleneck framework. By compressing the video streaming state into a latent space, we retain only action-relevant information. Additionally, we introduce a future adversarial term to mitigate the influence of future information leakage, where Model Prediction Control (MPC) policy without any future information is employed as the adverse expert. Experimental results demonstrate the effectiveness of our proposed approach in significantly enhancing the quality of adaptive video streaming, providing a 7.30% average QoE improvement and a 30.01% average ranking reduction.

引用

页码：13670 / 13683

页数：14

共 40 条

[1] Oboe: Auto-tuning Video ABR Algorithms to Network Conditions
Akhtar, Zahaib
Nam, Yun Seong
Govindan, Ramesh
Rao, Sanjay
Chen, Jessica
Katz-Bassett, Ethan
Ribeiro, Bruno
Zhan, Jibin
Zhang, Hui
[J]. PROCEEDINGS OF THE 2018 CONFERENCE OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION (SIGCOMM '18), 2018, : 44 - 58
[2] Alt B, 2019, IEEE INFOCOM SER, P1000, DOI [10.1109/infocom.2019.8737418, 10.1109/INFOCOM.2019.8737418]
[3] Recent Advancements in End-to-End Autonomous Driving Using Deep Learning: A Survey
Chib, Pranav Singh
Singh, Pravendra
[J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 103 - 118
[4] F. C. Commission "Federal communications commission, 2016, Raw data-Measuring broadband america
[5] Garg D., 2021, P INT C NEUR INF PRO, P4028
[6] Rate-Splitting for Intelligent Reflecting Surface-Aided Multiuser VR Streaming
Huang, Rui
Wong, Vincent W. S.
Schober, Robert
[J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (05) : 1516 - 1535
[7] Huang T., 2023, P IEEE C COMP COMM, P1
[8] Learning Tailored Adaptive Bitrate Algorithms to Heterogeneous Network Conditions: A Domain-Specific Priors and Meta-Reinforcement Learning Approach
Huang, Tianchi
Zhou, Chao
Zhang, Rui-Xiao
Wu, Chenglei
Sun, Lifeng
[J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2022, 40 (08) : 2485 - 2503
[9] Zwei: A Self-Play Reinforcement Learning Framework for Video Transmission Services
Huang, Tianchi
Zhang, Rui-Xiao
Sun, Lifeng
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1350 - 1365
[10] Quality-Aware Neural Adaptive Video Streaming With Lifelong Imitation Learning
Huang, Tianchi
Zhou, Chao
Yao, Xin
Zhang, Rui-Xiao
Wu, Chenglei
Yu, Bing
Sun, Lifeng
[J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (10) : 2324 - 2342

← 1 2 3 4 →