Bitrate Adaptation and Guidance With Meta Reinforcement Learning

被引:1
|
作者
Bentaleb, Abdelhak [1 ]
Lim, May [2 ]
Akcay, Mehmet N. [3 ]
Begen, Ali C. [3 ]
Zimmermann, Roger [2 ]
机构
[1] Concordia Univ, Gina Cody Sch Engn & Comp Sci, Montreal, PQ H3G 1M8, Canada
[2] Natl Univ Singapore, Sch Comp, Singapore 119077, Singapore
[3] Ozyegin Univ, TR-34794 Istanbul, Turkiye
关键词
Bit rate; Servers; Quality of experience; Performance evaluation; Task analysis; Mobile computing; Training; Adaptive streaming; meta-RL; ABR; CMCD; CMSD; bitrate guidance; quality awareness; VIDEO; MODEL;
D O I
10.1109/TMC.2024.3376560
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adaptive bitrate (ABR) schemes enable streaming clients to adapt to time-varying network/device conditions for a stall-free viewing experience. Most ABR schemes use manually tuned heuristics or learning-based methods. Heuristics are easy to implement but do not always perform well, whereas learning-based methods generally perform well but are difficult to deploy on low-resource devices. To make the most out of both worlds, we earlier developed Ahaggar, a learning-based scheme executing on the server side that provides quality-aware bitrate guidance to streaming clients running their own heuristics. Ahaggar's novelty is the meta reinforcement learning approach taking network conditions, clients' statuses and device resolutions, and streamed content as input features to perform bitrate guidance. Ahaggar uses the new Common Media Client/Server Data (CMCD/SD) protocols to exchange the necessary metadata between the servers and clients. While Ahaggar was a significant step forward, in this study, we focus on three open areas, namely, (i) exploring the performance of Ahaggar in a heterogeneous environment including both Ahaggar and non-Ahaggar clients with varied network conditions and device resolutions, and (ii) quantifying the impact of device resolutions on QoE with Ahaggar. We thoroughly investigate these areas and report our findings. We also (iii) discuss the Ahaggar design choices. Experiments on an open-source system show that Ahaggar adapts to unseen conditions fast and outperforms its competitors in several viewer experience metrics.
引用
收藏
页码:10378 / 10392
页数:15
相关论文
共 50 条
  • [41] Benchmark of Bitrate Adaptation in Video Streaming
    Chen, Jessica
    Milner, Henry
    Stoica, Ion
    Zhan, Jibin
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2021, 13 (04):
  • [42] Meta Q-network: a combination of reinforcement learning and meta learning
    Lu, Min
    Wang, Yi
    Wang, Wenfeng
    INTERNATIONAL JOURNAL OF APPLIED NONLINEAR SCIENCE, 2022, 3 (03) : 179 - 188
  • [43] KNN-Q Learning Algorithm of Bitrate Adaptation for Video Streaming over HTTP
    Lin, HuaiDi
    Shen, ZhenYuan
    Zhou, HuaKang
    Liu, XingGuang
    Zhang, Leilei
    Xiao, Gang
    Cheng, Zhenbo
    2020 INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE (ICTC), 2020, : 302 - 306
  • [44] Automatic Ultrasound Guidance Based on Deep Reinforcement Learning
    Jarosik, Piotr
    Lewandowski, Marcin
    2019 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2019, : 475 - 478
  • [45] Adversarial Reinforcement Learning for Unsupervised Domain Adaptation
    Zhang, Youshan
    Ye, Hui
    Davison, Brian D.
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 635 - 644
  • [46] Unsupervised Basis Function Adaptation for Reinforcement Learning
    Barker, Edward
    Ras, Charl
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [47] Computational Missile Guidance: A Deep Reinforcement Learning Approach
    He, Shaoming
    Shin, Hyo-Sang
    Tsourdos, Antonios
    JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2021, 18 (08): : 571 - 582
  • [48] CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning
    Liu, Jinxin
    Zu, Lipeng
    He, Li
    Wang, Donglin
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [49] Deep Reinforcement Learning for Spacecraft Proximity Operations Guidance
    Hovell, Kirk
    Ulrich, Steve
    JOURNAL OF SPACECRAFT AND ROCKETS, 2021, 58 (02) : 254 - 264
  • [50] Unsupervised basis function adaptation for reinforcement learning
    Barker, Edward
    Ras, Charl
    Journal of Machine Learning Research, 2019, 20