Bitrate Adaptation and Guidance With Meta Reinforcement Learning

Cited by: 1
Authors
Bentaleb, Abdelhak [1]
Lim, May [2]
Akcay, Mehmet N. [3]
Begen, Ali C. [3]
Zimmermann, Roger [2]
Affiliations
[1] Concordia Univ, Gina Cody Sch Engn & Comp Sci, Montreal, PQ H3G 1M8, Canada
[2] Natl Univ Singapore, Sch Comp, Singapore 119077, Singapore
[3] Ozyegin Univ, TR-34794 Istanbul, Turkiye
Keywords
Bit rate; Servers; Quality of experience; Performance evaluation; Task analysis; Mobile computing; Training; Adaptive streaming; meta-RL; ABR; CMCD; CMSD; bitrate guidance; quality awareness; VIDEO; MODEL
DOI
10.1109/TMC.2024.3376560
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Adaptive bitrate (ABR) schemes enable streaming clients to adapt to time-varying network/device conditions for a stall-free viewing experience. Most ABR schemes use manually tuned heuristics or learning-based methods. Heuristics are easy to implement but do not always perform well, whereas learning-based methods generally perform well but are difficult to deploy on low-resource devices. To make the most out of both worlds, we earlier developed Ahaggar, a learning-based scheme executing on the server side that provides quality-aware bitrate guidance to streaming clients running their own heuristics. Ahaggar's novelty is the meta reinforcement learning approach taking network conditions, clients' statuses and device resolutions, and streamed content as input features to perform bitrate guidance. Ahaggar uses the new Common Media Client/Server Data (CMCD/SD) protocols to exchange the necessary metadata between the servers and clients. While Ahaggar was a significant step forward, in this study, we focus on three open areas, namely, (i) exploring the performance of Ahaggar in a heterogeneous environment including both Ahaggar and non-Ahaggar clients with varied network conditions and device resolutions, and (ii) quantifying the impact of device resolutions on QoE with Ahaggar. We thoroughly investigate these areas and report our findings. We also (iii) discuss the Ahaggar design choices. Experiments on an open-source system show that Ahaggar adapts to unseen conditions fast and outperforms its competitors in several viewer experience metrics.
Pages: 10378-10392
Page count: 15
Related papers
50 in total
  • [1] ADAPTIVE GUIDANCE WITH REINFORCEMENT META LEARNING
    Gaudet, Brian
    Linares, Richard
    SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 4091 - 4109
  • [2] MetaLive: Meta-Reinforcement Learning Based Collective Bitrate Adaptation for Multi-Party Live Streaming
    Yang, Yi
    Li, Xiang
    Xu, Yeting
    Li, Wenzhong
    Hu, Jiangyi
    Xu, Taishan
    Ren, Xiancheng
    Lu, Sanglu
    EURO-PAR 2023: PARALLEL PROCESSING, 2023, 14100 : 65 - 80
  • [3] Adaptive guidance and integrated navigation with reinforcement meta-learning
    Gaudet, Brian
    Linares, Richard
    Furfaro, Roberto
    ACTA ASTRONAUTICA, 2020, 169 : 180 - 190
  • [4] Meta Reinforcement Learning for Sim-to-real Domain Adaptation
    Arndt, Karol
    Hazara, Murtaza
    Ghadirzadeh, Ali
    Kyrki, Ville
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 2725 - 2731
  • [5] Game Adaptation by Using Reinforcement Learning Over Meta Games
    Simão Reis
    Luís Paulo Reis
    Nuno Lau
    Group Decision and Negotiation, 2021, 30 : 321 - 340
  • [6] Game Adaptation by Using Reinforcement Learning Over Meta Games
    Reis, Simao
    Reis, Luis Paulo
    Lau, Nuno
    GROUP DECISION AND NEGOTIATION, 2021, 30 (02) : 321 - 340
  • [7] Deep Reinforcement Meta-learning Guidance with Impact Angle Constraint
    Liang C.
    Wang W.-H.
    Lai C.
    Yuhang Xuebao/Journal of Astronautics, 2021, 42 (05): : 611 - 620
  • [8] Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
    Ren, Zhizhou
    Liu, Anji
    Liang, Yitao
    Peng, Jian
    Ma, Jianzhu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [9] Federated Deep Reinforcement Learning-Based Caching and Bitrate Adaptation for VR Panoramic Video in Clustered MEC Networks
    Li, Yan
    ELECTRONICS, 2022, 11 (23)
  • [10] Learning and Fast Adaptation for Grid Emergency Control via Deep Meta Reinforcement Learning
    Huang, Renke
    Chen, Yujiao
    Yin, Tianzhixi
    Huang, Qiuhua
    Tan, Jie
    Yu, Wenhao
    Li, Xinya
    Li, Ang
    Du, Yan
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2022, 37 (06) : 4168 - 4178