Bitrate Adaptation and Guidance With Meta Reinforcement Learning

被引:1
|
作者
Bentaleb, Abdelhak [1 ]
Lim, May [2 ]
Akcay, Mehmet N. [3 ]
Begen, Ali C. [3 ]
Zimmermann, Roger [2 ]
机构
[1] Concordia Univ, Gina Cody Sch Engn & Comp Sci, Montreal, PQ H3G 1M8, Canada
[2] Natl Univ Singapore, Sch Comp, Singapore 119077, Singapore
[3] Ozyegin Univ, TR-34794 Istanbul, Turkiye
关键词
Bit rate; Servers; Quality of experience; Performance evaluation; Task analysis; Mobile computing; Training; Adaptive streaming; meta-RL; ABR; CMCD; CMSD; bitrate guidance; quality awareness; VIDEO; MODEL;
D O I
10.1109/TMC.2024.3376560
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adaptive bitrate (ABR) schemes enable streaming clients to adapt to time-varying network/device conditions for a stall-free viewing experience. Most ABR schemes use manually tuned heuristics or learning-based methods. Heuristics are easy to implement but do not always perform well, whereas learning-based methods generally perform well but are difficult to deploy on low-resource devices. To make the most out of both worlds, we earlier developed Ahaggar, a learning-based scheme executing on the server side that provides quality-aware bitrate guidance to streaming clients running their own heuristics. Ahaggar's novelty is the meta reinforcement learning approach taking network conditions, clients' statuses and device resolutions, and streamed content as input features to perform bitrate guidance. Ahaggar uses the new Common Media Client/Server Data (CMCD/SD) protocols to exchange the necessary metadata between the servers and clients. While Ahaggar was a significant step forward, in this study, we focus on three open areas, namely, (i) exploring the performance of Ahaggar in a heterogeneous environment including both Ahaggar and non-Ahaggar clients with varied network conditions and device resolutions, and (ii) quantifying the impact of device resolutions on QoE with Ahaggar. We thoroughly investigate these areas and report our findings. We also (iii) discuss the Ahaggar design choices. Experiments on an open-source system show that Ahaggar adapts to unseen conditions fast and outperforms its competitors in several viewer experience metrics.
引用
收藏
页码:10378 / 10392
页数:15
相关论文
共 50 条
  • [21] Reinforcement Learning Based Adaptive Bitrate Algorithm For Transmitting Panoramic Videos
    Wu, Xiaona
    Li, Xiao
    Tong, Xun
    Xie, Rong
    Song, Li
    2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [22] Reinforcement learning guidance law of Q-learning
    Zhang Q.
    Ao B.
    Zhang Q.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (02): : 414 - 419
  • [23] Meta-learning in Reinforcement Learning
    Schweighofer, N
    Doya, K
    NEURAL NETWORKS, 2003, 16 (01) : 5 - 9
  • [24] Meta Reinforcement Learning with Hebbian Learning
    Wang, Di
    2022 IEEE 13TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2022, : 52 - 58
  • [25] GreenABR: Energy-Aware Adaptive Bitrate Streaming with Deep Reinforcement Learning
    Turkkan, Bekir Oguzhan
    Dai, Ting
    Raman, Adithya
    Kosar, Tevfik
    Chen, Changyou
    Bulut, Muhammed Fatih
    Zola, Jaroslaw
    Sow, Daby
    PROCEEDINGS OF THE 13TH ACM MULTIMEDIA SYSTEMS CONFERENCE, MMSYS 2022, 2022, : 150 - 163
  • [26] Continuous Bitrate & Latency Control with Deep Reinforcement Learning for Live Video Streaming
    Hong, Ruying
    Shen, Qiwei
    Zhang, Lei
    Wang, Jing
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2637 - 2641
  • [27] Improved Robustness and Safety for Pre-Adaptation of Meta Reinforcement Learning with Prior Regularization
    Wen, Lu
    Zhang, Songan
    Tseng, H. Eric
    Singh, Baljeet
    Filev, Dimitar
    Peng, Huei
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8987 - 8994
  • [28] A Multi-Constraint Guidance and Maneuvering Penetration Strategy via Meta Deep Reinforcement Learning
    Zhao, Sibo
    Zhu, Jianwen
    Bao, Weimin
    Li, Xiaoping
    Sun, Haifeng
    DRONES, 2023, 7 (10)
  • [29] Range-Aware Impact Angle Guidance Law With Deep Reinforcement Meta-Learning
    Liang, Chen
    Wang, Weihong
    Liu, Zhenghua
    Lai, Chao
    Wang, Sen
    IEEE ACCESS, 2020, 8 (08): : 152093 - 152104
  • [30] Reinforcement learning for dynamic multimedia adaptation
    Charvillat, Vincent
    Grigoras, Romulus
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2007, 30 (03) : 1034 - 1058