Bitrate Adaptation and Guidance With Meta Reinforcement Learning

被引:1
|
作者
Bentaleb, Abdelhak [1 ]
Lim, May [2 ]
Akcay, Mehmet N. [3 ]
Begen, Ali C. [3 ]
Zimmermann, Roger [2 ]
机构
[1] Concordia Univ, Gina Cody Sch Engn & Comp Sci, Montreal, PQ H3G 1M8, Canada
[2] Natl Univ Singapore, Sch Comp, Singapore 119077, Singapore
[3] Ozyegin Univ, TR-34794 Istanbul, Turkiye
关键词
Bit rate; Servers; Quality of experience; Performance evaluation; Task analysis; Mobile computing; Training; Adaptive streaming; meta-RL; ABR; CMCD; CMSD; bitrate guidance; quality awareness; VIDEO; MODEL;
D O I
10.1109/TMC.2024.3376560
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adaptive bitrate (ABR) schemes enable streaming clients to adapt to time-varying network/device conditions for a stall-free viewing experience. Most ABR schemes use manually tuned heuristics or learning-based methods. Heuristics are easy to implement but do not always perform well, whereas learning-based methods generally perform well but are difficult to deploy on low-resource devices. To make the most out of both worlds, we earlier developed Ahaggar, a learning-based scheme executing on the server side that provides quality-aware bitrate guidance to streaming clients running their own heuristics. Ahaggar's novelty is the meta reinforcement learning approach taking network conditions, clients' statuses and device resolutions, and streamed content as input features to perform bitrate guidance. Ahaggar uses the new Common Media Client/Server Data (CMCD/SD) protocols to exchange the necessary metadata between the servers and clients. While Ahaggar was a significant step forward, in this study, we focus on three open areas, namely, (i) exploring the performance of Ahaggar in a heterogeneous environment including both Ahaggar and non-Ahaggar clients with varied network conditions and device resolutions, and (ii) quantifying the impact of device resolutions on QoE with Ahaggar. We thoroughly investigate these areas and report our findings. We also (iii) discuss the Ahaggar design choices. Experiments on an open-source system show that Ahaggar adapts to unseen conditions fast and outperforms its competitors in several viewer experience metrics.
引用
收藏
页码:10378 / 10392
页数:15
相关论文
共 50 条
  • [31] Domain Adaptation for Reinforcement Learning on the Atari
    Carr, Thomas
    Chli, Maria
    Vogiatzis, George
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1859 - 1861
  • [32] Meta-IRLSOT++: A meta-inverse reinforcement learning method for fast adaptation of trajectory prediction networks
    Yang, Biao
    Lu, Yanan
    Wan, Rui
    Hu, Hongyu
    Yang, Changchun
    Ni, Rongrong
    Expert Systems with Applications, 2024, 240
  • [33] Hierarchical reinforcement learning guidance with threat avoidance
    Li Bohao
    Wu Yunjie
    Li Guofei
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2022, 33 (05) : 1173 - 1185
  • [34] Probabilistic Counterexample Guidance for Safer Reinforcement Learning
    Ji, Xiaotong
    Filieri, Antonio
    QUANTITATIVE EVALUATION OF SYSTEMS, QEST 2023, 2023, 14287 : 311 - 328
  • [35] Hierarchical reinforcement learning guidance with threat avoidance
    LI Bohao
    WU Yunjie
    LI Guofei
    JournalofSystemsEngineeringandElectronics, 2022, 33 (05) : 1173 - 1185
  • [36] Semantic Guidance of Dialogue Generation with Reinforcement Learning
    Hsueh, Cheng-Hsun
    Ma, Wei-Yun
    SIGDIAL 2020: 21ST ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2020), 2020, : 1 - 9
  • [37] Fast Adaptation with Meta-Reinforcement Learning for Trust Modelling in Human-Robot Interaction
    Gao, Yuan
    Sibirtseva, Elena
    Castellano, Ginevra
    Kragic, Danica
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 305 - 312
  • [38] EVeREst: Bitrate Adaptation for Cloud VR
    Liubogoshchev, Mikhail
    Korneev, Evgeny
    Khorov, Evgeny
    ELECTRONICS, 2021, 10 (06) : 1 - 17
  • [39] Adaptive Bitrate Algorithms via Deep Reinforcement Learning With Digital Twins Assisted Trajectory
    Ye, Jin
    Qin, Shaowen
    Xiao, Qingyu
    Jiang, Wenchao
    Tang, Xin
    Li, Xiaohuan
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (04): : 3522 - 3535
  • [40] Meta-reinforcement learning for adaptive spacecraft guidance during finite-thrust rendezvous missions
    Federici, Lorenzo
    Scorsoglio, Andrea
    Zavoli, Alessandro
    Furfaro, Roberto
    ACTA ASTRONAUTICA, 2022, 201 : 129 - 141