Bitrate Adaptation and Guidance With Meta Reinforcement Learning

被引：1

作者：

Bentaleb, Abdelhak ^{[1
]}

Lim, May ^{[2
]}

Akcay, Mehmet N. ^{[3
]}

Begen, Ali C. ^{[3
]}

Zimmermann, Roger ^{[2
]}

机构：

[1] Concordia Univ, Gina Cody Sch Engn & Comp Sci, Montreal, PQ H3G 1M8, Canada

[2] Natl Univ Singapore, Sch Comp, Singapore 119077, Singapore

[3] Ozyegin Univ, TR-34794 Istanbul, Turkiye

来源：

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024年 / 23卷 / 11期

关键词：

Bit rate; Servers; Quality of experience; Performance evaluation; Task analysis; Mobile computing; Training; Adaptive streaming; meta-RL; ABR; CMCD; CMSD; bitrate guidance; quality awareness; VIDEO; MODEL;

D O I：

10.1109/TMC.2024.3376560

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Adaptive bitrate (ABR) schemes enable streaming clients to adapt to time-varying network/device conditions for a stall-free viewing experience. Most ABR schemes use manually tuned heuristics or learning-based methods. Heuristics are easy to implement but do not always perform well, whereas learning-based methods generally perform well but are difficult to deploy on low-resource devices. To make the most out of both worlds, we earlier developed Ahaggar, a learning-based scheme executing on the server side that provides quality-aware bitrate guidance to streaming clients running their own heuristics. Ahaggar's novelty is the meta reinforcement learning approach taking network conditions, clients' statuses and device resolutions, and streamed content as input features to perform bitrate guidance. Ahaggar uses the new Common Media Client/Server Data (CMCD/SD) protocols to exchange the necessary metadata between the servers and clients. While Ahaggar was a significant step forward, in this study, we focus on three open areas, namely, (i) exploring the performance of Ahaggar in a heterogeneous environment including both Ahaggar and non-Ahaggar clients with varied network conditions and device resolutions, and (ii) quantifying the impact of device resolutions on QoE with Ahaggar. We thoroughly investigate these areas and report our findings. We also (iii) discuss the Ahaggar design choices. Experiments on an open-source system show that Ahaggar adapts to unseen conditions fast and outperforms its competitors in several viewer experience metrics.

引用

页码：10378 / 10392

页数：15

共 50 条

[41] Benchmark of Bitrate Adaptation in Video Streaming
Chen, Jessica
Milner, Henry
Stoica, Ion
Zhan, Jibin
ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2021, 13 (04):
[42] Meta Q-network: a combination of reinforcement learning and meta learning
Lu, Min
Wang, Yi
Wang, Wenfeng
INTERNATIONAL JOURNAL OF APPLIED NONLINEAR SCIENCE, 2022, 3 (03) : 179 - 188
[43] KNN-Q Learning Algorithm of Bitrate Adaptation for Video Streaming over HTTP
Lin, HuaiDi
Shen, ZhenYuan
Zhou, HuaKang
Liu, XingGuang
Zhang, Leilei
Xiao, Gang
Cheng, Zhenbo
2020 INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE (ICTC), 2020, : 302 - 306
[44] Automatic Ultrasound Guidance Based on Deep Reinforcement Learning
Jarosik, Piotr
Lewandowski, Marcin
2019 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2019, : 475 - 478
[45] Adversarial Reinforcement Learning for Unsupervised Domain Adaptation
Zhang, Youshan
Ye, Hui
Davison, Brian D.
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 635 - 644
[46] Unsupervised Basis Function Adaptation for Reinforcement Learning
Barker, Edward
Ras, Charl
JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
[47] Computational Missile Guidance: A Deep Reinforcement Learning Approach
He, Shaoming
Shin, Hyo-Sang
Tsourdos, Antonios
JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2021, 18 (08): : 571 - 582
[48] CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning
Liu, Jinxin
Zu, Lipeng
He, Li
Wang, Donglin
CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
[49] Deep Reinforcement Learning for Spacecraft Proximity Operations Guidance
Hovell, Kirk
Ulrich, Steve
JOURNAL OF SPACECRAFT AND ROCKETS, 2021, 58 (02) : 254 - 264
[50] Unsupervised basis function adaptation for reinforcement learning
Barker, Edward
Ras, Charl
Journal of Machine Learning Research, 2019, 20

← 1 2 3 4 5 →