DAM: Deep Reinforcement Learning based Preload Algorithm with Action Masking for Short Video Streaming

被引：9

作者：

Qian, Si-Ze ^{[1
]}

Xie, Yuhong ^{[1
]}

Pan, Zipeng ^{[1
]}

Zhang, Yuan ^{[2
]}

Lin, Tao ^{[2
]}

机构：

[1] Commun Univ China, Beijing, Peoples R China

[2] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022年

基金：

中国国家自然科学基金;

关键词：

Short video streaming; reinforcement learning; action masking;

D O I：

10.1145/3503161.3551573

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Short video streaming has been increasingly popular in recent years. Due to its unique user behavior of watching and sliding, a critical technique issue is to design a preload algorithm deciding which video chunk to download next, bitrate selection and the pause time, in order to improve user experience while reducing bandwidth wastage. However, designing such a preload algorithm is non-trivial, especially taking into account conflicting goals of improving QoE and reducing bandwidth wastage. In this paper, we propose a deep reinforcement learning-based approach to simultaneously decide the aforementioned three decision variables via learning an optimal policy under a complex environment of varying network conditions and unpredictable user behavior. In particular, we incorporate domain knowledge into the decision procedure via action masking to make decisions more transparent, and accelerate the model training. Experimental results validate the proposed approach significantly outperforms baseline algorithms in terms of QoE metrics and bandwidth wastage.

引用

页码：7030 / 7034

页数：5

共 50 条

[11] A Reinforcement Learning Based Algorithm for Robot Action Planning
Svaco, Marko
Jerbic, Bojan
Polancec, Mateo
Suligoj, Filip
ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, RAAD 2018, 2019, 67 : 493 - 503
[12] DEEP REINFORCEMENT LEARNING FOR VIDEO PREDICTION
Ho, Yung-Han
Cho, Chuan-Yuan
Peng, Wen-Hsiao
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 604 - 608
[13] Deep reinforcement learning based edge computing for video processing
Han, Seung-Yeop
Lee, Hyang-Won
ICT EXPRESS, 2023, 9 (03): : 433 - 438
[14] Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
Yoon, Ui Nyoung
Hong, Myung Duk
Jo, Geun-Sik
SENSORS, 2023, 23 (07)
[15] Fastconv: Fast Learning based Adaptive BitRate Algorithm for Video Streaming
Meng, Linghui
Zhang, Fangyu
Bo, Lei
Lu, Hancheng
Qin, Jin
Han, Jiangping
2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
[16] Deep reinforcement learning-based antilock braking algorithm
Mantripragada, V. Krishna Teja
Kumar, R. Krishna
VEHICLE SYSTEM DYNAMICS, 2023, 61 (05) : 1410 - 1431
[17] Pricing-Based Deep Reinforcement Learning for Live Video Streaming With Joint User Association and Resource Management in Mobile Edge Computing
Chou, Po-Yu
Chen, Wei-Yu
Wang, Chih-Yu
Hwang, Ren-Hung
Chen, Wen-Tsuen
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (06) : 4310 - 4324
[18] A Deep Reinforcement Learning-Based Optimal Transmission Control Method for Streaming Videos
Yang, Yawen
Xiao, Yuxuan
IEEE ACCESS, 2024, 12 : 53088 - 53098
[19] Intelligent Video Streaming at Network Edge: An Attention-Based Multiagent Reinforcement Learning Solution
Tang, Xiangdong
Chen, Fei
He, Yunlong
FUTURE INTERNET, 2023, 15 (07)
[20] Enhancing the Crowdsourced Live Streaming: a Deep Reinforcement Learning Approach
Zhang, Rui-Xiao
Huang, Tianchi
Ma, Ming
Pang, Haitian
Yao, Xin
Wu, Chenglei
Sun, Lifeng
PROCEEDINGS OF THE 29TH ACM WORKSHOP ON NETWORK AND OPERATING SYSTEMS SUPPORT FOR DIGITAL AUDIO AND VIDEO (NOSSDAV'19), 2019, : 55 - 60

← 1 2 3 4 5 →