DAM: Deep Reinforcement Learning based Preload Algorithm with Action Masking for Short Video Streaming

Cited by: 9
Authors
Qian, Si-Ze [1]
Xie, Yuhong [1]
Pan, Zipeng [1]
Zhang, Yuan [2]
Lin, Tao [2]
Affiliations
[1] Commun Univ China, Beijing, Peoples R China
[2] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China
Source
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022
Funding
National Natural Science Foundation of China
Keywords
Short video streaming; reinforcement learning; action masking;
DOI
10.1145/3503161.3551573
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Short video streaming has become increasingly popular in recent years. Because users watch and swipe between videos, a critical technical issue is to design a preload algorithm that decides which video chunk to download next, which bitrate to select, and how long to pause, in order to improve user experience while reducing bandwidth wastage. However, designing such a preload algorithm is non-trivial, especially given the conflicting goals of improving QoE and reducing bandwidth wastage. In this paper, we propose a deep reinforcement learning-based approach that decides these three variables simultaneously by learning an optimal policy in a complex environment of varying network conditions and unpredictable user behavior. In particular, we incorporate domain knowledge into the decision procedure via action masking, which makes decisions more transparent and accelerates model training. Experimental results show that the proposed approach significantly outperforms baseline algorithms in terms of QoE metrics and bandwidth wastage.
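As a rough illustration of the action-masking idea mentioned in the abstract, the sketch below zeroes out the probability of disallowed actions before the policy samples one. It is not the authors' implementation: the action space sizes, the names `build_mask` and `masked_softmax`, and the buffer-threshold rule are all assumptions made for this example, and the abstract does not state which masking rules DAM actually uses.

```python
import math
from itertools import product

# Hypothetical joint action space: (video index in the queue, bitrate level, pause in ms).
VIDEOS = range(3)
BITRATES = range(3)
PAUSES_MS = (0, 500)
ACTIONS = list(product(VIDEOS, BITRATES, PAUSES_MS))


def build_mask(buffer_s, max_buffer_s=4.0):
    """Return one boolean per action; True means the action is allowed.

    Illustrative rule (an assumption, not taken from the paper): do not
    preload more chunks of a video whose buffer already reaches max_buffer_s.
    """
    return [buffer_s[video] < max_buffer_s for video, _, _ in ACTIONS]


def masked_softmax(logits, mask):
    """Softmax restricted to allowed actions; masked actions get probability 0."""
    if not any(mask):
        raise ValueError("at least one action must remain valid")
    masked = [x if ok else -math.inf for x, ok in zip(logits, mask)]
    peak = max(masked)  # finite, because at least one action is allowed
    exps = [math.exp(x - peak) for x in masked]  # exp(-inf) evaluates to 0.0
    total = sum(exps)
    return [e / total for e in exps]


logits = [0.1 * i for i in range(len(ACTIONS))]  # stand-in for policy network output
mask = build_mask(buffer_s=[5.0, 1.0, 0.0])      # video 0 is already well buffered
probs = masked_softmax(logits, mask)
```

Because masked actions receive exactly zero probability, the agent never samples them during training or inference, which is one plausible reason masking can both speed up learning and make the policy's choices easier to interpret.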
Pages: 7030-7034
Number of pages: 5
Related papers
50 in total
  • [11] A Reinforcement Learning Based Algorithm for Robot Action Planning
    Svaco, Marko
    Jerbic, Bojan
    Polancec, Mateo
    Suligoj, Filip
    ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, RAAD 2018, 2019, 67 : 493 - 503
  • [12] DEEP REINFORCEMENT LEARNING FOR VIDEO PREDICTION
    Ho, Yung-Han
    Cho, Chuan-Yuan
    Peng, Wen-Hsiao
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 604 - 608
  • [13] Deep reinforcement learning based edge computing for video processing
    Han, Seung-Yeop
    Lee, Hyang-Won
    ICT EXPRESS, 2023, 9 (03): : 433 - 438
  • [14] Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
    Yoon, Ui Nyoung
    Hong, Myung Duk
    Jo, Geun-Sik
    SENSORS, 2023, 23 (07)
  • [15] Fastconv: Fast Learning based Adaptive BitRate Algorithm for Video Streaming
    Meng, Linghui
    Zhang, Fangyu
    Bo, Lei
    Lu, Hancheng
    Qin, Jin
    Han, Jiangping
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [16] Deep reinforcement learning-based antilock braking algorithm
    Mantripragada, V. Krishna Teja
    Kumar, R. Krishna
    VEHICLE SYSTEM DYNAMICS, 2023, 61 (05) : 1410 - 1431
  • [17] Pricing-Based Deep Reinforcement Learning for Live Video Streaming With Joint User Association and Resource Management in Mobile Edge Computing
    Chou, Po-Yu
    Chen, Wei-Yu
    Wang, Chih-Yu
    Hwang, Ren-Hung
    Chen, Wen-Tsuen
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (06) : 4310 - 4324
  • [18] A Deep Reinforcement Learning-Based Optimal Transmission Control Method for Streaming Videos
    Yang, Yawen
    Xiao, Yuxuan
    IEEE ACCESS, 2024, 12 : 53088 - 53098
  • [19] Intelligent Video Streaming at Network Edge: An Attention-Based Multiagent Reinforcement Learning Solution
    Tang, Xiangdong
    Chen, Fei
    He, Yunlong
    FUTURE INTERNET, 2023, 15 (07)
  • [20] Enhancing the Crowdsourced Live Streaming: a Deep Reinforcement Learning Approach
    Zhang, Rui-Xiao
    Huang, Tianchi
    Ma, Ming
    Pang, Haitian
    Yao, Xin
    Wu, Chenglei
    Sun, Lifeng
    PROCEEDINGS OF THE 29TH ACM WORKSHOP ON NETWORK AND OPERATING SYSTEMS SUPPORT FOR DIGITAL AUDIO AND VIDEO (NOSSDAV'19), 2019, : 55 - 60