DAM: Deep Reinforcement Learning based Preload Algorithm with Action Masking for Short Video Streaming

被引:9
|
作者
Qian, Si-Ze [1 ]
Xie, Yuhong [1 ]
Pan, Zipeng [1 ]
Zhang, Yuan [2 ]
Lin, Tao [2 ]
机构
[1] Commun Univ China, Beijing, Peoples R China
[2] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022年
基金
中国国家自然科学基金;
关键词
Short video streaming; reinforcement learning; action masking;
D O I
10.1145/3503161.3551573
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Short video streaming has been increasingly popular in recent years. Due to its unique user behavior of watching and sliding, a critical technique issue is to design a preload algorithm deciding which video chunk to download next, bitrate selection and the pause time, in order to improve user experience while reducing bandwidth wastage. However, designing such a preload algorithm is non-trivial, especially taking into account conflicting goals of improving QoE and reducing bandwidth wastage. In this paper, we propose a deep reinforcement learning-based approach to simultaneously decide the aforementioned three decision variables via learning an optimal policy under a complex environment of varying network conditions and unpredictable user behavior. In particular, we incorporate domain knowledge into the decision procedure via action masking to make decisions more transparent, and accelerate the model training. Experimental results validate the proposed approach significantly outperforms baseline algorithms in terms of QoE metrics and bandwidth wastage.
引用
收藏
页码:7030 / 7034
页数:5
相关论文
共 50 条
  • [1] Reinforcement-Learning Based Preload Strategy for Short Video
    Ren, Zhicheng
    Shan, Yongxin
    Jiang, Wanchun
    Shan, Yijing
    Shan, Danfeng
    Wang, Jianxin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 327 - 339
  • [2] TCLiVi: Transmission Control in Live Video Streaming Based on Deep Reinforcement Learning
    Cui, Laizhong
    Su, Dongyuan
    Yang, Shu
    Wang, Zhi
    Ming, Zhong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 651 - 663
  • [3] LiveClip: Towards Intelligent Mobile Short-Form Video Streaming with Deep Reinforcement Learning
    He, Jianchao
    Hu, Miao
    Zhou, Yipeng
    Wu, Di
    NOSSDAV '20: PROCEEDINGS OF THE 2020 WORKSHOP ON NETWORK AND OPERATING SYSTEM SUPPORT FOR DIGITAL AUDIO AND VIDEO, 2020, : 54 - 59
  • [4] End-to-End Streaming Video Temporal Action Segmentation With Reinforcement Learning
    Zhang, Jin-Rong
    Wen, Wu-Jun
    Liu, Sheng-Lan
    Huang, Gao
    Li, Yun-Heng
    Li, Qi-Feng
    Feng, Lin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,
  • [5] Intelligent Caching for Mobile Video Streaming in Vehicular Networks with Deep Reinforcement Learning
    Luo, Zhaohui
    Liwang, Minghui
    APPLIED SCIENCES-BASEL, 2022, 12 (23):
  • [6] Reinforcement learning-based rate adaptation in dynamic video streaming
    Hafez, N. A.
    Hassan, M. S.
    Landolsi, T.
    TELECOMMUNICATION SYSTEMS, 2023, 83 (04) : 395 - 407
  • [7] Reinforcement learning-based rate adaptation in dynamic video streaming
    N. A. Hafez
    M. S. Hassan
    T. Landolsi
    Telecommunication Systems, 2023, 83 : 395 - 407
  • [8] Adaptive Video Streaming via Deep Reinforcement Learning from User Trajectory Preferences
    Xiao, Qingyu
    Ye, Jin
    Pang, Chengjie
    Ma, Liangdi
    Jiang, Wengchao
    2020 IEEE 39TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2020,
  • [9] Startup delay aware short video ordering: Problem, model, and a reinforcement learning based algorithm
    Gao, Zhipeng
    Li, Chunxi
    Zhao, Yongxiang
    Zhang, Baoxian
    Li, Cheng
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2025, 18 (02)
  • [10] Reinforcement Learning Based Rate Adaptation for 360-Degree Video Streaming
    Jiang, Zhiqian
    Zhang, Xu
    Xu, Yiling
    Ma, Zhan
    Sun, Jun
    Zhang, Yunfei
    IEEE TRANSACTIONS ON BROADCASTING, 2021, 67 (02) : 409 - 423