DAM: Deep Reinforcement Learning based Preload Algorithm with Action Masking for Short Video Streaming

被引:9
|
作者
Qian, Si-Ze [1 ]
Xie, Yuhong [1 ]
Pan, Zipeng [1 ]
Zhang, Yuan [2 ]
Lin, Tao [2 ]
机构
[1] Commun Univ China, Beijing, Peoples R China
[2] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022年
基金
中国国家自然科学基金;
关键词
Short video streaming; reinforcement learning; action masking;
D O I
10.1145/3503161.3551573
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Short video streaming has been increasingly popular in recent years. Due to its unique user behavior of watching and sliding, a critical technique issue is to design a preload algorithm deciding which video chunk to download next, bitrate selection and the pause time, in order to improve user experience while reducing bandwidth wastage. However, designing such a preload algorithm is non-trivial, especially taking into account conflicting goals of improving QoE and reducing bandwidth wastage. In this paper, we propose a deep reinforcement learning-based approach to simultaneously decide the aforementioned three decision variables via learning an optimal policy under a complex environment of varying network conditions and unpredictable user behavior. In particular, we incorporate domain knowledge into the decision procedure via action masking to make decisions more transparent, and accelerate the model training. Experimental results validate the proposed approach significantly outperforms baseline algorithms in terms of QoE metrics and bandwidth wastage.
引用
收藏
页码:7030 / 7034
页数:5
相关论文
共 50 条
  • [41] A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
    Wang, Xu
    Li, Yujie
    Wang, Haoyu
    Huang, Longzhao
    Ding, Shuxue
    SENSORS, 2022, 22 (19)
  • [42] RL-Routing: An SDN Routing Algorithm Based on Deep Reinforcement Learning
    Chen, Yi-Ren
    Rezapour, Amir
    Tzeng, Wen-Guey
    Tsai, Shi-Chun
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (04): : 3185 - 3199
  • [43] A Deep Reinforcement Learning Based Mapless Navigation Algorithm Using Continuous Actions
    Duo Nanxun
    Wang Qinzhao
    Lv Qiang
    Wei Heng
    Zhang Pei
    2019 INTERNATIONAL CONFERENCE ON ROBOTS & INTELLIGENT SYSTEM (ICRIS 2019), 2019, : 63 - 68
  • [44] Deep recognition of partial differential equations based on reinforcement learning and genetic algorithm
    Jinyang Du
    Renyun Liu
    Du Cheng
    Qingliang Li
    Fanhua Yu
    The Journal of Supercomputing, 81 (5)
  • [45] Vehicle Simulation Algorithm for Observations with Variable Dimensions Based on Deep Reinforcement Learning
    Liu, Yunzhuo
    Zhang, Ruoning
    Zhou, Shijie
    ELECTRONICS, 2023, 12 (24)
  • [46] PPO-ABR: Proximal Policy Optimization based Deep Reinforcement Learning for Adaptive BitRate streaming
    Naresh, Mandan
    Saxena, Paresh
    Gupta, Manik
    2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2023, : 199 - 204
  • [47] Automated construction scheduling using deep reinforcement learning with valid action sampling
    Yao, Yuan
    Tam, Vivian W. Y.
    Wang, Jun
    Le, Khoa N.
    Butera, Anthony
    AUTOMATION IN CONSTRUCTION, 2024, 166
  • [48] Node Selection Algorithm for Federated Learning Based on Deep Reinforcement Learning for Edge Computing in IoT
    Yan, Shuai
    Zhang, Peiying
    Huang, Siyu
    Wang, Jian
    Sun, Hao
    Zhang, Yi
    Tolba, Amr
    ELECTRONICS, 2023, 12 (11)
  • [49] A deep reinforcement learning based algorithm for a distributed precast concrete production scheduling
    Du, Yu
    Li, Jun-qing
    INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2024, 268
  • [50] A Deep Reinforcement Learning Based Dynamic Pricing Algorithm in Ride-Hailing
    Shi, Bing
    Cao, Zhi
    Luo, Yikai
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT II, 2022, : 489 - 505