DAM: Deep Reinforcement Learning based Preload Algorithm with Action Masking for Short Video Streaming

被引:9
|
作者
Qian, Si-Ze [1 ]
Xie, Yuhong [1 ]
Pan, Zipeng [1 ]
Zhang, Yuan [2 ]
Lin, Tao [2 ]
机构
[1] Commun Univ China, Beijing, Peoples R China
[2] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022年
基金
中国国家自然科学基金;
关键词
Short video streaming; reinforcement learning; action masking;
D O I
10.1145/3503161.3551573
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Short video streaming has been increasingly popular in recent years. Due to its unique user behavior of watching and sliding, a critical technique issue is to design a preload algorithm deciding which video chunk to download next, bitrate selection and the pause time, in order to improve user experience while reducing bandwidth wastage. However, designing such a preload algorithm is non-trivial, especially taking into account conflicting goals of improving QoE and reducing bandwidth wastage. In this paper, we propose a deep reinforcement learning-based approach to simultaneously decide the aforementioned three decision variables via learning an optimal policy under a complex environment of varying network conditions and unpredictable user behavior. In particular, we incorporate domain knowledge into the decision procedure via action masking to make decisions more transparent, and accelerate the model training. Experimental results validate the proposed approach significantly outperforms baseline algorithms in terms of QoE metrics and bandwidth wastage.
引用
收藏
页码:7030 / 7034
页数:5
相关论文
共 50 条
  • [21] Deep Reinforcement Learning for Financial Forecasting in Static and Streaming Cases
    Ram, Aravilli Atchuta
    Yadav, Sandarbh
    Vivek, Yelleti
    Ravi, Vadlamani
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2024, 23 (06)
  • [22] ABRaider: Multiphase Reinforcement Learning for Environment-Adaptive Video Streaming
    Choi, Wangyu
    Chen, Jiasi
    Yoon, Jongwon
    IEEE ACCESS, 2022, 10 : 53108 - 53123
  • [23] CLAPS: Curriculum Learning-based Adaptive Bitrate and Preloading for Short video streaming
    Sun, Fengzhou
    Yang, Hao
    Lin, Tao
    Zhang, Yuan
    Chen, Zhe
    Chen, Zheng
    Yan, Jinyao
    2023 IEEE 25TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, MMSP, 2023,
  • [24] Multi-Agent Reinforcement Learning Algorithm Based on Action Prediction
    童亮
    陆际联
    Journal of Beijing Institute of Technology(English Edition), 2006, (02) : 133 - 137
  • [25] Action Space Shaping in Deep Reinforcement Learning
    Kanervisto, Anssi
    Scheller, Christian
    Hautamaki, Ville
    2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020), 2020, : 479 - 486
  • [26] Path planning of robotic arm based on deep reinforcement learning algorithm
    Al-Gabalawy M.
    Advanced Control for Applications: Engineering and Industrial Systems, 2022, 4 (01):
  • [27] Distributed Edge Computing Offloading Algorithm Based on Deep Reinforcement Learning
    Li, Yunzhao
    Qi, Feng
    Wang, Zhili
    Yu, Xiuming
    Shao, Sujie
    IEEE ACCESS, 2020, 8 : 85204 - 85215
  • [28] DHP: A Joint Video Download and Dynamic Bitrate Adaptation Algorithm for Short Video Streaming
    Gao, Wenhua
    Zhang, Lanju
    Yang, Hao
    Zhang, Yuan
    Yan, Jinyao
    Lin, Tao
    MULTIMEDIA MODELING, MMM 2023, PT II, 2023, 13834 : 587 - 598
  • [29] Deep Reinforcement Learning for Video Summarization with Semantic Reward
    Sun, Haoran
    Zhu, Xiaolong
    Zhou, Conghua
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 754 - 755
  • [30] Macro-Action-Based Deep Multi-Agent Reinforcement Learning
    Xiao, Yuchen
    Hoffman, Joshua
    Amato, Christopher
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100