A Viewport Prediction Framework for Panoramic Videos

被引:6
|
作者
Tang, Jinting [1 ,2 ]
Huo, Yongkai [1 ,2 ]
Yang, Shaoshi [3 ,4 ]
Jiang, Jianmin [1 ,2 ]
机构
[1] Shenzhen Univ, Sch Comp Sci & Software Engn, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518060, Peoples R China
[2] Shenzhen Univ, Sch Comp Sci & Software Engn, Res Inst Future Media Comp, Shenzhen 518060, Peoples R China
[3] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
[4] Beijing Univ Posts & Telecommun, Minist Educ, Key Lab Universal Wireless Commun, Beijing 100876, Peoples R China
来源
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年
基金
中国国家自然科学基金;
关键词
panoramic video; viewport prediction; object tracking; deep learning;
D O I
10.1109/ijcnn48605.2020.9207562
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Panoramic video is considered to be an attractive video format, since it provides the viewers with an immersive experience, such as virtual reality (VR) gaming. However, the viewers only focus on part of panoramic video, which is referred to as viewport. Hence, the resources consumed for distributing the remaining part of the panoramic video are wasted. It is intuitive to only deliver the video data within this viewport for reducing the distribution cost. Empirically, viewports within a time interval are highly correlated, hence the historical trajectory may be used for predicting the future viewports. On the other hand, a viewer tends to sustain attention on a specific object in a panoramic video. Motivated by these findings, we propose a deep learning-based viewport Prediction scheme, namely HOP, where the Historical viewport trajectory of viewers and Object tracking are jointly exploited by the long short-term memory (LSTM) networks. Additionally, our solution is capable of predicting multiple future viewports, while a single viewport prediction was supported by the state-of-the-art contributions. Simulation results show that our proposed HOP scheme outperforms the benchmarkers by up to 33.5% in terms of the prediction error.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Panoramic Stereo Videos with a Single Camera
    Aggarwal, Rajat
    Vohra, Amrisha
    Namboodiri, Anoop M.
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3755 - 3763
  • [22] HEVC-compliant Viewport-adaptive Streaming of Stereoscopic Panoramic Video
    Zare, Alireza
    Sreedhar, Kashyap Kammachi
    Vadakital, Vinod Kumar Malamal
    Aminlou, Alireza
    Hannuksela, Miska M.
    Gabbouj, Moncef
    2016 PICTURE CODING SYMPOSIUM (PCS), 2016,
  • [23] Tile-Based Viewport-Adaptive Panoramic Video Streaming on Smartphones
    Qian, Feng
    Han, Bo
    Xiao, Qingyang
    Gopalakrishnan, Vijay
    MOBICOM'18: PROCEEDINGS OF THE 24TH ANNUAL INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND NETWORKING, 2018, : 817 - 819
  • [24] CoLive: Edge-Assisted Clustered Learning Framework for Viewport Prediction in 360° Live Streaming
    Wang, Mu
    Chen, Xingyan
    Yang, Xu
    Peng, Shuai
    Zhao, Yu
    Xu, Mingwei
    Xu, Changqiao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5078 - 5091
  • [25] Prediction of motion sickness degree of stereoscopic panoramic videos based on content perception and binocular characteristics
    Lu, Ziang
    Yu, Mei
    Jiang, Gangyi
    Chi, Biwei
    Dong, Qifeng
    DIGITAL SIGNAL PROCESSING, 2022, 132
  • [26] Optimizing storage and delivery of Omnidirectional Videos in Viewport-dependent streaming
    Kammachi-Sreedhar, Kashyap
    Hannuksela, Miska M.
    Aksu, Emre B.
    Ilola, Lauri
    Condrad, Lukasz
    2022 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2022, : 177 - 180
  • [27] Static object tracking in road panoramic videos
    State Key Laboratory of Virtual Reality Technology and System, Beihang University, Beijing, China
    不详
    Proc. - IEEE Int. Symp. Multimedia, ISM, (57-64):
  • [28] Data compression and transmission aspects of panoramic videos
    Ng, KT
    Chan, SC
    Shum, HY
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (01) : 82 - 95
  • [29] Panoramic Image Quality-Enhancement by Fusing Neural Textures of the Adaptive Initial Viewport
    Li, Shiyuan
    Lin, Chunyu
    Liao, Kang
    Zhao, Yao
    Zhang, Xue
    2020 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES WORKSHOPS (VRW 2020), 2020, : 817 - 818
  • [30] Optimal Set of 360-Degree Videos for Viewport-Adaptive Streaming
    Corbillon, Xavier
    Devlic, Alisa
    Simon, Gwendal
    Chakareski, Jacob
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 943 - 951