A Viewport Prediction Framework for Panoramic Videos

被引:6
|
作者
Tang, Jinting [1 ,2 ]
Huo, Yongkai [1 ,2 ]
Yang, Shaoshi [3 ,4 ]
Jiang, Jianmin [1 ,2 ]
机构
[1] Shenzhen Univ, Sch Comp Sci & Software Engn, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518060, Peoples R China
[2] Shenzhen Univ, Sch Comp Sci & Software Engn, Res Inst Future Media Comp, Shenzhen 518060, Peoples R China
[3] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
[4] Beijing Univ Posts & Telecommun, Minist Educ, Key Lab Universal Wireless Commun, Beijing 100876, Peoples R China
来源
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年
基金
中国国家自然科学基金;
关键词
panoramic video; viewport prediction; object tracking; deep learning;
D O I
10.1109/ijcnn48605.2020.9207562
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Panoramic video is considered to be an attractive video format, since it provides the viewers with an immersive experience, such as virtual reality (VR) gaming. However, the viewers only focus on part of panoramic video, which is referred to as viewport. Hence, the resources consumed for distributing the remaining part of the panoramic video are wasted. It is intuitive to only deliver the video data within this viewport for reducing the distribution cost. Empirically, viewports within a time interval are highly correlated, hence the historical trajectory may be used for predicting the future viewports. On the other hand, a viewer tends to sustain attention on a specific object in a panoramic video. Motivated by these findings, we propose a deep learning-based viewport Prediction scheme, namely HOP, where the Historical viewport trajectory of viewers and Object tracking are jointly exploited by the long short-term memory (LSTM) networks. Additionally, our solution is capable of predicting multiple future viewports, while a single viewport prediction was supported by the state-of-the-art contributions. Simulation results show that our proposed HOP scheme outperforms the benchmarkers by up to 33.5% in terms of the prediction error.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] A unified evaluation framework for head motion prediction methods in 360° videos
    Rondon, Miguel Fabian Romero
    Sassatelli, Lucile
    Aparicio-Pardo, Ramon
    Precioso, Frederic
    MMSYS'20: PROCEEDINGS OF THE 2020 MULTIMEDIA SYSTEMS CONFERENCE, 2020, : 279 - 284
  • [32] A Content-based Viewport Prediction Framework for 360° Video Using Personalized Federated Learning and Fusion Techniques
    Setayesh, Mehdi
    Wong, Vincent W. S.
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 654 - 659
  • [33] When Green Screen Meets Panoramic Videos: An Interesting Video Combination Framework for Virtual Studio and Cellphone Applications
    Ye, Long
    Feng, Chenxi
    Cai, Juanjuan
    IEEE ACCESS, 2020, 8 : 2337 - 2347
  • [34] Panoramic Vision Transformer for Saliency Detection in 360° Videos
    Yun, Heeseung
    Lee, Sehun
    Kim, Gunhee
    COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 422 - 439
  • [35] EXPONENTIAL COORDINATES BASED ROTATION STABILIZATION FOR PANORAMIC VIDEOS
    Hoang-Phong Nguyen
    Tien-Thong Nguyen Do
    Kim, Jinwook
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 46 - 50
  • [36] 360Cast+: Viewport Adaptive Soft Delivery for 360-Degree Videos
    Yujun, Lu
    Fujihashi, Takuya
    Saruwatari, Shunsuke
    Watanabe, Takashi
    IEEE ACCESS, 2021, 9 : 52684 - 52697
  • [37] Generating VR Live Videos with Tripod Panoramic Rig
    Xu, Feng
    Zhao, Tianqi
    Luo, Bicheng
    Dai, Qionghai
    25TH 2018 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES (VR), 2018, : 446 - 450
  • [38] Content Analysis of YouTube Videos That Demonstrate Panoramic Radiography
    Grillon, Marlene
    Yeung, Andy Wai Kan
    HEALTHCARE, 2022, 10 (06)
  • [39] A SUBJECTIVE VISUAL QUALITY ASSESSMENT METHOD OF PANORAMIC VIDEOS
    Xu, Mai
    Li, Chen
    Liu, Yufan
    Deng, Xin
    Lu, Jiaxin
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 517 - 522
  • [40] Analyzing Viewport Prediction Under Different VR Interactions
    Xu, Tan
    Han, Bo
    Qian, Feng
    PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON EMERGING NETWORKING EXPERIMENTS AND TECHNOLOGIES (CONEXT '19), 2019, : 165 - 171