Panoramic Vision Transformer for Saliency Detection in 360° Videos

被引:11
作者
Yun, Heeseung [1 ]
Lee, Sehun [1 ]
Kim, Gunhee [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
来源
COMPUTER VISION - ECCV 2022, PT XXXV | 2022年 / 13695卷
关键词
360 degrees videos; Saliency detection; Vision transformer;
D O I
10.1007/978-3-031-19833-5_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
360 degrees video saliency detection is one of the challenging benchmarks for 360 degrees video understanding since non-negligible distortion and discontinuity occur in the projection of any format of 360 degrees videos, and capture-worthy viewpoint in the omnidirectional sphere is ambiguous by nature. We present a new framework named Panoramic Vision Transformer (PAVER). We design the encoder using Vision Transformer with deformable convolution, which enables us not only to plug pretrained models from normal videos into our architecture without additional modules or finetuning but also to perform geometric approximation only once, unlike previous deep CNN-based approaches. Thanks to its powerful encoder, PAVER can learn the saliency from three simple relative relations among local patch features, outperforming state-of-the-art models for the Wild360 benchmark by large margins without supervision or auxiliary information like class activation. We demonstrate the utility of our saliency prediction model with the omnidirectional video quality assessment task in VQA-ODV, where we consistently improve performance without any form of supervision, including head movement.
引用
收藏
页码:422 / 439
页数:18
相关论文
共 50 条
  • [1] Saliency Detection of Panoramic Images Based on Robust Vision Transformer and Multiple Attention
    Chen, Xiaolei
    Zhang, Pengcheng
    Lu, Yubing
    Cao, Baoning
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (06) : 2246 - 2255
  • [2] Vision Transformer-Based Tailing Detection in Videos
    Lee, Jaewoo
    Lee, Sungjun
    Cho, Wonki
    Siddiqui, Zahid Ali
    Park, Unsang
    APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [3] Saliency Detection Model for Stereoscopic Panoramic Images
    Qiu Miaomiao
    Chai Xiongli
    Shao Feng
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (08)
  • [4] Saliency detection for panoramic landscape images of outdoor scenes
    Han, Byeong-Ju
    Sim, Jae-Young
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 49 : 27 - 37
  • [5] An Overview of Research Progress on Saliency Detection of Panoramic VR Images
    Ding Y.
    Liu Y.-W.
    Liu J.-X.
    Liu K.-D.
    Wang L.-M.
    Xu Z.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (07): : 1575 - 1583
  • [6] RELATIONAL ENTROPY-BASED SALIENCY DETECTION IN IMAGES AND VIDEOS
    Duncan, Kester
    Sarkar, Sudeep
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1093 - 1096
  • [7] Saliency Detection for Content Aware Computer Vision Applications
    Pandivalavan, Manipoonchelvi
    Karuppiah, Muneeswaran
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (04) : 528 - 533
  • [8] Panoramic Image Saliency Detection by Fusing Visual Frequency Feature and Viewing Behavior Pattern
    Ding, Ying
    Liu, Yanwei
    Liu, Jinxia
    Liu, Kedong
    Wang, Liming
    Xu, Zhen
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 418 - 429
  • [9] Multi-scale graph feature extraction network for panoramic image saliency detection
    Ripei Zhang
    Chunyi Chen
    Jun Peng
    The Visual Computer, 2024, 40 (2) : 953 - 970
  • [10] Multi-scale graph feature extraction network for panoramic image saliency detection
    Zhang, Ripei
    Chen, Chunyi
    Peng, Jun
    VISUAL COMPUTER, 2024, 40 (02) : 953 - 970