3D Layout encoding network for spatial-aware 3D saliency modelling

被引:2
|
作者
Yuan, Jing [1 ]
Cao, Yang [2 ]
Kang, Yu [2 ]
Song, Weiguo [1 ]
Yin, Zhongcheng [2 ]
Ba, Rui [1 ]
Ma, Qing [1 ]
机构
[1] Univ Sci & Technol China, State Key Lab Fire Sci, Hefei, Anhui, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
image sensors; image fusion; object detection; image colour analysis; feature extraction; popular 3D multimedia applications; existing 3D devices; low quality; holes; predictions; single depth images; deep layout features; spatial-aware saliency prediction; coarse depth-induced saliency cues; depth details; high-quality RGB image; low-level; final prediction; spatial layout; saliency modelling results; OBJECT DETECTION; VISUAL-ATTENTION;
D O I
10.1049/iet-cvi.2018.5591
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Three-dimensional (3D) [red, green and blue (RGB) + depth] saliency modelling can help with popular 3D multimedia applications. However, depth images produced from existing 3D devices are often with low quality, e.g. containing noises and holes. In this study, rather than relying on features or predictions directly derived from single depth images, the authors propose to encode deep layout features to facilitate the spatial-aware saliency prediction. Specifically, they first generate coarse depth-induced saliency cues which are careless of depth details. Then, to leverage the information of the high-quality RGB image, they embed both low-level and high-level RGB deep features to refine the final prediction. In this way, they take both bottom-up and top-down cues together with spatial layout into account and achieve better saliency modelling results. Experiments on five public datasets show the superiority of the proposed method.
引用
收藏
页码:480 / 488
页数:9
相关论文
共 50 条
  • [1] Spatial-aware stacked regression network for real-time 3D hand pose estimation
    Ren, Pengfei
    Sun, Haifeng
    Huang, Weiting
    Hao, Jiachang
    Cheng, Daixuan
    Qi, Qi
    Wang, Jingyu
    Liao, Jianxin
    NEUROCOMPUTING, 2021, 437 : 42 - 57
  • [2] RegionNet: Region-feature-enhanced 3D Scene Understanding Network with Dual Spatial-aware Discriminative Loss
    Zhang, Guanghui
    Zhu, Dongchen
    Ye, Xiaoqing
    Shi, Wenjun
    Chen, Minghong
    Li, Jiamao
    Zhang, Xiaolin
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 8247 - 8254
  • [3] 3D Spatial Layout Propagation in a Video Sequence
    Rituerto, Alejandro
    Manduchi, Roberto
    Murillo, Ana C.
    Guerrero, J. J.
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT II, 2014, 8815 : 374 - 382
  • [4] Visual search is influenced by 3D spatial layout
    Finlayson, Nonie J.
    Grove, Philip M.
    ATTENTION PERCEPTION & PSYCHOPHYSICS, 2015, 77 (07) : 2322 - 2330
  • [5] Visual search is influenced by 3D spatial layout
    Nonie J. Finlayson
    Philip M. Grove
    Attention, Perception, & Psychophysics, 2015, 77 : 2322 - 2330
  • [6] Saliency3D: A 3D Saliency Dataset Collected on Screen
    Wang, Yao
    Dai, Qi
    Bace, Mihai
    Klein, Karsten
    Bulling, Andreas
    PROCEEDINGS OF THE 2024 ACM SYMPOSIUM ON EYE TRACKING RESEARCH & APPLICATIONS, ETRA 2024, 2024,
  • [7] A methodology for modelling of 3D spatial constraints
    Xu D.
    van Oosterom P.
    Zlatanova S.
    van Oosterom, Peter (p.j.m.vanoosterom@tudelft.nl), 1600, Springer Science and Business Media Deutschland GmbH (00): : 95 - 117
  • [8] 3D semantic segmentation based on spatial-aware convolution and shape completion for augmented reality applications
    Guo, Yun-Chih
    Weng, Tzu-Hsuan
    Fischer, Robin
    Fu, Li-Chen
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 224
  • [9] Real-time Image Enhancer via Learnable Spatial-aware 3D Lookup Tables
    Wang, Tao
    Li, Yong
    Peng, Jingyang
    Ma, Yipeng
    Wang, Xian
    Song, Fenglong
    Yan, Youliang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2451 - 2460
  • [10] Modelling 3D spatial objects in a geo-DBMS using a 3D primitive
    Arens, C
    Stoter, J
    van Oosterom, P
    COMPUTERS & GEOSCIENCES, 2005, 31 (02) : 165 - 177