3D Layout encoding network for spatial-aware 3D saliency modelling

被引:2
|
作者
Yuan, Jing [1 ]
Cao, Yang [2 ]
Kang, Yu [2 ]
Song, Weiguo [1 ]
Yin, Zhongcheng [2 ]
Ba, Rui [1 ]
Ma, Qing [1 ]
机构
[1] Univ Sci & Technol China, State Key Lab Fire Sci, Hefei, Anhui, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
image sensors; image fusion; object detection; image colour analysis; feature extraction; popular 3D multimedia applications; existing 3D devices; low quality; holes; predictions; single depth images; deep layout features; spatial-aware saliency prediction; coarse depth-induced saliency cues; depth details; high-quality RGB image; low-level; final prediction; spatial layout; saliency modelling results; OBJECT DETECTION; VISUAL-ATTENTION;
D O I
10.1049/iet-cvi.2018.5591
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Three-dimensional (3D) [red, green and blue (RGB) + depth] saliency modelling can help with popular 3D multimedia applications. However, depth images produced from existing 3D devices are often with low quality, e.g. containing noises and holes. In this study, rather than relying on features or predictions directly derived from single depth images, the authors propose to encode deep layout features to facilitate the spatial-aware saliency prediction. Specifically, they first generate coarse depth-induced saliency cues which are careless of depth details. Then, to leverage the information of the high-quality RGB image, they embed both low-level and high-level RGB deep features to refine the final prediction. In this way, they take both bottom-up and top-down cues together with spatial layout into account and achieve better saliency modelling results. Experiments on five public datasets show the superiority of the proposed method.
引用
收藏
页码:480 / 488
页数:9
相关论文
共 50 条
  • [21] 3D facility layout problem
    Mariem Besbes
    Marc Zolghadri
    Roberta Costa Affonso
    Faouzi Masmoudi
    Mohamed Haddar
    Journal of Intelligent Manufacturing, 2021, 32 : 1065 - 1090
  • [22] 3D facility layout problem
    Besbes, Mariem
    Zolghadri, Marc
    Affonso, Roberta Costa
    Masmoudi, Faouzi
    Haddar, Mohamed
    JOURNAL OF INTELLIGENT MANUFACTURING, 2021, 32 (04) : 1065 - 1090
  • [23] 3D landscape modelling using JAVA 3D/VRML
    Punia M.
    Pandey D.
    Journal of the Indian Society of Remote Sensing, 2006, 34 (4) : 397 - 403
  • [24] 3D Garment Modelling - Conception of its Structure in 3D
    Cichocka, Agnieszka
    Bruniaux, Pascal
    Frydrych, Iwona
    FIBRES & TEXTILES IN EASTERN EUROPE, 2016, 24 (04) : 121 - 128
  • [25] Assessing 2D visual encoding of 3D spatial connectivity
    Baldi, Benedetta F.
    Vuong, Jenny
    O'Donoghue, Sean I.
    FRONTIERS IN BIOINFORMATICS, 2024, 3
  • [26] Evaluating 3D spatial pyramids for classifying 3D shapes
    Lopez-Sastre, R. J.
    Garcia-Fuertes, A.
    Redondo-Cabrera, C.
    Acevedo-Rodriguez, F. J.
    Maldonado-Bascon, S.
    COMPUTERS & GRAPHICS-UK, 2013, 37 (05): : 473 - 483
  • [27] 3D Spatial Recognition without Spatially Labeled 3D
    Ren, Zhongzheng
    Misra, Ishan
    Schwing, Alexander G.
    Girdhar, Rohit
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13199 - 13208
  • [28] 3D or not 3D?
    Reidy, Heath
    PROFESSIONAL ENGINEERING, 2009, 22 (13) : 37 - 38
  • [29] 3D or not 3D?
    Rockley, Ted
    NEW SCIENTIST, 2013, 219 (2928) : 31 - 31
  • [30] 3D hybrid modelling of vascular network formation
    Perfahl, Holger
    Hughes, Barry D.
    Alarcon, Tomas
    Maini, Philip K.
    Lloyd, Mark C.
    Reuss, Matthias
    Byrne, Helen M.
    JOURNAL OF THEORETICAL BIOLOGY, 2017, 414 : 254 - 268