Densely nested top-down flows for salient object detection

被引:0
作者
Chaowei FANG [1 ]
Haibin TIAN [2 ]
Dingwen ZHANG [3 ]
Qiang ZHANG [2 ]
Jungong HAN [4 ]
Junwei HAN [3 ]
机构
[1] School of Artificial Intelligence,Xidian University
[2] School of Mechano-Electronic Engineering,Xidian University
[3] Brain and Artificial Intelligence Laboratory,School of Automation,Northwestern Polytechnical University
[4] Department of Computer Science,Aberystwyth University
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
With the goal of identifying pixel-wise salient object regions from each input image, salient object detection(SOD) has been receiving great attention in recent years. One kind of mainstream SOD method is formed by a bottom-up feature encoding procedure and a top-down information decoding procedure. While numerous approaches have explored the bottom-up feature extraction for this task, the design of top-down flows remains under-studied. To this end, this paper revisits the role of top-down modeling in salient object detection and designs a novel densely nested top-down flows(DNTDF)-based framework. In every stage of DNTDF, features from higher levels are read in via the progressive compression shortcut paths(PCSPs). The notable characteristics of our proposed method are as follows.(1) The propagation of high-level features which usually have relatively strong semantic information is enhanced in the decoding procedure.(2) With the help of PCSP, the gradient vanishing issues caused by non-linear operations in top-down information flows can be alleviated.(3) Thanks to the full exploration of high-level features, the decoding process of our method is relatively memory-efficient compared to those of existing methods. Integrating DNTDF with EfficientN et, we construct a highly light-weighted SOD model, with very low computational complexity. To demonstrate the effectiveness of the proposed model, comprehensive experiments are conducted on six widely-used benchmark datasets. The comparisons to the most state-of-the-art methods as well as the carefully-designed baseline models verify our insights on the top-down flow modeling for SOD.
引用
收藏
页码:57 / 70
页数:14
相关论文
共 12 条
  • [1] Onfocus detection: identifying individual-camera eye contact from unconstrained images
    Dingwen ZHANG
    Bo WANG
    Gerong WANG
    Qiang ZHANG
    Jiajia ZHANG
    Jungong HAN
    Zheng YOU
    [J]. ScienceChina(InformationSciences), 2022, 65 (06) : 5 - 16
  • [2] Task-wise attention guided part complementary learning for few-shot image classification[J]. Gong CHENG,Ruimin LI,Chunbo LANG,Junwei HAN.Science China(Information Sciences). 2021(02)
  • [3] F3Net: Fusion, Feedback and Focus for Salient Object Detection[J] . Jun Wei,Shuhui Wang,Qingming Huang.Proceedings of the AAAI Conference on Artificial Intelligence . 2020 (07)
  • [4] SPFTN: A Joint Learning Framework for Localizing and Segmenting Objects in Weakly Labeled Videos
    Zhang, Dingwen
    Han, Junwei
    Yang, Le
    Xu, Dong
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) : 475 - 489
  • [5] Leveraging Prior-Knowledge for Weakly Supervised Object Detection Under a Collaborative Self-Paced Curriculum Learning Framework
    Zhang, Dingwen
    Han, Junwei
    Zhao, Long
    Meng, Deyu
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (04) : 363 - 380
  • [6] Advanced Deep-Learning Techniques for Salient and Category-Specific Object Detection A survey
    Han, Junwei
    Zhang, Dingwen
    Cheng, Gong
    Liu, Nian
    Xu, Dong
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (01) : 84 - 100
  • [7] DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection[J] . Li Xi,Zhao Liming,Wei Lina,Yang Ming-Hsuan,Wu Fei,Zhuang Yueting,Ling Haibin,Wang Jingdong.IEEE transactions on image processing : a publication of the IEEE Signal Processing Society . 2016 (8)
  • [8] ImageNet Large Scale Visual Recognition Challenge
    Russakovsky, Olga
    Deng, Jia
    Su, Hao
    Krause, Jonathan
    Satheesh, Sanjeev
    Ma, Sean
    Huang, Zhiheng
    Karpathy, Andrej
    Khosla, Aditya
    Bernstein, Michael
    Berg, Alexander C.
    Fei-Fei, Li
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) : 211 - 252
  • [9] Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) : 1904 - 1916
  • [10] Global Contrast Based Salient Region Detection
    Cheng, Ming-Ming
    Mitra, Niloy J.
    Huang, Xiaolei
    Torr, Philip H. S.
    Hu, Shi-Min
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (03) : 569 - 582