Densely nested top-down flows for salient object detection

Cited by: 0

Authors:

Chaowei FANG [1]

Haibin TIAN [2]

Dingwen ZHANG [3]

Qiang ZHANG [2]

Jungong HAN [4]

Junwei HAN [3]

Affiliations:

[1] School of Artificial Intelligence, Xidian University

[2] School of Mechano-Electronic Engineering, Xidian University

[3] Brain and Artificial Intelligence Laboratory, School of Automation, Northwestern Polytechnical University

[4] Department of Computer Science, Aberystwyth University

Source:

Science China (Information Sciences) | 2022 / Vol. 65 / No. 08

Funding:

National Natural Science Foundation of China

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP391.41

Discipline classification code:

080203

Abstract:

Salient object detection (SOD), which aims to identify pixel-wise salient object regions in each input image, has received great attention in recent years. One mainstream family of SOD methods consists of a bottom-up feature encoding procedure and a top-down information decoding procedure. While numerous approaches have explored bottom-up feature extraction for this task, the design of top-down flows remains under-studied. To this end, this paper revisits the role of top-down modeling in salient object detection and designs a novel framework based on densely nested top-down flows (DNTDF). In every stage of DNTDF, features from higher levels are read in via progressive compression shortcut paths (PCSPs). The notable characteristics of the proposed method are as follows. (1) The propagation of high-level features, which usually carry relatively strong semantic information, is enhanced in the decoding procedure. (2) With the help of PCSPs, the gradient vanishing issues caused by non-linear operations in top-down information flows can be alleviated. (3) Thanks to the full exploration of high-level features, the decoding process of the method is relatively memory-efficient compared with those of existing methods. Integrating DNTDF with EfficientNet, we construct a highly lightweight SOD model with very low computational complexity. To demonstrate the effectiveness of the proposed model, comprehensive experiments are conducted on six widely used benchmark datasets. Comparisons with state-of-the-art methods and with carefully designed baseline models verify our insights on top-down flow modeling for SOD.

References

Pages: 57-70

Number of pages: 14

12 items in total

[1] Dingwen ZHANG, Bo WANG, Gerong WANG, Qiang ZHANG, Jiajia ZHANG, Jungong HAN, Zheng YOU. Onfocus detection: identifying individual-camera eye contact from unconstrained images [J]. Science China (Information Sciences), 2022, 65 (06): 5-16.
[2] Gong CHENG, Ruimin LI, Chunbo LANG, Junwei HAN. Task-wise attention guided part complementary learning for few-shot image classification [J]. Science China (Information Sciences), 2021 (02).
[3] Jun Wei, Shuhui Wang, Qingming Huang. F3Net: Fusion, Feedback and Focus for Salient Object Detection [J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2020 (07).
[4] Zhang, Dingwen; Han, Junwei; Yang, Le; Xu, Dong. SPFTN: A Joint Learning Framework for Localizing and Segmenting Objects in Weakly Labeled Videos [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42 (02): 475-489.
[5] Zhang, Dingwen; Han, Junwei; Zhao, Long; Meng, Deyu. Leveraging Prior-Knowledge for Weakly Supervised Object Detection Under a Collaborative Self-Paced Curriculum Learning Framework [J]. International Journal of Computer Vision, 2019, 127 (04): 363-380.
[6] Han, Junwei; Zhang, Dingwen; Cheng, Gong; Liu, Nian; Xu, Dong. Advanced Deep-Learning Techniques for Salient and Category-Specific Object Detection: A Survey [J]. IEEE Signal Processing Magazine, 2018, 35 (01): 84-100.
[7] Li Xi, Zhao Liming, Wei Lina, Yang Ming-Hsuan, Wu Fei, Zhuang Yueting, Ling Haibin, Wang Jingdong. DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection [J]. IEEE Transactions on Image Processing, 2016 (8).
[8] Russakovsky, Olga; Deng, Jia; Su, Hao; Krause, Jonathan; Satheesh, Sanjeev; Ma, Sean; Huang, Zhiheng; Karpathy, Andrej; Khosla, Aditya; Bernstein, Michael; Berg, Alexander C.; Fei-Fei, Li. ImageNet Large Scale Visual Recognition Challenge [J]. International Journal of Computer Vision, 2015, 115 (03): 211-252.
[9] He, Kaiming; Zhang, Xiangyu; Ren, Shaoqing; Sun, Jian. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37 (09): 1904-1916.
[10] Cheng, Ming-Ming; Mitra, Niloy J.; Huang, Xiaolei; Torr, Philip H. S.; Hu, Shi-Min. Global Contrast Based Salient Region Detection [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37 (03): 569-582.
