Densely nested top-down flows for salient object detection

被引：0

作者：

Chaowei Fang

Haibin Tian

Dingwen Zhang

Qiang Zhang

Jungong Han

Junwei Han

机构：

[1] Xidian University,School of Artificial Intelligence

[2] Xidian University,School of Mechano

[3] Northwestern Polytechnical University,Electronic Engineering

[4] Aberystwyth University,Brain and Artificial Intelligence Laboratory, School of Automation

来源：

Science China Information Sciences | 2022年 / 65卷

关键词：

salient object detection; top-down flow; densely nested framework; convolutional neural networks;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

With the goal of identifying pixel-wise salient object regions from each input image, salient object detection (SOD) has been receiving great attention in recent years. One kind of mainstream SOD method is formed by a bottom-up feature encoding procedure and a top-down information decoding procedure. While numerous approaches have explored the bottom-up feature extraction for this task, the design of top-down flows remains under-studied. To this end, this paper revisits the role of top-down modeling in salient object detection and designs a novel densely nested top-down flows (DNTDF)-based framework. In every stage of DNTDF, features from higher levels are read in via the progressive compression shortcut paths (PCSPs). The notable characteristics of our proposed method are as follows. (1) The propagation of high-level features which usually have relatively strong semantic information is enhanced in the decoding procedure. (2) With the help of PCSP, the gradient vanishing issues caused by non-linear operations in top-down information flows can be alleviated. (3) Thanks to the full exploration of high-level features, the decoding process of our method is relatively memory-efficient compared to those of existing methods. Integrating DNTDF with EfficientNet, we construct a highly light-weighted SOD model, with very low computational complexity. To demonstrate the effectiveness of the proposed model, comprehensive experiments are conducted on six widely-used benchmark datasets. The comparisons to the most state-of-the-art methods as well as the carefully-designed baseline models verify our insights on the top-down flow modeling for SOD.

引用

共 33 条

[1] Han J W(2018)Advanced deep-learning techniques for salient and category-specific object detection: a survey IEEE Signal Process Mag 35 84-100
[2] Zhang D W(2020)SPFTN: a joint learning framework for localizing and segmenting objects in weakly labeled videos IEEE Trans Pattern Anal Mach Intell 42 475-489
[3] Cheng G(2019)Leveraging prior-knowledge for weakly supervised object detection under a collaborative self-paced curriculum learning framework Int J Comput Vis 127 363-380
[4] Zhang D W(2021)Task-wise attention guided part complementary learning for few-shot image classification Sci China Inf Sci 64 120104-3930
[5] Han J W(2022)Onfocus detection: identifying individual-camera eye contact from unconstrained images Sci China Inf Sci 65 160101-582
[6] Yang L(2020)Multi-attention based cross-domain beauty product image retrieval Sci China Inf Sci 63 120112-1769
[7] Zhang D W(2016)Deepsaliency: multi-task deep neural network model for salient object detection IEEE Trans Image Process 25 3919-1916
[8] Han J W(2015)Global contrast based salient region detection IEEE Trans Pattern Anal Mach Intell 37 569-252
[9] Zhao L(2020)Synthesizing supervision for learning deep saliency network without human annotation IEEE Trans Pattern Anal Mach Intell 42 1755-undefined
[10] Cheng G(2015)Spatial pyramid pooling in deep convolutional networks for visual recognition IEEE Trans Pattern Anal Mach Intell 37 1904-undefined

← 1 2 3 4 →