Perception-and-Regulation Network for Salient Object Detection

被引：9

作者：

Zhu, Jinchao ^{[1
,2
]}

Zhang, Xiaoyu ^{[1
]}

Fang, Xian ^{[3
]}

Wang, Yuxuan ^{[1
]}

Tan, Panlong ^{[1
]}

Liu, Junnan ^{[4
]}

机构：

[1] Nankai Univ, Coll Artificial Intelligence, Tianjin 300350, Peoples R China

[2] Tsinghua Univ, Dept Automat, BNRist, Tianjin 300350, Peoples R China

[3] Nankai Univ, Coll Comp Sci, Tianjin 300350, Peoples R China

[4] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Harbin 150001, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2023年 / 25卷

基金：

中国国家自然科学基金;

关键词：

Semantics; Regulation; Object detection; Feature extraction; Convolution; Logic gates; Task analysis; Salient object detection; convolutional neural networks; attention mechanism; global perception; MODEL;

D O I：

10.1109/TMM.2022.3210366

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Effective fusion of different types of features is the key to salient object detection (SOD). The majority of the existing network structure designs are based on the subjective experience of scholars, and the process of feature fusion does not consider the relationship between the fused features and the highest-level features. In this paper, we focus on the feature relationship and propose a novel global attention unit, which we term the "perception-and-regulation" (PR) block, that adaptively regulates the feature fusion process by explicitly modelling the interdependencies between features. The perception part uses the structure of the fully connected layers in the classification networks to learn the size and shape of the objects. The regulation part selectively strengthens and weakens the features to be fused. An imitating eye observation module (IEO) is further employed to improve the global perception capabilities of the network. The imitation of foveal vision and peripheral vision enables the IEO to scrutinize highly detailed objects and to organize a broad spatial scene to better segment objects. Sufficient experiments conducted on the SOD datasets demonstrate that the proposed method performs favourably against the 29 state-of-the-art methods.

引用

页码：6525 / 6537

页数：13

共 84 条

[1]

Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596

[2] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[3]

Chen Q, 2021, AAAI CONF ARTIF INTE, V35, P1063

[4] EF-Net: A novel enhancement and fusion network for RGB-D saliency detection [J].

Chen, Qian ;

Fu, Keren ;

Liu, Ze ;

Chen, Geng ;

Du, Hongwei ;

Qiu, Bensheng ;

Shao, Ling .

PATTERN RECOGNITION, 2021, 112

[5] Reverse Attention-Based Residual Network for Salient Object Detection [J].

Chen, Shuhan ;

Tan, Xiuli ;

Wang, Ben ;

Lu, Huchuan ;

Hu, Xuelong ;

Fu, Yun .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :3763-3776

[6]

Chen TC, 2009, PROC EUR SOLID-STATE, P1

[7]

Chen ZY, 2020, AAAI CONF ARTIF INTE, V34, P10599

[8] Global Contrast Based Salient Region Detection [J].

Cheng, Ming-Ming ;

Mitra, Niloy J. ;

Huang, Xiaolei ;

Torr, Philip H. S. ;

Hu, Shi-Min .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (03) :569-582

[9] RepFinder: Finding Approximately Repeated Scene Elements for Image Editing [J].

Cheng, Ming-Ming ;

Zhang, Fang-Lue ;

Mitra, Niloy J. ;

Huang, Xiaolei ;

Hu, Shi-Min .

ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04)

[10]

Deng ZJ, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P684

← 1 2 3 4 5 6 7 8 9 →