Salient Object Detection Based on Feature Enhancement in Complex Scene

被引:0
作者
Li B. [1 ]
Rao H. [1 ]
机构
[1] School of Electronic and Information Engineering, South China University of Technology, Guangzhou
来源
Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science) | 2021年 / 49卷 / 11期
基金
中国国家自然科学基金;
关键词
Feature enhancement; Fully convolutional neural networks; Missed detection; Object misdetection; Salient object detection;
D O I
10.12141/j.issn.1000-565X.210125
中图分类号
学科分类号
摘要
The performance of salient object detection is greatly improved by the superior feature extraction ability of Fully Convolutional Neural Networks(FCN). However, the simple fusion strategies (feature addition or concatenation) cannot effectively enhance features, resulting in algorithm's object misdetection and missed detection in complex scenes. The paper proposed a specifically feature enhancement method to improve the performance of salient object detection. Firstly, object misdetection mostly occurs in a scene where the background is cluttered or the object and the background are intertwined, so it greatly alleviate the object misdetection problem from the perspective of global enhancement and structural enhancement, respectively. Secondly, the missed detection of the object generally occurs in the interior and edge of the object, so the study introduce residual learning to learn the information of the missed region and refine the loss of the object interior and edge. Finally, comparison results between the proposed method with other 13 kinds of advanced methods over 5 benchmark datasets indicate that the proposed model is superior to other 13 methods, and the problems of object misdetection and missed detection in complex scenes were successfully solved. © 2021, Editorial Department, Journal of South China University of Technology. All right reserved.
引用
收藏
页码:135 / 144
页数:9
相关论文
共 36 条
  • [1] CHENGM M M, ZHANG F L, HUANG X L, Et al., Repfinder: finding approximately repeated scene elements for image editing, ACM Transactions on Graphics, 29, 4, pp. 1-8, (2010)
  • [2] DAS A, ABHISHEK H, ZITNICK L, Et al., Human attention in visual question answering: Do humans and deep networks look at the same regions?, Computer Vision and Image Understanding, 163, pp. 90-100, (2017)
  • [3] FANG H, GUPTA S, IANDOLA F, Et al., From captions to visual concepts and back, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1473-1482, (2015)
  • [4] LIN Y, PANG Z, WANG D, Et al., Task-driven visual saliency and attention-based visual question answering
  • [5] MA Y F, ZHANG H J., Contrast-based image attention analysis by using fuzzy growing, ACM International Conference on Multimedia, pp. 374-381, (2003)
  • [6] CHENG M M, MITRA N J, HUANG X, Et al., Global contrast based salient region detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, 3, pp. 569-582, (2014)
  • [7] PERAZZI F, KRAHENBUHI P, PRITCH Y, Et al., Saliency filters: Contrast based filtering for salient region detection, Proceedings of 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp. 733-740, (2012)
  • [8] LIU T, YUAN Z, SUN J, Et al., Learning to detect a salient object, IEEE Transactions on Pattern Analysis and Machine Intelligence, 33, 2, pp. 353-367, (2010)
  • [9] SUN J, LU H, LIU X., Saliency region detection based on Markov absorption probabilities, IEEE Transactions on Image Processing, 24, 5, pp. 1639-1649, (2015)
  • [10] CUI Dong, WANG Ming, LI Gang, Saliency detection based on multi-level deep features and random walk, Journal of South China University of Technology(Natural Science Edition), 48, 8, pp. 49-55, (2020)