A novel co-attention computation block for deep learning based image co-segmentation

被引:9
作者
Gong, Xiaopeng [1 ]
Liu, Xiabi [1 ]
Li, Yushuo [1 ]
Li, Huiyu [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Visualco-attention; Imageco-segmentation; Deeplearning; Correlationcalculation; Averagepooling; COSEGMENTATION;
D O I
10.1016/j.imavis.2020.103973
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The correlation between images is crucial for solving the image co-segmentation problem that is segmenting common and salient objects from a set of related images. This paper proposes a novel co-attention computation block to compute the visual correlation between images for improving the co-segmentation performance. Here 'co-attention' means that we obtain the co-attention features in encoded features of an image to guide the attention in another image. To this purpose, we firstly introduce top-k average pooling to compute the channel co-attention descriptor. Then we explore the correlation between features in different spatial positions to get the spatial co-attention descriptor. Finally, these two types of co-attention descriptors are multiplied to generate a fused one. We obtain such a fused co-attention descriptor for each image and use it to produce the co-attention augmented feature map for the following processing in the applications. We embed the proposed co-attention block into a U-shaped Siamese network for fulfilling the image co-segmentation. It is proven to be able to improve the performance effectively in the experiments. To our best knowledge, it leads to the currently best results on Internet dataset and iCoseg dataset. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 55 条
[1]  
[Anonymous], 2016, CITY SCAPES DATASET
[2]   iCoseg: Interactive Co-segmentation with Intelligent Scribble Guidance [J].
Batra, Dhruv ;
Kowdle, Adarsh ;
Parikh, Devi ;
Luo, Jiebo ;
Chen, Tsuhan .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :3169-3176
[3]   Efficient Sequential Correspondence Selection by Cosegmentation [J].
Cech, Jan ;
Matas, Jiri ;
Perdoch, Michal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) :1568-1581
[4]  
Chai YN, 2012, LECT NOTES COMPUT SC, V7572, P794, DOI 10.1007/978-3-642-33718-5_57
[5]   Control of goal-directed and stimulus-driven attention in the brain [J].
Corbetta, M ;
Shulman, GL .
NATURE REVIEWS NEUROSCIENCE, 2002, 3 (03) :201-215
[6]   Cosegmentation and Cosketch by Unsupervised Learning [J].
Dai, Jifeng ;
Wu, Ying Nian ;
Zhou, Jie ;
Zhu, Song-Chun .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1305-1312
[7]   Co-Segmentation by Composition [J].
Faktor, Alon ;
Irani, Michal .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1297-1304
[8]  
Fu HZ, 2015, PROC CVPR IEEE, P4428, DOI 10.1109/CVPR.2015.7299072
[9]   Dual Attention Network for Scene Segmentation [J].
Fu, Jun ;
Liu, Jing ;
Tian, Haijie ;
Li, Yong ;
Bao, Yongjun ;
Fang, Zhiwei ;
Lu, Hanqing .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3141-3149
[10]   Robust Object Co-Segmentation Using Background Prior [J].
Han, Junwei ;
Quan, Rong ;
Zhang, Dingwen ;
Nie, Feiping .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (04) :1639-1651