Multiple Semantic Matching on Augmented N-Partite Graph for Object Co-Segmentation

被引:30
作者
Wang, Chuan [1 ,2 ]
Zhang, Hua [1 ,2 ]
Yang, Liang [3 ]
Cao, Xiaochun [1 ,2 ]
Xiong, Hongkai [4 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, State Key Lab Informat Secur, Beijing 100093, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing 100049, Peoples R China
[3] Tianjin Univ Commerce, Sch Informat Engn, Tianjin 300134, Peoples R China
[4] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Object co-segmentation; semantic candidate; multiple matches; N-partite graph; COSEGMENTATION; GRADIENTS;
D O I
10.1109/TIP.2017.2750410
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent methods for object co-segmentation focus on discovering single co-occurring relation of candidate regions representing the foreground of multiple images. However, region extraction based only on low and middle level information often occupies a large area of background without the help of semantic context. In addition, seeking single matching solution very likely leads to discover local parts of common objects. To cope with these deficiencies, we present a new object co-segmentation framework, which takes advantages of semantic information and globally explores multiple co-occurring matching cliques based on an N-partite graph structure. To this end, we first propose to incorporate candidate generation with semantic context. Based on the regions extracted from semantic segmentation of each image, we design a merging mechanism to hierarchically generate candidates with high semantic responses. Second, all candidates are taken into consideration to globally formulate multiple maximum weighted matching cliques, which complement the discovery of part of the common objects induced by a single clique. To facilitate the discovery of multiple matching cliques, an N-partite graph, which inherently excludes intra-links between candidates from the same image, is constructed to separate multiple cliques without additional constraints. Further, we augment the graph with an additional virtual node in each part to handle irrelevant matches when the similarity between the two candidates is too small. Finally, with the explored multiple cliques, we statistically compute pixel-wise co-occurrence map for each image. Experimental results on two benchmark data sets, i.e., iCoseg and MSRC data sets achieve desirable performance and demonstrate the effectiveness of our proposed framework.
引用
收藏
页码:5825 / 5839
页数:15
相关论文
共 47 条
[1]   Semantic Object Selection [J].
Ahmed, Ejaz ;
Cohen, Scott ;
Price, Brian .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3150-3157
[2]  
Alexe B, 2010, PROC CVPR IEEE, P73, DOI 10.1109/CVPR.2010.5540226
[3]  
[Anonymous], 2015, PROC CVPR IEEE
[4]  
Arbeláez P, 2009, PROC CVPR IEEE, P2294, DOI 10.1109/CVPRW.2009.5206707
[5]   iCoseg: Interactive Co-segmentation with Intelligent Scribble Guidance [J].
Batra, Dhruv ;
Kowdle, Adarsh ;
Parikh, Devi ;
Luo, Jiebo ;
Chen, Tsuhan .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :3169-3176
[6]   Robust dense reconstruction by range merging based on confidence estimation [J].
Chen, Yadang ;
Hao, Chuanyan ;
Wu, Wen ;
Wu, Enhua .
SCIENCE CHINA-INFORMATION SCIENCES, 2016, 59 (09)
[7]   BING: Binarized Normed Gradients for Objectness Estimation at 300fps [J].
Cheng, Ming-Ming ;
Zhang, Ziming ;
Lin, Wen-Yan ;
Torr, Philip .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3286-3293
[8]   Global Contrast based Salient Region Detection [J].
Cheng, Ming-Ming ;
Zhang, Guo-Xin ;
Mitra, Niloy J. ;
Huang, Xiaolei ;
Hu, Shi-Min .
2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, :409-416
[9]   Cosegmentation and Cosketch by Unsupervised Learning [J].
Dai, Jifeng ;
Wu, Ying Nian ;
Zhou, Jie ;
Zhu, Song-Chun .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1305-1312
[10]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893