Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-Segmentation

Citations: 44
Authors
Chen, Yun-Chun [1]
Lin, Yen-Yu [2]
Yang, Ming-Hsuan [3]
Huang, Jia-Bin [4]
Affiliations
[1] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 115, Taiwan
[2] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu 300, Taiwan
[3] Univ Calif Merced, Sch Engn, Merced, CA 95343 USA
[4] Virginia Tech, Dept Elect & Comp Engn, Blacksburg, VA 24061 USA
Funding
National Science Foundation (USA);
Keywords
Semantics; Task analysis; Image segmentation; Training; Clutter; Proposals; Pattern matching; Semantic matching; object co-segmentation; weakly-supervised learning; GRAPH; FLOW;
DOI
10.1109/TPAMI.2020.2985395
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present an approach for jointly matching and segmenting object instances of the same category within a collection of images. In contrast to existing algorithms that tackle the tasks of semantic matching and object co-segmentation in isolation, our method exploits the complementary nature of the two tasks. The key insights of our method are two-fold. First, the estimated dense correspondence fields from semantic matching provide supervision for object co-segmentation by enforcing consistency between the predicted masks from a pair of images. Second, the predicted object masks from object co-segmentation in turn allow us to reduce the adverse effects due to background clutters for improving semantic matching. Our model is end-to-end trainable and does not require supervision from manually annotated correspondences and object masks. We validate the efficacy of our approach on five benchmark datasets: TSS, Internet, PF-PASCAL, PF-WILLOW, and SPair-71k, and show that our algorithm performs favorably against the state-of-the-art methods on both semantic matching and object co-segmentation tasks.
Pages: 3632-3647
Page count: 16