Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-Segmentation

被引：44

作者：

Chen, Yun-Chun ^{[1
]}

Lin, Yen-Yu ^{[2
]}

Yang, Ming-Hsuan ^{[3
]}

Huang, Jia-Bin ^{[4
]}

机构：

[1] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 115, Taiwan

[2] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu 300, Taiwan

[3] Univ Calif Merced, Sch Engn, Merced, CA 95343 USA

[4] Virginia Tech, Dept Elect & Comp Engn, Blacksburg, VA 24061 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2021年 / 43卷 / 10期

基金：

美国国家科学基金会;

关键词：

Semantics; Task analysis; Image segmentation; Training; Clutter; Proposals; Pattern matching; Semantic matching; object co-segmentation; weakly-supervised learning; GRAPH; FLOW;

D O I：

10.1109/TPAMI.2020.2985395

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present an approach for jointly matching and segmenting object instances of the same category within a collection of images. In contrast to existing algorithms that tackle the tasks of semantic matching and object co-segmentation in isolation, our method exploits the complementary nature of the two tasks. The key insights of our method are two-fold. First, the estimated dense correspondence fields from semantic matching provide supervision for object co-segmentation by enforcing consistency between the predicted masks from a pair of images. Second, the predicted object masks from object co-segmentation in turn allow us to reduce the adverse effects due to background clutters for improving semantic matching. Our model is end-to-end trainable and does not require supervision from manually annotated correspondences and object masks. We validate the efficacy of our approach on five benchmark datasets: TSS, Internet, PF-PASCAL, PF-WILLOW, and SPair-71k, and show that our algorithm performs favorably against the state-of-the-art methods on both semantic matching and object co-segmentation tasks.

引用

页码：3632 / 3647

页数：16

共 98 条

[1]

[Anonymous], 2017, P BMVC

[2]

[Anonymous], NIPS 2011

[3] Interactively Co-segmentating Topically Related Images with Intelligent Scribble Guidance [J].

Batra, Dhruv ;

Kowdle, Adarsh ;

Parikh, Devi ;

Luo, Jiebo ;

Chen, Tsuhan .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 93 (03) :273-292

[4] Optimizing the decomposition for multiple foreground cosegmentation [J].

Chang, Haw-Shiuan ;

Wang, Yu-Chiang Frank .

COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 141 :18-27

[5]

Chen Hong, 2018, ACCV, P435

[6] Predicting Multiple Attributes via Relative Multi-task Learning [J].

Chen, Lin ;

Zhang, Qiang ;

Li, Baoxin .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1027-1034

[7] Large-Scale Structure from Motion with Semantic Constraints of Aerial Images [J].

Chen, Yu ;

Wang, Yao ;

Lu, Peng ;

Chen, Yisong ;

Wang, Guoping .

PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256 :347-359

[8] CrDoCo: Pixel-level Domain Transfer with Cross-Domain Consistency [J].

Chen, Yun-Chun ;

Lin, Yen-Yu ;

Yang, Ming-Hsuan ;

Huang, Jia-Bin .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1791-1800

[9]

Choy CB, 2016, ADV NEUR IN, V29

[10] Histograms of oriented gradients for human detection [J].

Dalal, N ;

Triggs, B .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893

← 1 2 3 4 5 6 7 8 9 10 →