Weakly Supervised Training of Universal Visual Concepts for Multi-domain Semantic Segmentation

被引：2

作者：

Bevandic, Petra ^{[1
]}

Orsic, Marin ^{[1
]}

Saric, Josip ^{[1
]}

Grubisic, Ivan ^{[1
]}

Segvic, Sinisa ^{[1
]}

机构：

[1] Univ Zagreb, Fac Elect Engn & Comp, Unska 3, Zagreb 10000, Croatia

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2024年 / 132卷 / 07期

关键词：

Semantic segmentation; Multi-domain training; Universal taxonomy;

D O I：

10.1007/s11263-024-01986-z

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep supervised models have an unprecedented capacity to absorb large quantities of training data. Hence, training on multiple datasets becomes a method of choice towards strong generalization in usual scenes and graceful performance degradation in edge cases. Unfortunately, popular datasets often have discrepant granularities. For instance, the Cityscapes road class subsumes all driving surfaces, while Vistas defines separate classes for road markings, manholes etc. Furthermore, many datasets have overlapping labels. For instance, pickups are labeled as trucks in VIPER, cars in Vistas, and vans in ADE20k. We address this challenge by considering labels as unions of universal visual concepts. This allows seamless and principled learning on multi-domain dataset collections without requiring any relabeling effort. Our method improves within-dataset and cross-dataset generalization, and provides opportunity to learn visual concepts which are not separately labeled in any of the training datasets. Experiments reveal competitive or state-of-the-art performance on two multi-domain dataset collections and on the WildDash 2 benchmark.

引用

页码：2450 / 2472

页数：23

共 69 条

[1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

[2]

Bevandic P., 2022, Universal taxonomies for semantic segmentation (source code)

[3]

Bevandic P., 2022, BMVC

[4] Dense open-set recognition based on training with noisy negative images [J].

Bevandic, Petra ;

Kreso, Ivan ;

Orsic, Marin ;

Segvic, Sinisa .

IMAGE AND VISION COMPUTING, 2022, 124

[5] Multi-domain semantic segmentation with overlapping labels [J].

Bevandic, Petra ;

Orsic, Marin ;

Grubisic, Ivan ;

Saric, Josip ;

Segvic, Sinisa .

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :2422-2431

[6]

Biase G.D., 2021, Computer vision and pattern recognition

[7] The Fishyscapes Benchmark: Measuring Blind Spots in Semantic Segmentation [J].

Blum, Hermann ;

Sarlin, Paul-Edouard ;

Nieto, Juan ;

Siegwart, Roland ;

Cadena, Cesar .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (11) :3119-3135

[8] In-Place Activated BatchNorm for Memory-Optimized Training of DNNs [J].

Bulo, Samuel Rota ;

Porzi, Lorenzo ;

Kontschieder, Peter .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5639-5647

[9]

Chan R., 2021, NeurIPS

[10] Entropy Maximization and Meta Classification for Out-of-Distribution Detection in Semantic Segmentation [J].

Chan, Robin ;

Rottmann, Matthias ;

Gottschalk, Hanno .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :5108-5117

← 1 2 3 4 5 6 7 →