MSeg: A Composite Dataset for Multi-domain Semantic Segmentation

被引：123

作者：

Lambert, John ^{[1
,3
]}

Liu, Zhuang ^{[1
,2
]}

Sener, Ozan ^{[1
]}

Hays, James ^{[3
,4
]}

Koltun, Vladlen ^{[1
]}

机构：

[1] Intel Labs, Santa Clara, CA 95054 USA

[2] Univ Calif Berkeley, Berkeley, CA 94720 USA

[3] Georgia Inst Technol, Atlanta, GA 30332 USA

[4] Argo AI, Pittsburg, KS USA

来源：

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020年

关键词：

D O I：

10.1109/CVPR42600.2020.00295

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present MSeg, a composite dataset that unifies semantic segmentation datasets from different domains. A naive merge of the constituent datasets yields poor performance due to inconsistent taxonomies and annotation practices. We reconcile the taxonomies and bring the pixel-level annotations into alignment by relabeling more than 220,000 object masks in more than 80,000 images. The resulting composite dataset enables training a single semantic segmentation model that functions effectively across domains and generalizes to datasets that were not seen during training. We adopt zero-shot cross-dataset transfer as a benchmark to systematically evaluate a model's robustness and show that MSeg training yields substantially more robust models in comparison to training on individual datasets or naive mixing of datasets without the presented contributions. A model trained on MSeg ranks first on the WildDash leaderboard for robust semantic segmentation, with no exposure to WildDash data during training.

引用

页码：2876 / 2885

页数：10