MSeg: A Composite Dataset for Multi-domain Semantic Segmentation

被引:123
作者
Lambert, John [1 ,3 ]
Liu, Zhuang [1 ,2 ]
Sener, Ozan [1 ]
Hays, James [3 ,4 ]
Koltun, Vladlen [1 ]
机构
[1] Intel Labs, Santa Clara, CA 95054 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
[3] Georgia Inst Technol, Atlanta, GA 30332 USA
[4] Argo AI, Pittsburg, KS USA
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020年
关键词
D O I
10.1109/CVPR42600.2020.00295
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present MSeg, a composite dataset that unifies semantic segmentation datasets from different domains. A naive merge of the constituent datasets yields poor performance due to inconsistent taxonomies and annotation practices. We reconcile the taxonomies and bring the pixel-level annotations into alignment by relabeling more than 220,000 object masks in more than 80,000 images. The resulting composite dataset enables training a single semantic segmentation model that functions effectively across domains and generalizes to datasets that were not seen during training. We adopt zero-shot cross-dataset transfer as a benchmark to systematically evaluate a model's robustness and show that MSeg training yields substantially more robust models in comparison to training on individual datasets or naive mixing of datasets without the presented contributions. A model trained on MSeg ranks first on the WildDash leaderboard for robust semantic segmentation, with no exposure to WildDash data during training.
引用
收藏
页码:2876 / 2885
页数:10
相关论文
共 45 条
[1]  
[Anonymous], 2018, CVPR
[2]  
[Anonymous], 2016, ARXIV160401545
[3]  
[Anonymous], 2017, ICCV
[4]  
[Anonymous], 2018, ECCV
[5]  
[Anonymous], 2011, CVPR
[6]  
[Anonymous], 2018, CVPR
[7]  
[Anonymous], 2017, CVPR
[8]  
[Anonymous], 2016, INT C LEARNING REPRE
[9]  
[Anonymous], 2016, CVPR
[10]  
[Anonymous], 2017, ARXIV170107275