The Lovasz-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks

被引:710
作者
Berman, Maxim [1 ]
Triki, Amal Rannen [1 ]
Blaschko, Matthew B. [1 ]
机构
[1] Katholieke Univ Leuven, Ctr Proc Speech & Images, Dept ESAT, Leuven, Belgium
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
基金
比利时弗兰德研究基金会;
关键词
SUBMODULAR FUNCTIONS;
D O I
10.1109/CVPR.2018.00464
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Jaccard index, also referred to as the intersection over -union score, is commonly employed in the evaluation of image segmentation results given its perceptual qualities, scale invariance which lends appropriate relevance to small objects, and appropriate counting of false negatives, in comparison to per-pixel losses. We present a method for direct optimization of the mean intersection-over-union loss in neural networks, in the context of semantic image segmentation, based on the convex Lovcisz extension of sub-modular losses. The loss is shown to perform better with respect to the Jaccard index measure than the traditionally used cross-entropy loss. We show quantitative and qualitative differences between optimizing the Jaccard index per image versus optimizing the Jaccard index taken over an entire dataset. We evaluate the impact of our method in a semantic segmentation pipeline and show substantially improved intersection-over-union segmentation scores on the Pascal VOC and Cityscapes datasets using state-of-the-art deep learning segmentation architectures.
引用
收藏
页码:4413 / 4421
页数:9
相关论文
共 27 条
[1]  
Ahmed F., 2015, IEEE INT C COMP VIS
[2]  
[Anonymous], 1983, Mathematical programming the state of the art
[3]   Learning with Submodular Functions: A Convex Optimization Perspective [J].
Bach, Francis .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2013, 6 (2-3) :145-373
[4]  
Blaschko MB, 2008, LECT NOTES COMPUT SC, V5302, P2, DOI 10.1007/978-3-540-88682-2_2
[5]  
Bourdev L, 2010, LECT NOTES COMPUT SC, V6316, P168, DOI 10.1007/978-3-642-15567-3_13
[6]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[7]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[8]   What is a good evaluation measure for semantic segmentation? [J].
Csurka, Gabriela ;
Larlus, Diane ;
Perronnin, Florent .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
[9]  
Everingham M, 2012, The PAS- CAL. Visual Object Classes Challenge 2012 (VOC2012) Results
[10]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338