Real-time instance segmentation with polygons using an Intersection-over-Union loss

被引:0
作者
Jodogne-del Litto, Katia [1 ]
Bilodeau, Guillaume-Alexandre [1 ]
机构
[1] Polytech Montreal, LITIV Lab, Montreal, PQ, Canada
来源
2023 20TH CONFERENCE ON ROBOTS AND VISION, CRV | 2023年
基金
加拿大自然科学与工程研究理事会;
关键词
computer vision; instance segmentation; intersection-over-union; urban scene; mask approximation;
D O I
10.1109/CRV60082.2023.00027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting a binary mask for an object is more accurate but also more computationally expensive than a bounding box. Polygonal masks as developed in CenterPoly can be a good compromise. In this paper, we improve over CenterPoly by enhancing the classical regression L1 loss with a novel region-based loss and a novel order loss, as well as with a new training process for the vertices prediction head. Moreover, the previous methods that predict polygonal masks use different coordinate systems, but it is not clear if one is better than another, if we abstract the architecture requirement. We therefore investigate their impact on the prediction. We also use a new evaluation protocol with oracle predictions for the detection head, to further isolate the segmentation process and better compare the polygonal masks with binary masks. Our instance segmentation method is trained and tested with challenging datasets containing urban scenes, with a high density of road users. Experiments show, in particular, that using a combination of a regression loss and a region-based loss allows significant improvements on the Cityscapes and IDD test set compared to CenterPoly. Moreover the inference stage remains fast enough to reach real-time performance with an average of 0.045 s per frame for 2048x1024 images on a single RTX 2070 GPU. The code is available at: https://github.com/KatiaJDL/CenterPoly-v2.
引用
收藏
页码:153 / 160
页数:8
相关论文
共 31 条
[1]   Efficient Interactive Annotation of Segmentation Datasets with Polygon-RNN plus [J].
Acuna, David ;
Ling, Huan ;
Kar, Amlan ;
Fidler, Sanja .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :859-868
[2]  
Ba J., 2015, PROC INT C LEARN REP
[3]  
Bolya Daniel., 2019, arXiv, DOI DOI 10.48550/ARXIV.1904.02689
[4]  
BRADEN B., 1986, COLL MATH J, V17, P326, DOI DOI 10.1080/07468342.1986.11972974
[5]   Annotating Object Instances with a Polygon-RNN [J].
Castrejon, Lluis ;
Kundu, Kaustav ;
Urtasun, Raquel ;
Fidler, Sanja .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4485-4493
[6]   Sparse Instance Activation for Real-Time Instance Segmentation [J].
Cheng, Tianheng ;
Wang, Xinggang ;
Chen, Shaoyu ;
Zhang, Wenqiang ;
Zhang, Qian ;
Huang, Chang ;
Zhang, Zhaoxiang ;
Liu, Wenyu .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :4423-4432
[7]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[8]  
Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[9]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[10]   Poly-YOLO: higher speed, more precise detection and instance segmentation for YOLOv3 [J].
Hurtik, Petr ;
Molek, Vojtech ;
Hula, Jan ;
Vajgl, Marek ;
Vlasanek, Pavel ;
Nejezchleba, Tomas .
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (10) :8275-8290