Improving Localization for Semi-Supervised Object Detection

被引：0

作者：

Rossi, Leonardo ^{[1
]}

Karimi, Akbar ^{[1
]}

Prati, Andrea ^{[1
]}

机构：

[1] Univ Parma, IMP Lab, DIA, Parma, Italy

来源：

IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT II | 2022年 / 13232卷

关键词：

Object detection; Multi-task learning; Teacher-student technique; Semi-supervised learning; Semi-supervised object detection;

D O I：

10.1007/978-3-031-06430-2_43

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Nowadays, Semi-Supervised Object Detection (SSOD) is a hot topic, since, while it is rather easy to collect images for creating a new dataset, labeling them is still an expensive and time-consuming task. One of the successful methods to take advantage of raw images on a Semi-Supervised Learning (SSL) setting is the Mean Teacher technique [17], where the operations of pseudo-labeling by the Teacher and the Knowledge Transfer from the Student to the Teacher take place simultaneously. However, the pseudo-labeling by thresholding is not the best solution since the confidence value is not strictly related to the prediction uncertainty, not permitting to safely filter predictions. In this paper, we introduce an additional classification task for bounding box localization to improve the filtering of the predicted bounding boxes and obtain higher quality on Student training. Furthermore, we empirically prove that bounding box regression on the unsupervised part can equally contribute to the training as much as category classification. Our experiments show that our IL-net (Improving Localization net) increases SSOD performance by 1.14% AP on COCO dataset in limited-annotation regime. The code is available at https://github.com/IMPLabliniPriunbiasedteacher/tree/ilnet.

引用

页码：516 / 527

页数：12

共 21 条

[1]

Clevert DA, 2016, Arxiv, DOI [arXiv:1511.07289, DOI 10.48550/ARXIV.1511.07289, 10.48550/arxiv.1511.07289]

[2] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[3] Bounding Box Regression with Uncertainty for Accurate Object Detection [J].

He, Yihui ;

Zhu, Chenchen ;

Wang, Jianren ;

Savvides, Marios ;

Zhang, Xiangyu .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2883-2892

[4]

Jeong J, 2019, ADV NEUR IN, V32

[5] Acquisition of Localization Confidence for Accurate Object Detection [J].

Jiang, Borui ;

Luo, Ruixuan ;

Mao, Jiayuan ;

Xiao, Tete ;

Jiang, Yuning .

COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :816-832

[6]

Laine S., 2017, INT C LEARNING REPRE, DOI DOI 10.48550/ARXIV.1610.02242

[7] Improving Object Detection with Selective Self-supervised Self-training [J].

Li, Yandong ;

Huang, Di ;

Qin, Danfeng ;

Wang, Liqiang ;

Gong, Boqing .

COMPUTER VISION - ECCV 2020, PT XXIX, 2020, 12374 :589-607

[8] Focal Loss for Dense Object Detection [J].

Lin, Tsung-Yi ;

Goyal, Priya ;

Girshick, Ross ;

He, Kaiming ;

Dollar, Piotr .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2999-3007

[9] Microsoft COCO: Common Objects in Context [J].

Lin, Tsung-Yi ;

Maire, Michael ;

Belongie, Serge ;

Hays, James ;

Perona, Pietro ;

Ramanan, Deva ;

Dollar, Piotr ;

Zitnick, C. Lawrence .

COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755

[10] Feature Pyramid Networks for Object Detection [J].

Lin, Tsung-Yi ;

Dollar, Piotr ;

Girshick, Ross ;

He, Kaiming ;

Hariharan, Bharath ;

Belongie, Serge .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944

← 1 2 3 →