Dual Teacher: Improving the Reliability of Pseudo Labels for Semi-Supervised Oriented Object Detection

被引：9

作者：

Fang, Zhenyu ^{[1
,2
]}

Ren, Jinchang ^{[3
,4
]}

Zheng, Jiangbin ^{[1
]}

Chen, Rongjun ^{[3
]}

Zhao, Huimin ^{[3
]}

机构：

[1] Northwestern Polytech Univ, Sch Software, Xian 710060, Peoples R China

[2] NPU, Yangtze River Delta Res Inst, Suzhou 215400, Taicang, Peoples R China

[3] Guangdong Polytech Normal Univ, Sch Comp Sci, Guangzhou 510665, Peoples R China

[4] Robert Gordon Univ, Natl Subsea Ctr, Aberdeen AB21 0BH, Scotland

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2025年 / 63卷

基金：

中国国家自然科学基金;

关键词：

Detectors; Remote sensing; Object detection; Training; Feature extraction; Annotations; Semisupervised learning; Predictive models; Supervised learning; Interference; Consistency learning; dual teacher; oriented object detection; pseudo label; semi-supervised learning (SSL);

D O I：

10.1109/TGRS.2024.3519173

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Oriented object detection in remote sensing is a critical task for accurately location and measurement of the interested targets. Despite of its success in object detection, deep learning-based detectors rely heavily on extensive data annotation. However, variations in object appearance significantly increase the difficulty and the cost of creating large-scale annotated datasets. Semi-supervised learning (SSL) aims to utilize unlabeled data to enhance object detectors. Among these, pseudo-label-based methods have shown promising results recently. Nonetheless, as training progresses, the accumulation of errors in pseudo labels leads to prediction bias without corrections. To tackle this particular challenge, we present a SSL pipeline, named "dual teacher," for improving the reliability of pseudo labels in the semi-supervised oriented object detection. First, to mitigate the bias caused by limited annotated data, a global burn-in (GBI) strategy is introduced at the beginning of training, which guides the student detector to learn the feature extraction on a global scale. In addition, an online bounding box (bbox) correction module is proposed to decrease the occurrence of mislabeled instances and enhance the reliability of detection. These improvements are facilitated by an additional detector, instead of a single teacher model in the teacher-student architecture. Dual teacher reduces the dependency on the quality of pseudo labels related to the model complexity and combines the strengths of both the two-stage and one-stage detectors. With only 20% labeled data, dual teacher outperforms fully supervised rotated fully convolutional one-stage object detection (R-FCOS), you only look once X-small (YOLOX-s), and rotated region-based convolutional neural network (R-RCNN) by up to 2% on both a large-scale dataset for object detection in aerial images (DOTA) and SODA-A datasets. This reveals its potential in reducing labor-intensive tasks and enhancing robustness against environmental interference and noisy labels. The code is available at: https://github.com/ZYFFF-CV/DualTeacher-semisup.git.

引用

页数：15

共 62 条

[1]

Bachman P, 2014, ADV NEUR IN, V27

[2] Soft-NMS - Improving Object Detection With One Line of Code [J].

Bodla, Navaneeth ;

Singh, Bharat ;

Chellappa, Rama ;

Davis, Larry S. .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5562-5570

[3] Towards Large-Scale Small Object Detection: Survey and Benchmarks [J].

Cheng, Gong ;

Yuan, Xiang ;

Yao, Xiwen ;

Yan, Kebing ;

Zeng, Qinghua ;

Xie, Xingxing ;

Han, Junwei .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) :13467-13488

[4] Anchor-Free Oriented Proposal Generator for Object Detection [J].

Cheng, Gong ;

Wang, Jiabao ;

Li, Ke ;

Xie, Xingxing ;

Lang, Chunbo ;

Yao, Yanqing ;

Han, Junwei .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

[5] Self-Guided Proposal Generation for Weakly Supervised Object Detection [J].

Cheng, Gong ;

Xie, Xuan ;

Chen, Weining ;

Feng, Xiaoxu ;

Yao, Xiwen ;

Han, Junwei .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

[6]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[7] Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].

Ding, Jian ;

Xue, Nan ;

Long, Yang ;

Xia, Gui-Song ;

Lu, Qikai .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853

[8]

Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929

[9]

Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, DOI 10.48550/ARXIV.2107.08430]

[10] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

← 1 2 3 4 5 6 7 →