SOOD: Towards Semi-Supervised Oriented Object Detection

被引：42

作者：

Hua, Wei ^{[1
]}

Liang, Dingkang ^{[1
,2
]}

Li, Jingyu ^{[1
]}

Liu, Xiaolong ^{[1
]}

Zou, Zhikang ^{[2
]}

Ye, Xiaoqing ^{[2
]}

Bai, Xiang ^{[1
]}

机构：

[1] Huazhong Univ Sci & Technol, Wuhan, Hubei, Peoples R China

[2] Baidu Inc, Beijing, Peoples R China

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/CVPR52729.2023.01493

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semi-Supervised Object Detection (SSOD), aiming to explore unlabeled data for boosting object detectors, has become an active task in recent years. However, existing SSOD approaches mainly focus on horizontal objects, leaving multi-oriented objects that are common in aerial images unexplored. This paper proposes a novel Semi-supervised Oriented Object Detection model, termed SOOD, built upon the mainstream pseudo-labeling framework. Towards oriented objects in aerial scenes, we design two loss functions to provide better supervision. Focusing on the orientations of objects, the first loss regularizes the consistency between each pseudo-label-prediction pair (includes a prediction and its corresponding pseudo label) with adaptive weights based on their orientation gap. Focusing on the layout of an image, the second loss regularizes the similarity and explicitly builds the many-to-many relation between the sets of pseudo-labels and predictions. Such a global consistency constraint can further boost semi-supervised learning. Our experiments show that when trained with the two proposed losses, SOOD surpasses the state-of-the-art SSOD methods under various settings on the DOTA-v1.5 benchmark. The code will be available at https://github.com/HamPerdredes/SOOD.

引用

页码：15558 / 15567

页数：10

共 51 条

[1]

Arjovsky M, 2017, PR MACH LEARN RES, V70

[2]

Berthelot D, 2019, ADV NEUR IN, V32

[3] Dense Learning based Semi-Supervised Object Detection [J].

Chen, Binghui ;

Li, Pengyu ;

Chen, Xiang ;

Wang, Biao ;

Zhang, Lei ;

Hua, Xian-Sheng .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :4805-4814

[4]

Cuturi Marco, 2013, P ADV NEURAL INFORM, V26

[5]

Frogner C, 2015, ADV NEUR IN, V28

[6] OTA: Optimal Transport Assignment for Object Detection [J].

Ge, Zheng ;

Liu, Songtao ;

Liu, Zeming ;

Yoshie, Osamu ;

Sun, Jian .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :303-312

[7] Fast R-CNN [J].

Girshick, Ross .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448

[8]

Grandvalet Y., 2004, P NIPS

[9] Beyond Bounding-Box: Convex-hull Feature Adaptation for Oriented and Densely Packed Object Detection [J].

Guo, Zonghao ;

Liu, Chang ;

Zhang, Xiaosong ;

Jiao, Jianbin ;

Ji, Xiangyang ;

Ye, Qixiang .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :8788-8797

[10] ReDet: A Rotation-equivariant Detector for Aerial Object Detection [J].

Han, Jiaming ;

Ding, Jian ;

Xue, Nan ;

Xia, Gui-Song .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :2785-2794

← 1 2 3 4 5 6 →