Dual-det : a fast detector for oriented object detection in aerial images

被引：2

作者：

Guan, Qiuyu ^{[1
]}

Qu, Zhenshen ^{[1
]}

Zhao, Pengbo ^{[1
]}

Zeng, Ming ^{[1
]}

Liu, Junyu ^{[2
]}

机构：

[1] Harbin Inst Technol, Space Control & Inertial Technol Res Ctr, Harbin, Peoples R China

[2] Kyoto Univ, Grad Sch Informat, Kyoto, Japan

来源：

INTERNATIONAL JOURNAL OF REMOTE SENSING | 2021年 / 42卷 / 24期

关键词：

VEHICLE DETECTION; REGION PROPOSAL; TEXT DETECTION;

D O I：

10.1080/01431161.2021.1995071

中图分类号：

TP7 [遥感技术];

学科分类号：

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

摘要：

Fast and accurate object detection in aerial images remains a challenging task. Usually, to better describe an object, oriented bounding boxes (OBBs) can better fit objects. Due to high background complexity and large object scale variation, single-angle anchor-based two-stage detectors are widely adopted, which offer better accuracy. However, the single-angle prediction has a small error tolerance for objects with a large aspect ratio, and the hyperparameters of the anchor-based network are difficult to adjust, and the number of hyperparameters is extremely large. Furthermore, the two-stage detection inference speed is slow, and it is difficult to achieve real-time detection. In this paper, we propose Dual-Det, a keypoint-based oriented object detector. We firstly propose a dual-angle with a short-side and ratio regression strategy (DASR), which uses the object centre and the length and angles of two diagonals to represent an object. A short side guided (SSG) loss is further added to guide the direction of the diagonal regression box. To improve the detection performance for dense and tiny objects, a lightweight supervised pixel attention learner is finally proposed. The experiment results show that Dual-Det achieves 90.23% mAP at 46FPS on HRSC2016, 90.83% mAP at 46FPS on UCAS-AOD and 72.00% mAP at 0.018 s per image in the inference phase on DOTA. The code will be open source on https://github.com/gqy4166000/ijrs_dasr.

引用

页码：9542 / 9564

页数：23

共 46 条

[1] Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery
Azimi, Seyed Majid
Vig, Eleonora
Bahmanyar, Reza
Koerner, Marco
Reinartz, Peter
[J]. COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 : 150 - 165
[2] Cascade R-CNN: Delving into High Quality Object Detection
Cai, Zhaowei
Vasconcelos, Nuno
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
[3] Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images
Cheng, Gong
Zhou, Peicheng
Han, Junwei
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12): : 7405 - 7415
[4] Deformable Convolutional Networks
Dai, Jifeng
Qi, Haozhi
Xiong, Yuwen
Li, Yi
Zhang, Guodong
Hu, Han
Wei, Yichen
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 764 - 773
[5] Dai J, 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1796, DOI 10.1109/ICIT.2016.7475036
[6] Histograms of oriented gradients for human detection
Dalal, N
Triggs, B
[J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
[7] Toward Fast and Accurate Vehicle Detection in Aerial Images Using Coupled Region-Based Convolutional Neural Networks
Deng, Zhipeng
Sun, Hao
Zhou, Shilin
Zhao, Juanping
Zou, Huanxin
[J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (08) : 3652 - 3664
[8] Learning RoI Transformer for Oriented Object Detection in Aerial Images
Ding, Jian
Xue, Nan
Long, Yang
Xia, Gui-Song
Lu, Qikai
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2844 - 2853
[9] Fast R-CNN
Girshick, Ross
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
[10] Rich feature hierarchies for accurate object detection and semantic segmentation
Girshick, Ross
Donahue, Jeff
Darrell, Trevor
Malik, Jitendra
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587

← 1 2 3 4 5 →