Task-Wise Sampling Convolutions for Arbitrary-Oriented Object Detection in Aerial Images

Cited by: 7

Authors:

Huang, Zhanchao [1,2]

Li, Wei [3,4]

Xia, Xiang-Gen [5]

Wang, Hao [3,4]

Tao, Ran [3,4]

Affiliations:

[1] Fuzhou Univ, Acad Digital China, Key Lab Spatial Data Mining and Informat Sharing, Minist Educ, Fuzhou 350108, Peoples R China

[2] Fuzhou Univ, Natl & Local Joint Engn Res Ctr Satellite Geospat, Fuzhou 350108, Peoples R China

[3] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China

[4] Beijing Inst Technol, Beijing Key Lab Fract Signals & Syst, Beijing 100081, Peoples R China

[5] Univ Delaware, Dept Elect & Comp Engn, Newark, DE 19716 USA

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024

Keywords:

Feature extraction; Task analysis; Location awareness; Object detection; Convolutional neural networks; Remote sensing; Training; Arbitrary-oriented object detection (AOOD); convolutional neural network (CNN); dynamic label assignment; oriented bounding box (OBB); task-wise sampling strategy;

DOI:

10.1109/TNNLS.2024.3367331

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Arbitrary-oriented object detection (AOOD) has been widely applied to locate and classify objects with diverse orientations in remote sensing images. However, the inconsistent features for the localization and classification tasks in AOOD models may lead to ambiguity and low-quality object predictions, which constrains the detection performance. In this article, an AOOD method called task-wise sampling convolutions (TS-Conv) is proposed. TS-Conv adaptively samples task-wise features from respective sensitive regions and maps these features together in alignment to guide a dynamic label assignment for better predictions. Specifically, sampling positions of the localization convolution in TS-Conv are supervised by the oriented bounding box (OBB) prediction associated with spatial coordinates, while sampling positions and convolutional kernel of the classification convolution are designed to be adaptively adjusted according to different orientations for improving the orientation robustness of features. Furthermore, a dynamic task-consistent-aware label assignment (DTLA) strategy is developed to select optimal candidate positions and assign labels dynamically according to ranked task-aware scores obtained from TS-Conv. Extensive experiments on several public datasets covering multiple scenes, multimodal images, and multiple categories of objects demonstrate the effectiveness, scalability, and superior performance of the proposed TS-Conv.
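The idea of assigning labels from ranked task-aware scores can be illustrated with a minimal sketch. This is not the DTLA formulation from the article: the scoring rule (classification score raised to `alpha` times OBB IoU raised to `beta`, in the style of task-aligned assigners), the exponent values, the `top_k` parameter, and the function names are all assumptions made for illustration.

```python
from typing import List


def task_aware_scores(cls_scores: List[float], ious: List[float],
                      alpha: float = 1.0, beta: float = 6.0) -> List[float]:
    """Combine classification confidence and localization quality.

    Assumed scoring rule (not taken from the article): s**alpha * iou**beta.
    """
    return [(s ** alpha) * (u ** beta) for s, u in zip(cls_scores, ious)]


def assign_labels(cls_scores: List[float], ious: List[float],
                  top_k: int = 2) -> List[int]:
    """Mark the top_k candidates by task-aware score as positives (1)."""
    scores = task_aware_scores(cls_scores, ious)
    # Rank candidate indices from highest to lowest task-aware score.
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    positives = set(ranked[:top_k])
    return [1 if i in positives else 0 for i in range(len(scores))]


# Hypothetical candidates for one ground-truth object:
# per-candidate classification score and IoU with the ground-truth OBB.
cls_scores = [0.9, 0.8, 0.3, 0.6]
ious = [0.5, 0.9, 0.95, 0.7]
labels = assign_labels(cls_scores, ious, top_k=2)
print(labels)
```

In this toy input, the candidate with the highest classification score is not selected because its localization quality is poor, which is the kind of task inconsistency a joint score is meant to penalize.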

Pages: 1 / 15

Page count: 15

References: 65 in total

