A General Gaussian Heatmap Label Assignment for Arbitrary-Oriented Object Detection

被引:105
作者
Huang, Zhanchao [1 ,2 ]
Li, Wei [1 ,2 ]
Xia, Xiang-Gen [3 ]
Tao, Ran [1 ,2 ]
机构
[1] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Beijing Key Lab Fract Signals & Syst, Beijing 100081, Peoples R China
[3] Univ Delaware, Dept Elect & Comp Engn, Newark, DE 19716 USA
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Task analysis; Shape; Training; Object detection; Heating systems; Feature extraction; Convolutional neural networks; Arbitrary-oriented object; convolutional neural network; gaussian heatmap; label assignment; object detection;
D O I
10.1109/TIP.2022.3148874
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, many arbitrary-oriented object detection (AOOD) methods have been proposed and attracted widespread attention in many fields. However, most of them are based on anchor-boxes or standard Gaussian heatmaps. Such label assignment strategy may not only fail to reflect the shape and direction characteristics of arbitrary-oriented objects, but also have high parameter-tuning efforts. In this paper, a novel AOOD method called General Gaussian Heatmap Label Assignment (GGHL) is proposed. Specifically, an anchor-free object-adaptation label assignment (OLA) strategy is presented to define the positive candidates based on two-dimensional (2D) oriented Gaussian heatmaps, which reflect the shape and direction features of arbitrary-oriented objects. Based on OLA, an oriented-bounding-box (OBB) representation component (ORC) is developed to indicate OBBs and adjust the Gaussian center prior weights to fit the characteristics of different objects adaptively through neural network learning. Moreover, a joint-optimization loss (JOL) with area normalization and dynamic confidence weighting is designed to refine the misalign optimal results of different subtasks. Extensive experiments on public datasets demonstrate that the proposed GGHL improves the AOOD performance with low parameter-tuning and time costs. Furthermore, it is generally applicable to most AOOD methods to improve their performance including lightweight models on embedded platforms.
引用
收藏
页码:1895 / 1910
页数:16
相关论文
共 48 条
[1]   DRBox-v2: An Improved Detector With Rotatable Boxes for Target Detection in SAR Images [J].
An, Quanzhi ;
Pan, Zongxu ;
Liu, Lei ;
You, Hongjian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (11) :8333-8349
[2]   Prime Sample Attention in Object Detection [J].
Cao, Yuhang ;
Chen, Kai ;
Loy, Chen Change ;
Lin, Dahua .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11580-11588
[3]  
CHEN Zhiming, 2020, Lect. Notes Comput. Sci. (Incl. Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinforma.), P195, DOI [10.1007/978-3-030-58558-7_12, DOI 10.1007/978-3-030-58558, DOI 10.1007/978-3-030-58558-712]
[4]   Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection [J].
Cheng, Gong ;
Han, Junwei ;
Zhou, Peicheng ;
Xu, Dong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) :265-278
[5]  
Cheng Gong, 2021, ARXIV211001931
[6]   Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges [J].
Ding, Jian ;
Xue, Nan ;
Xia, Gui-Song ;
Bai, Xiang ;
Yang, Wen ;
Yang, Michael Ying ;
Belongie, Serge ;
Luo, Jiebo ;
Datcu, Mihai ;
Pelillo, Marcello ;
Zhang, Liangpei .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) :7778-7796
[7]   Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].
Ding, Jian ;
Xue, Nan ;
Long, Yang ;
Xia, Gui-Song ;
Lu, Qikai .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853
[8]  
Ghiasi G, 2018, ADV NEUR IN, V31
[9]   Align Deep Features for Oriented Object Detection [J].
Han, Jiaming ;
Ding, Jian ;
Li, Jie ;
Xia, Gui-Song .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[10]  
Han Jiaming, 2021, ARXIV210307733