Arbitrary-oriented traffic participant detection and axis prediction for complex crossing road in aerial view

被引:0
作者
Li, Shuang [1 ,3 ]
Liu, Chunsheng [2 ,4 ]
Chen, Luchang [2 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Sch Informat & Automat Engn, Jinan, Peoples R China
[2] Shandong Univ, Sch Control Sci & Engn, Jinan, Peoples R China
[3] Qilu Univ Technol, Shandong Acad Sci, Sch Informat & Automat Engn, Jinan 250353, Peoples R China
[4] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
关键词
computer vision; image processing; intelligent transportation systems; object detection; object-oriented programming; neural net architecture; OBJECT DETECTION;
D O I
10.1049/itr2.12421
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Different from object detection in natural images, aerial-view object detection faces special challenges with large changes in object orientation and wide multi-scale distribution. Many methods based on Oriented Bounding Boxes (OBB) can reach accurate results, yet may face the problem of parameter mutation and axis prediction deviation. To address these problems, a two-stage detection framework is proposed for arbitrary-oriented traffic participant detection and axis prediction, named Axis Prediction Network. First, a Deformable Convolution Fusion Module (DCF-Module) is proposed to enhance the ability of FPN to extract multi-scale semantic features, for dealing with the multi-scale change of objects. Then, the axis heat-map prediction head network is proposed to fit the long axis of oriented objects labeled with Gaussian model, which uses pixel-by-pixel prediction heatmap to calculate the long axis of the object, avoiding the angle mutation of an OBB. Last, the long and short side prediction head network is proposed to predict the shape of an oriented object, avoiding the mutation of the width and height of an OBB. The experiments are conducted on the new built Crossroad dataset and the public DOTA dataset. Experiment results show that the proposed method achieve good performance in arbitrary-oriented traffic participant detection and axis prediction in aerial view.
引用
收藏
页码:2419 / 2431
页数:13
相关论文
共 56 条
[1]  
ai J., 2016, Advances in neural information processing systems, P379
[2]  
[Anonymous], 2019, IEEE T INTELL TRANSP, V20
[3]  
[Anonymous], 2022, IEEE T INTELL TRANSP, V23, P5615
[4]   Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery [J].
Azimi, Seyed Majid ;
Vig, Eleonora ;
Bahmanyar, Reza ;
Koerner, Marco ;
Reinartz, Peter .
COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 :150-165
[5]  
Cao D., 2022, REMOTE SENS-BASEL, V14, P1
[6]   Adaptive multi-level feature fusion and attention-based network for arbitrary-oriented object detection in remote sensing imagery [J].
Chen, Luchang ;
Liu, Chunsheng ;
Chang, Faliang ;
Li, Shuang ;
Nie, Zhaoying .
NEUROCOMPUTING, 2021, 451 :67-80
[7]   Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].
Ding, Jian ;
Xue, Nan ;
Long, Yang ;
Xia, Gui-Song ;
Lu, Qikai .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853
[8]   Point-Based Estimator for Arbitrary-Oriented Object Detection in Aerial Images [J].
Fu, Kun ;
Chang, Zhonghan ;
Zhang, Yue ;
Sun, Xian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (05) :4370-4387
[9]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[10]   Oriented Ship Detector for Remote Sensing Imagery Based on Pairwise Branch Detection Head and SAR Feature Enhancement [J].
He, Bokun ;
Zhang, Qingyi ;
Tong, Ming ;
He, Chu .
REMOTE SENSING, 2022, 14 (09)