A New Deep Learning Network for Automatic Bridge Detection from SAR Images Based on Balanced and Attention Mechanism

被引:34
作者
Chen, Lifu [1 ,2 ]
Weng, Ting [1 ,2 ]
Xing, Jin [3 ]
Pan, Zhouhao [4 ]
Yuan, Zhihui [1 ,2 ]
Xing, Xuemin [2 ,5 ]
Zhang, Peng [1 ,2 ]
机构
[1] Changsha Univ Sci & Technol, Sch Elect & Informat Engn, Changsha 410114, Peoples R China
[2] Changsha Univ Sci & Technol, Lab Radar Remote Sensing Applicat, Changsha 410014, Peoples R China
[3] Newcastle Univ, Sch Engn, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
[4] China Acad Elect & Informat Technol, Beijing 100041, Peoples R China
[5] Changsha Univ Sci & Technol, Sch Traff & Transportat Engn, Changsha 410114, Peoples R China
基金
中国国家自然科学基金;
关键词
deep learning; bridge detection; attention mechanism; feature pyramid; SAR image; automatic detection;
D O I
10.3390/rs12030441
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Bridge detection from Synthetic Aperture Radar (SAR) images has very important strategic significance and practical value, but there are still many challenges in end-to-end bridge detection. In this paper, a new deep learning-based network is proposed to identify bridges from SAR images, namely, multi-resolution attention and balance network (MABN). It mainly includes three parts, the attention and balanced feature pyramid (ABFP) network, the region proposal network (RPN), and the classification and regression. First, the ABFP network extracts various features from SAR images, which integrates the ResNeXt backbone network, balanced feature pyramid, and the attention mechanism. Second, extracted features are used by RPN to generate candidate boxes of different resolutions and fused. Furthermore, the candidate boxes are combined with the features extracted by the ABFP network through the region of interest (ROI) pooling strategy. Finally, the detection results of the bridges are produced by the classification and regression module. In addition, intersection over union (IOU) balanced sampling and balanced L1 loss functions are introduced for optimal training of the classification and regression network. In the experiment, TerraSAR data with 3-m resolution and Gaofen-3 data with 1-m resolution are used, and the results are compared with faster R-CNN and SSD. The proposed network has achieved the highest detection precision (P) and average precision (AP) among the three networks, as 0.877 and 0.896, respectively, with the recall rate (RR) as 0.917. Compared with the other two networks, the false alarm targets and missed targets of the proposed network in this paper are greatly reduced, so the precision is greatly improved.
引用
收藏
页数:19
相关论文
共 35 条
[1]  
[Anonymous], 2018, IEEE C COMPUTER VISI
[2]  
[Anonymous], C COMP VIS PATT REC
[3]  
[Anonymous], 2022, IEEE I CONF COMP VIS
[4]  
[Anonymous], IEEE T PATTERN ANAL
[5]  
[Anonymous], 2011, 2011 3 INT AS PAC C
[6]  
[Anonymous], J FINANC STAB
[7]  
Bai ZY, 2005, INT C COMMUN CIRCUIT, P847
[8]  
Chang Yonglei, 2017, Geomatics and Information Science of Wuhan University, V42, P762, DOI 10.13203/j.whugis20140828
[9]  
Chen Xin, 2008, Computer Engineering, V34, P195
[10]  
Guiting Wang, 2009, Proceedings of the 2009 2nd Asian-Pacific Conference on Synthetic Aperture Radar (APSAR 2009), P498, DOI 10.1109/APSAR.2009.5374121