Adaptive cropping shallow attention network for defect detection of bridge girder steel using unmanned aerial vehicle images

被引:12
作者
Mu, Zonghan [1 ,2 ]
Qin, Yong [1 ]
Yu, Chongchong [3 ]
Wu, Yunpeng [4 ]
Wang, Zhipeng [1 ]
Yang, Huaizhi [5 ]
Huang, Yonghui [5 ]
机构
[1] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100091, Peoples R China
[2] Beijing Jiaotong Univ, Sch Traff & Transportat, Beijing 100091, Peoples R China
[3] Beijing Technol & Business Univ, Sch Artificial Intelligence, Beijing 100048, Peoples R China
[4] Shijiazhuang Tiedao Univ, Sch Safety Engn & Emergency Management, Shijiazhuang 050043, Peoples R China
[5] Beijing Shanghai High Speed Railway Co Ltd, Beijing 100038, Peoples R China
来源
JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A | 2023年 / 24卷 / 03期
基金
中国国家自然科学基金;
关键词
Railway; Bridge; Unmanned aerial vehicle (UAV) image; Small object detection; Defect detection;
D O I
10.1631/jzus.A2200175
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Bridges are an important part of railway infrastructure and need regular inspection and maintenance. Using unmanned aerial vehicle (UAV) technology to inspect railway infrastructure is an active research issue. However, due to the large size of UAV images, flight distance, and height changes, the object scale changes dramatically. At the same time, the elements of interest in railway bridges, such as bolts and corrosion, are small and dense objects, and the sample data set is seriously unbalanced, posing great challenges to the accurate detection of defects. In this paper, an adaptive cropping shallow attention network (ACSANet) is proposed, which includes an adaptive cropping strategy for large UAV images and a shallow attention network for small object detection in limited samples. To enhance the accuracy and generalization of the model, the shallow attention network model integrates a coordinate attention (CA) mechanism module and an alpha intersection over union (alpha-IOU) loss function, and then carries out defect detection on the bolts, steel surfaces, and railings of railway bridges. The test results show that the ACSANet model outperforms the YOLOv5s model using adaptive cropping strategy in terms of the total mAP (an evaluation index) and missing bolt mAP by 5% and 30%, respectively. Also, compared with the YOLOv5s model that adopts the common cropping strategy, the total mAP and missing bolt mAP are improved by 10% and 60%, respectively. Compared with the YOLOv5s model without any cropping strategy, the total mAP and missing bolt mAP are improved by 40% and 67%, respectively.
引用
收藏
页码:243 / 256
页数:14
相关论文
共 45 条
[21]   Feedback-driven loss function for small object detection [J].
Liu, Gen ;
Han, Jin ;
Rong, Wenzhong .
IMAGE AND VISION COMPUTING, 2021, 111
[22]  
Liu J., 2019, P 4 INT C ELECT INFO, P695, DOI DOI 10.1007/978-981-15-2914-6_66
[23]   SSD: Single Shot MultiBox Detector [J].
Liu, Wei ;
Anguelov, Dragomir ;
Erhan, Dumitru ;
Szegedy, Christian ;
Reed, Scott ;
Fu, Cheng-Yang ;
Berg, Alexander C. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37
[24]  
Long A, 2021, P 16 E AS PACC STRUC, P119, DOI [10.1007/978-981-15-8079-6_11, DOI 10.1007/978-981-15-8079-6_11]
[25]   Framework for automated UAS-based structural condition assessment of bridges [J].
Morgenthal, Guido ;
Hallermann, Norman ;
Kersten, Jens ;
Taraben, Jakob ;
Debus, Paul ;
Helmrich, Marcel ;
Rodehorst, Volker .
AUTOMATION IN CONSTRUCTION, 2019, 97 :77-95
[26]   Better to Follow, Follow to Be Better: Towards Precise Supervision of Feature Super-Resolution for Small Object Detection [J].
Noh, Junhyug ;
Bae, Wonho ;
Lee, Wonhee ;
Seo, Jinhwan ;
Kim, Gunhee .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9724-9733
[27]  
Rahman Md Atiqur, 2016, Advances in Visual Computing. 12th International Symposium, ISVC 2016. Proceedings: LNCS 10072, P234, DOI 10.1007/978-3-319-50835-1_22
[28]   Automated Vision-Based Loosened Bolt Detection Using the Cascade Detector [J].
Ramana, Lovedeep ;
Choi, Wooram ;
Cha, Young-Jin .
SENSORS AND INSTRUMENTATION, VOL 5, 2017, :23-28
[29]   You Only Look Once: Unified, Real-Time Object Detection [J].
Redmon, Joseph ;
Divvala, Santosh ;
Girshick, Ross ;
Farhadi, Ali .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788
[30]   Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression [J].
Rezatofighi, Hamid ;
Tsoi, Nathan ;
Gwak, JunYoung ;
Sadeghian, Amir ;
Reid, Ian ;
Savarese, Silvio .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :658-666