Robust Sewer Defect Detection With Text Analysis Based on Deep Learning

被引：28

作者：

Oh, Chanmi ^{[1
]}

Dang, L. Minh ^{[2
]}

Han, Dongil ^{[1
]}

Moon, Hyeonjoon ^{[1
]}

机构：

[1] Sejong Univ, Dept Comp Sci & Engn, Seoul 05006, South Korea

[2] FPT Univ, Dept Informat Technol, Ho Chi Minh City 70000, Vietnam

来源：

IEEE ACCESS | 2022年 / 10卷

基金：

新加坡国家研究基金会;

关键词：

Feature extraction; Pipelines; Videos; Inspection; Text recognition; Data mining; Manuals; Deep learning; text recognition; attention mechanism; defect detection; sewer; YOLO; CLASSIFICATION;

D O I：

10.1109/ACCESS.2022.3168660

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Sewerage systems play a vital role in building modern cities, providing appropriate ways to release liquid wastes. Due to the rapid expansion of cities, the deterioration of sewage pipes are increasing. Hence, systematic maintenance methods are require to overcome this problem. In most cases, sewer inspection is done by human inspectors, which is error-prone, time-consuming, costly, and lacking appropriate survey evaluations. In this paper, we introduce a new automated framework for detecting sewage pipe defects based on the attention mechanism, improved YOLOv5 architecture, and location information recognition from CCTV videos. The main contributions include (1) the addition of a micro-scale detection feature in the layers to improve the defect detection mechanism; (2) the application of a convolutional block attention module for better channel/spatial features; (3) construction of a larger defect-detection dataset for the 12 most common defect types; and (4) implementation of the TPS-ResNet-BiLSTM-Attn (TRBA) model for the text-information recognition mechanism from CCTV videos. The experimental results show that the proposed real-time sewer defect detection model achieved the mean average precision (mAP) of 75.9% on the proposed dataset, outperforming other standard models, such as YOLO and SSD.

引用

页码：46224 / 46237

页数：14

共 46 条

[1]

[Anonymous], 2021, YOLOV5 ULTRALYTICS O

[2]

[Anonymous], 2006, P 23 INT C MACH LEAR, DOI 10.1145/1143844.1143891

[3]

Atienza R., 2021, ARXIV210508582

[4] What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis [J].

Baek, Jeonghun ;

Kim, Geewook ;

Lee, Junyeop ;

Park, Sungrae ;

Han, Dongyoon ;

Yun, Sangdoo ;

Oh, Seong Joon ;

Lee, Hwalsuk .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :4714-4722

[5]

Bochkovskiy A., 2020, PREPRINT

[6]

Chen Y.-C., 2021, ARXIV211113327

[7] Automated detection of sewer pipe defects in closed-circuit television images using deep learning techniques [J].

Cheng, Jack C. P. ;

Wang, Mingzhu .

AUTOMATION IN CONSTRUCTION, 2018, 95 :155-171

[8] Focusing Attention: Towards Accurate Text Recognition in Natural Images [J].

Cheng, Zhanzhan ;

Bai, Fan ;

Xu, Yunlu ;

Zheng, Gang ;

Pu, Shiliang ;

Zhou, Shuigeng .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5086-5094

[9] Underground asset location and condition assessment technologies [J].

Costello, S. B. ;

Chapman, D. N. ;

Rogers, C. D. F. ;

Metje, N. .

TUNNELLING AND UNDERGROUND SPACE TECHNOLOGY, 2007, 22 (5-6) :524-542

[10]

Dang L. M., 2022, TUNNELLING UNDERGROU, V124

← 1 2 3 4 5 →