Robust Semantic Segmentation for Automatic Crack Detection Within Pavement Images Using Multi-Mixing of Global Context and Local Image Features

被引:5
|
作者
Zhang, Hang [1 ]
Zhang, Allen A. [1 ]
Dong, Zishuo [1 ]
He, Anzheng [1 ]
Liu, Yang [2 ]
Zhan, You [1 ]
Wang, Kelvin C. P. [2 ]
机构
[1] Southwest Jiaotong Univ, Sch Civil Engn, Chengdu 610031, Sichuan, Peoples R China
[2] Oklahoma State Univ, Sch Civil & Environm Engn, Stillwater, OK 74078 USA
基金
中国国家自然科学基金;
关键词
Pavement cracks; deep learning; transformer; convolutional neural network; graph network; ARTIFICIAL NEURAL-NETWORK; ATTENTION;
D O I
10.1109/TITS.2024.3360263
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Accurate identification of cracks at the pixel level on intricate asphalt pavements represents a crucial challenge in the domain of intelligent pavement assessment. The current advanced deep-learning networks encounter limitations in simultaneously capturing both the global context and local features of cracks, leading to discontinuous segmentation results and suboptimal recovery of local details. This paper proposes a robust architecture named Mix-Graph CrackNet to present an efficacious solution for this challenge. The Mix-Graph CrackNet, as proposed, is designed to mix the global context and local features multiple times, allowing for a comprehensively understanding of the essential features. Specifically, this paper develops the learnable parallel convolutional-Transformer mixing module to parallelly capture the sophisticated local features as well as the crucial global context. In addition, a new fusion unit is devised in the paper and deployed in the learnable parallel convolutional-Transformer mixing module. The proposed fusion unit is capable of effectively mixing contextual features extracted at both global and local scales while retaining an abundant level of textural details germane to the crack. Moreover, this paper constructs a graph-based skip connection that functions as a shortcut connecting the encoder and decoder, with the primary objective of mitigating information decay. The experimental results are remarkable, with the Mix-Graph CrackNet achieving F-measure and Intersection-Over-Union of 90.37% and 82.43%, respectively, on 1000 testing images. Based on the performance evaluations conducted on both public and private datasets, the proposed Mix-Graph CrackNet architecture demonstrates a significantly superior detection accuracy in comparison to several state-of-the-art models for semantic segmentation.
引用
收藏
页码:11282 / 11303
页数:22
相关论文
共 3 条
  • [1] Automatic Pavement Crack Detection in Multisource Fusion Images Using Similarity and Difference Features
    Liang, Haorong
    Qiu, Dongwei
    Ding, Ke-Liang
    Zhang, Yunlong
    Wang, Yuzheng
    Wang, Xingyu
    Liu, Tiancheng
    Wan, Shanshan
    IEEE SENSORS JOURNAL, 2024, 24 (05) : 5449 - 5465
  • [2] ROBUST IMAGE-BASED CRACK DETECTION IN CONCRETE STRUCTURE USING MULTI-SCALE ENHANCEMENT AND VISUAL FEATURES
    Liu, Xiangzeng
    Ai, Yunfeng
    Scherer, Sebastian
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 2304 - 2308
  • [3] A robust and accurate approach to automatic Blood Vessel detection and segmentation from Angiography X-ray images using multi-stage Random Forests
    Gupta, Vipin
    Kale, Amit
    Sundar, Hari
    MEDICAL IMAGING 2012: COMPUTER-AIDED DIAGNOSIS, 2012, 8315