A comprehensive end-to-end computer vision framework for restoration and recognition of low-quality engineering drawings

被引:0
|
作者
Yang, Lvyang [1 ]
Zhang, Jiankang [2 ]
Li, Huaiqiang [2 ]
Ren, Longfei [2 ]
Yang, Chen [1 ]
Wang, Jingyu [1 ]
Shi, Dongyuan [1 ]
机构
[1] Huazhong Univ Sci & Technol, State Key Lab Adv Electromagnet Technol, Wuhan 430074, Hubei, Peoples R China
[2] Northwest Branch State Grid Corp China, Xian 710048, Shaanxi, Peoples R China
关键词
Collaborative learning; Computer vision; Deep learning; Engineering drawing; Graphical symbol recognition; Image restoration; CLASSIFICATION; DIGITIZATION; NETWORK;
D O I
10.1016/j.engappai.2024.108524
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The digitization of engineering drawings is crucial for efficient reuse, distribution, and archiving. Existing computer vision approaches for digitizing engineering drawings typically assume the input drawings have high quality. However, in reality, engineering drawings are often blurred and distorted due to improper scanning, storage, and transmission, which may jeopardize the effectiveness of existing approaches. This paper focuses on restoring and recognizing low-quality engineering drawings, where an end-to-end framework is proposed to improve the quality of the drawings and identify the graphical symbols on them. The framework uses K-means clustering to classify different engineering drawing patches into simple and complex texture patches based on their gray level co-occurrence matrix statistics. Computer vision operations and a modified Enhanced Super- Resolution Generative Adversarial Network (ESRGAN) model are then used to improve the quality of the two types of patches, respectively. A modified Faster Region-based Convolutional Neural Network (Faster R-CNN) model is used to recognize the quality-enhanced graphical symbols. Additionally, a multi-stage task-driven collaborative learning strategy is proposed to train the modified ESRGAN and Faster R-CNN models to improve the resolution of engineering drawings in the direction that facilitates graphical symbol recognition, rather than human visual perception. A synthetic data generation method is also proposed to construct quality-degraded samples for training the framework. Experiments on real-world electrical diagrams show that the proposed framework achieves an accuracy of 98.98% and a recall of 99.33%, demonstrating its superiority over previous approaches. Moreover, the framework is integrated into a widely-used power system software application to showcase its practicality. The reference codes and data can be found at https://github.com/Lattle-y/AIrecognition-for-lq-ed.git Future work will focus on improving the generalizability of the proposed framework to different quality degradation scenarios and extrapolating the application to different engineering domains.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] End-to-End Computer Vision Framework
    Orhei, Ciprian
    Mocofan, Muguras
    Vert, Silviu
    Vasiu, Radu
    2020 14TH INTERNATIONAL SYMPOSIUM ON ELECTRONICS AND TELECOMMUNICATIONS (ISETC), 2020, : 63 - 66
  • [2] An End-to-end Computer Vision System Architecture
    Zhang, Ling
    Zhou, Wei
    Zhang, Xiangyu
    Lou, Xin
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 2338 - 2342
  • [3] End-To-End Computer Vision Framework: An Open-Source Platform for Research and Education
    Orhei, Ciprian
    Vert, Silviu
    Mocofan, Muguras
    Vasiu, Radu
    SENSORS, 2021, 21 (11)
  • [4] An end-to-end computer vision methodology for quantitative metallography
    Rusanovsky, Matan
    Beeri, Ofer
    Oren, Gal
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [5] An end-to-end computer vision methodology for quantitative metallography
    Matan Rusanovsky
    Ofer Beeri
    Gal Oren
    Scientific Reports, 12
  • [6] An End-to-End Face Recognition System Evaluation Framework
    West Virginia University
  • [7] cosGCTFormer: An end-to-end driver state recognition framework
    Huang, Jing
    Liu, Tingnan
    Hu, Lin
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 261
  • [8] An end-to-end generative framework for video segmentation and recognition
    Kuehne, Hilde
    Gall, Juergen
    Serre, Thomas
    2016 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2016), 2016,
  • [9] Computer vision for transit travel time prediction: an end-to-end framework using roadside urban imagery
    Abdelhalim, Awad
    Zhao, Jinhua
    PUBLIC TRANSPORT, 2025, 17 (01) : 221 - 246
  • [10] End-to-end Quality of Service Framework for Heterogeneous Networks
    Baldi, Mario
    Giacomelli, Riccardo
    2009 IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT - WORKSHOPS, 2009, : 245 - 248