A comprehensive end-to-end computer vision framework for restoration and recognition of low-quality engineering drawings

Cited by: 0
Authors
Yang, Lvyang [1]
Zhang, Jiankang [2]
Li, Huaiqiang [2]
Ren, Longfei [2]
Yang, Chen [1]
Wang, Jingyu [1]
Shi, Dongyuan [1]
Affiliations
[1] Huazhong Univ Sci & Technol, State Key Lab Adv Electromagnet Technol, Wuhan 430074, Hubei, Peoples R China
[2] Northwest Branch State Grid Corp China, Xian 710048, Shaanxi, Peoples R China
Keywords
Collaborative learning; Computer vision; Deep learning; Engineering drawing; Graphical symbol recognition; Image restoration; CLASSIFICATION; DIGITIZATION; NETWORK;
DOI
10.1016/j.engappai.2024.108524
Chinese Library Classification (CLC)
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The digitization of engineering drawings is crucial for efficient reuse, distribution, and archiving. Existing computer vision approaches for digitizing engineering drawings typically assume that the input drawings are of high quality. In practice, however, engineering drawings are often blurred and distorted by improper scanning, storage, and transmission, which may undermine the effectiveness of existing approaches. This paper focuses on restoring and recognizing low-quality engineering drawings and proposes an end-to-end framework that improves the quality of the drawings and identifies the graphical symbols on them. The framework uses K-means clustering to classify engineering drawing patches into simple-texture and complex-texture patches based on their gray level co-occurrence matrix statistics. Computer vision operations and a modified Enhanced Super-Resolution Generative Adversarial Network (ESRGAN) model are then used to improve the quality of the simple and complex patches, respectively. A modified Faster Region-based Convolutional Neural Network (Faster R-CNN) model is used to recognize the quality-enhanced graphical symbols. Additionally, a multi-stage task-driven collaborative learning strategy is proposed to train the modified ESRGAN and Faster R-CNN models so that the resolution of engineering drawings is improved in the direction that facilitates graphical symbol recognition rather than human visual perception. A synthetic data generation method is also proposed to construct quality-degraded samples for training the framework. Experiments on real-world electrical diagrams show that the proposed framework achieves an accuracy of 98.98% and a recall of 99.33%, demonstrating its superiority over previous approaches. Moreover, the framework is integrated into a widely used power system software application to showcase its practicality.
The reference code and data can be found at https://github.com/Lattle-y/AIrecognition-for-lq-ed.git
Future work will focus on improving the generalizability of the proposed framework to different quality degradation scenarios and extending the application to other engineering domains.
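The patch-classification step described in the abstract (K-means over gray level co-occurrence matrix statistics) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which GLCM statistics are used, so the choice of contrast, homogeneity, and energy, the 8-level quantization, the horizontal-neighbour offset, and the function names `glcm_features` and `split_simple_complex` are all assumptions made for illustration. It uses only the Python standard library.

```python
from typing import List, Tuple

Patch = List[List[int]]              # grayscale patch, pixel values in 0..255
Features = Tuple[float, float, float]


def glcm_features(patch: Patch, levels: int = 8) -> Features:
    """Return (contrast, homogeneity, energy) of a symmetric GLCM built from
    horizontally adjacent pixel pairs. The statistic set is an assumption."""
    counts = [[0] * levels for _ in range(levels)]
    total = 0
    for row in patch:
        # Quantize 0..255 gray values into `levels` bins.
        q = [min(v * levels // 256, levels - 1) for v in row]
        for a, b in zip(q, q[1:]):
            counts[a][b] += 1
            counts[b][a] += 1        # count both directions (symmetric GLCM)
            total += 2
    if total == 0:
        raise ValueError("patch must be at least 2 pixels wide")
    contrast = homogeneity = energy = 0.0
    for i in range(levels):
        for j in range(levels):
            p = counts[i][j] / total
            contrast += p * (i - j) ** 2
            homogeneity += p / (1 + (i - j) ** 2)
            energy += p * p
    return contrast, homogeneity, energy


def _sq_dist(a: Features, b: Features) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def split_simple_complex(patches: List[Patch], iters: int = 50) -> List[int]:
    """2-means over GLCM features. Returns 0 (simple) or 1 (complex) per patch.
    Centers start at the lowest- and highest-contrast patches, an illustrative
    choice that keeps the result deterministic."""
    points = [glcm_features(p) for p in patches]
    centers = [min(points, key=lambda f: f[0]), max(points, key=lambda f: f[0])]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest center by squared Euclidean distance.
        labels = [min((0, 1), key=lambda c: _sq_dist(f, centers[c])) for f in points]
        # Update step: move each center to the mean of its members.
        new_centers = []
        for c in (0, 1):
            members = [f for f, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centers.append(centers[c])  # keep an empty cluster's center
        if new_centers == centers:
            break
        centers = new_centers
    return labels


if __name__ == "__main__":
    flat = [[255] * 8 for _ in range(8)]                                     # blank paper
    checker = [[255 * ((r + c) % 2) for c in range(8)] for r in range(8)]   # dense texture
    print(split_simple_complex([flat, checker]))
```

In a pipeline like the one the abstract outlines, patches labelled 0 would presumably go to the lighter computer vision operations and patches labelled 1 to the super-resolution model. The features here are left unscaled, so contrast dominates the distance; a real implementation would likely normalize them first.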
Pages: 13