DSNet: A dynamic squeeze network for real-time weld seam image segmentation

Cited by: 11

Authors:

Chen, Jia ^{[1,2]}

Wang, Congcong ^{[1,2]}

Shi, Fan ^{[1,2]}

Kaaniche, Mounir ^{[3,4]}

Zhao, Meng ^{[1,2]}

Jing, Yan ^{[5]}

Chen, Shengyong ^{[1,2]}

Affiliations:

[1] Tianjin Univ Technol, Sch Comp Sci & Engn, Tianjin 300384, Peoples R China

[2] Tianjin Univ Technol, Lab Comp Vis & Syst, Minist Educ, Tianjin 300384, Peoples R China

[3] Univ Sorbonne Paris Nord, L2TI, UR 3043, F-93430 Villetaneuse, France

[4] Univ Paris Saclay, Cent Supelec, CVN, INRIA, F-91190 Gif Sur Yvette, France

[5] UMA Intelligent Grp, Zhenjiang, Peoples R China

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2024 / Vol. 133

Funding:

National Natural Science Foundation of China;

Keywords:

Robot welding; Structured light vision; Seam tracking; Image segmentation; Lightweight network; CONVOLUTION OPERATOR; SYSTEM;

DOI:

10.1016/j.engappai.2024.108278

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Image noise generated by the welding process, such as arc light, splash, and smoke, makes it difficult for a laser vision sensor-based welding robot to locate the weld seam and weld automatically with accuracy. Deep learning-based approaches currently surpass traditional methods in flexibility and robustness, but their high computational cost conflicts with the real-time requirements of automated welding. In this paper, we propose an efficient hybrid architecture combining a Convolutional Neural Network (CNN) and a transformer, referred to as Dynamic Squeeze Network (DSNet), for real-time weld seam segmentation. More precisely, a lightweight segmentation framework is developed to leverage the advantages of the transformer structure without significantly increasing computational overhead. To this end, an efficient encoder designed to increase feature diversity substantially improves encoding performance. Moreover, we propose a plug-and-play lightweight attention module that generates more effective attention weights by exploiting statistical information of weld seam data and introducing linear priors. Extensive experiments on weld seam images using an NVIDIA GTX 1050Ti show that our approach reduces the number of parameters by 54x, decreases computational complexity by 34x, and improves inference speed by 33x compared to the baseline method TransUNet. DSNet achieves superior accuracy (78.01% IoU, 87.64% Dice) and speed (100 FPS) with lower model complexity and computational burden than most state-of-the-art methods. The code is available at https://github.com/hackerschen/DSNet.
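The abstract reports accuracy as IoU and Dice. As a minimal sketch of how these two overlap metrics are commonly computed for a single pair of binary masks (this is not taken from the DSNet repository; the function name `iou_and_dice` and the toy masks are hypothetical, and the paper's exact averaging protocol is not stated here):

```python
def iou_and_dice(pred, target):
    """Return (IoU, Dice) for two equally sized binary masks.

    Masks are nested lists of 0/1 values (hypothetical toy format).
    """
    intersection = 0
    pred_area = 0
    target_area = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                intersection += 1
            if p:
                pred_area += 1
            if t:
                target_area += 1
    union = pred_area + target_area - intersection
    if union == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0, 1.0
    iou = intersection / union
    dice = 2 * intersection / (pred_area + target_area)
    return iou, dice


# Toy example: predicted seam mask vs. ground-truth seam mask.
pred = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
target = [
    [0, 1, 1, 0],
    [0, 0, 1, 1],
    [0, 0, 0, 0],
]
iou, dice = iou_and_dice(pred, target)
print(f"IoU = {iou:.2f}, Dice = {dice:.2f}")  # IoU = 0.60, Dice = 0.75
```

For a single mask pair the two metrics are tied by Dice = 2·IoU / (1 + IoU); scores averaged over a dataset need not satisfy this relation exactly.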


Pages: 12

