Recognition of penetration state in GTAW based on vision transformer using weld pool image

被引:26
作者
Wang, Zhenmin [1 ,2 ]
Chen, Haoyu [1 ]
Zhong, Qiming [1 ]
Lin, Sanbao [2 ]
Wu, Jianwen [1 ]
Xu, Mengjia [1 ]
Zhang, Qin [3 ]
机构
[1] South China Univ Technol, Sch Mech & Automot Engn, Guangzhou 510641, Peoples R China
[2] Harbin Inst Technol, State Key Lab Adv Welding & Joining, Harbin 150001, Peoples R China
[3] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510641, Peoples R China
基金
中国国家自然科学基金;
关键词
Gas tungsten arc welding; Penetration recognition; Deep learning; Vision transformer; Self-attention; DEFECTS DETECTION; NEURAL-NETWORK; AL-ALLOY; PREDICTION; SURFACE; BEHAVIOR; CAMERA; DEPTH; WIDTH;
D O I
10.1007/s00170-021-08538-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic welding penetration recognition is a significant and challenging research direction to improve the manufacturing quality and intelligence level of gas tungsten arc welding (GTAW). In this paper, a data-driven deep learning approach based on vision transformer (ViT) is proposed to implement welding penetration recognition. To meet the requirement of the model training on data, a large weld pool image dataset composed of four penetration categories is constructed. Twelve ViT models with different architectures are constructed and trained from scratch to explore their feasibility for penetration recognition. Then, pretrained on ImageNet, a state-of-art ViT-B/16 model, is employed, which alleviates the issue of modeling complexity and data insufficiency and improves the validation accuracy by 4.45%. The transfer learning results demonstrate a 98.11% as the test accuracy for the ViT model. Compared with the other conventional recognition methods, the proposed model has fewer misclassifications and superior generalization performance due to self-attention. Furthermore, the attention rollout method is applied to visualize the image region of interest (ROI), confirming the reliability and effectiveness of the developed approach. This study expands the application scope of the ViT to penetration recognition and achieves remarkable effects, laying a reliable foundation for automated welding quality optimization in GTAW.
引用
收藏
页码:5439 / 5452
页数:14
相关论文
共 46 条
[1]  
Abnar S, 2020, ARXIV200500928
[2]   Review of deep learning: concepts, CNN architectures, challenges, applications, future directions [J].
Alzubaidi, Laith ;
Zhang, Jinglan ;
Humaidi, Amjad J. ;
Al-Dujaili, Ayad ;
Duan, Ye ;
Al-Shamma, Omran ;
Santamaria, J. ;
Fadhel, Mohammed A. ;
Al-Amidie, Muthana ;
Farhan, Laith .
JOURNAL OF BIG DATA, 2021, 8 (01)
[3]   Automated defect classification of SS304 TIG welding process using visible spectrum camera and machine learning [J].
Bacioiu, Daniel ;
Melton, Geoff ;
Papaelias, Mayorkinos ;
Shaw, Rob .
NDT & E INTERNATIONAL, 2019, 107
[4]   Automated defect classification of Aluminium 5083 TIG welding using HDR camera and neural networks [J].
Bacioiv, Daniel ;
Melton, Geoff ;
Papaelias, Mayorkinos ;
Shaw, Rob .
JOURNAL OF MANUFACTURING PROCESSES, 2019, 45 (603-613) :603-613
[5]   Predicting the depth of penetration and weld bead width from the infra red thermal image of the weld pool using artificial neural network modeling [J].
Chokkalingham, S. ;
Chandrasekhar, N. ;
Vasudevan, M. .
JOURNAL OF INTELLIGENT MANUFACTURING, 2012, 23 (05) :1995-2001
[6]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[7]  
Dosovitskiy A., 2020, An image is worth 16 x16 words: transformers for image recognition at scale
[8]   Bringing Transparency Design into Practice [J].
Eiband, Malin ;
Schneider, Hanna ;
Bilandzic, Mark ;
Fazekas-Con, Julian ;
Haug, Mareike ;
Hussmann, Heinrich .
IUI 2018: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2018, :211-223
[9]   DeepWelding: A Deep Learning Enhanced Approach to GTAW Using Multisource Sensing Images [J].
Feng, Yunhe ;
Chen, Zongyao ;
Wang, Dali ;
Chen, Jian ;
Feng, Zhili .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (01) :465-474
[10]   Fuzzy C-means with Spatiotemporal Constraints [J].
Gao, Xu ;
Yu, Fusheng .
2016 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C), 2016, :337-340