A Multi-level spatial feature fusion-based transformer for intelligent defect recognition with small samples toward smart manufacturing system

被引:3
|
作者
Gao, Yiping [1 ]
Li, Xinyu [1 ]
Gao, Liang [1 ,2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, Wuhan, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, Luoyu Rd, Wuhan 1037, Peoples R China
基金
中国国家自然科学基金;
关键词
Small-sample; intelligent defect recognition; multi-level feature fusion; focal-loss transformer; CONVOLUTIONAL NEURAL-NETWORK;
D O I
10.1080/0951192X.2023.2229270
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Smart is a development trend in manufacturing systems, and intelligent defect recognition is essential in smart manufacturing systems for both quality control and decision-making. But the recognition performance of the current methods still needs to be improved, as well as the interpretability. As a hotspot, Transformer (ViT) has outstanding performance and interpretability on image recognition, which has shown the potential for intelligent defect recognition. However, ViT requires large numbers of samples, while small-sample is common in real-world cases, which contain less information, and this will cause ViT overfitting and misclassifying. Thus, it impedes the application of ViT greatly. To address this problem, a multi-scale spatial feature fusion-based ViT is proposed for small-sample defect recognition. The proposed method simulates human vision to extract the multi-level features of defects, and three improved ViTs are built to fuse the features. The experimental results indicate that the proposed method achieves improved performance on small-sample defect recognition. Compared with the DL and defect recognition methods, the accuracies are improved by 1.5%similar to 20.07% on wood defects, and achieve an accuracy of 100% on steel defects. Furthermore, the visualization results also show that the proposed method is explicable, and it is helpful for defect analysis.
引用
收藏
页码:4 / 17
页数:14
相关论文
共 38 条
  • [1] A Hierarchical Feature Fusion-based Method for Defect Recognition with a Small Sample
    Gao, Yiping
    Gao, Liang
    Li, Xinyu
    2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2019, : 1048 - 1052
  • [2] Patchlpr: a multi-level feature fusion transformer network for LiDAR-based place recognition
    Sun, Yang
    Guo, Jianhua
    Wang, Haiyang
    Zhang, Yuhang
    Zheng, Jiushuai
    Tian, Bin
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 157 - 165
  • [3] Human Action Recognition Based On Multi-level Feature Fusion
    Xu, Y. Y.
    Xiao, G. Q.
    Tang, X. Q.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL APPLICATIONS (CISIA 2015), 2015, 18 : 353 - 355
  • [4] Multi-level feature fusion based Locality-Constrained Spatial Transformer network for video crowd counting
    Fang Y.
    Gao S.
    Li J.
    Luo W.
    He L.
    Hu B.
    Neurocomputing, 2020, 392 : 98 - 107
  • [5] A Multi-Biometric System Based on Multi-Level Hybrid Feature Fusion
    Mehraj, Haider
    Mir, Ajaz Hussain
    HERALD OF THE RUSSIAN ACADEMY OF SCIENCES, 2021, 91 (02) : 176 - 196
  • [6] Attention-based Multi-level Feature Fusion for Named Entity Recognition
    Yang, Zhiwei
    Chen, Hechang
    Zhang, Jiawei
    Ma, Jing
    Chang, Yi
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3594 - 3600
  • [7] Action Recognition Method Based on Multi-Level Feature Fusion and Temporal Extension
    Wu, Haoyuan
    Xiong, Xin
    Min, Weidong
    Zhao, Haoyu
    Wang, Wenxiang
    Computer Engineering and Applications, 2023, 59 (07) : 134 - 142
  • [8] A Multi-Biometric System Based on Multi-Level Hybrid Feature Fusion
    Haider Mehraj
    Ajaz Hussain Mir
    Herald of the Russian Academy of Sciences, 2021, 91 : 176 - 196
  • [9] Small-Scale Pedestrian Detection Based on Multi-level Feature Fusion
    Yan, Chaoqi
    Zhang, Hong
    Li, Xuliang
    Yang, Yifan
    Chen, Hao
    Yuan, Ding
    THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
  • [10] Road Recognition Based on Multi-scale Convolutional Network with Multi-level Feature Fusion
    Li, Ye
    Guo, Lili
    Xu, Lele
    Wang, Xianfeng
    Jin, Shan
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069