Human-Machine Hybrid Strategy for Defect Semantic Segmentation With Limited Data

被引:2
|
作者
Shan, Dexing [1 ]
Zhang, Yunzhou [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Defect segmentation; few-shot learning; human-machine hybrid; multimodal data; neural network application; INSPECTION;
D O I
10.1109/TIM.2023.3341105
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the field of industrial intelligent inspection, two problems need to be solved urgently. One is how to integrate complex industrial inspection expertise into algorithms. The second is how to predict other similar products that have not been seen through prior knowledge of existing products. Solving these two problems can effectively solve the problems of insufficient samples and poor adaptability. Therefore, we propose a human-machine hybrid strategy to construct a promptable industrial scene segmentation task by human input of easy-to-obtain defect-free images or text as guidance information. Specifically, for unseen categories in unseen scenes, we propose to use a defect-free image as the guidance information, establish accurate pixel-level correspondence between defect-free and query images, and decode correlation maps through the 3-D residual convolutional network. For unseen categories in seen scenes, we propose a text-image model based on contrastive language-image pretraining (CLIP) to extract visual features aligned with the text. To establish more accurate text guidance, we obtain the holistic pattern of text features through global self-attention and establish global guidance between text and visual features. Extensive experiments demonstrate that our proposed approach, requiring no additional training or fine-tuning and solely relying on defect-free images and text as guidance, effectively segments previously unseen defect categories in both seen and unseen scenes. This method efficiently addresses the challenge of limited defect samples in the industrial domain.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [1] Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs
    Mottaghi, Roozbeh
    Fidler, Sanja
    Yao, Jian
    Urtasun, Raquel
    Parikh, Devi
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3143 - 3150
  • [2] Semantic inference in the human-machine communication
    Hadacz, L
    TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 353 - 356
  • [3] Multi-modal background-aware for defect semantic segmentation with limited data
    Shan, Dexing
    Zhang, Yunzhou
    Liu, Shitong
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024,
  • [4] Human-machine knowledge hybrid augmentation method for surface defect detection based few-data learning
    Gong, Yu
    Wang, Xiaoqiao
    Zhou, Chichun
    Ge, Maogen
    Liu, Conghu
    Zhang, Xi
    JOURNAL OF INTELLIGENT MANUFACTURING, 2025, 36 (03) : 1723 - 1742
  • [5] HUMAN-MACHINE COLLABORATION FOR MEDICAL IMAGE SEGMENTATION
    Ravanbakhsh, Mahdyar
    Tschernezki, Vadim
    Last, Felix
    Klein, Tassilo
    Batmanghelich, Kayhan
    Tresp, Volker
    Nabi, Moin
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1040 - 1044
  • [6] Semantic Segmentation from Limited Training Data
    Milan, A.
    Pham, T.
    Vijay, K.
    Morrison, D.
    Tow, A. W.
    Liu, L.
    Erskine, J.
    Grinover, R.
    Gurman, A.
    Hunn, T.
    Kelly-Boxall, N.
    Lee, D.
    McTaggart, M.
    Rallos, G.
    Razjigaev, A.
    Rowntree, T.
    Shen, T.
    Smith, R.
    Wade-McCue, S.
    Zhuang, Z.
    Lehnert, C.
    Lin, G.
    Reid, I.
    Corke, P.
    Leitner, J.
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 1908 - 1915
  • [7] A HUMAN ELEMENT IN A HUMAN-MACHINE HYBRID OF ARTIFICIAL INTELLIGENCE
    Chirva, Daria
    LOGOS, 2024, 34 (06): : 203 - 216
  • [8] Conceptualizing hybrid human-machine systems and interaction
    Buxbaum-Conradi, Sonja
    Redlich, Tobias
    Branding, Jan-Hauke
    PROCEEDINGS OF THE 49TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS 2016), 2016, : 551 - 559
  • [9] Human-Machine Hybrid Peer Grading in SPOCs
    Han, Yong
    Wu, Wenjun
    Yan, Yitao
    Zhang, Lijun
    IEEE ACCESS, 2020, 8 : 220922 - 220934
  • [10] A Language Model for Human-Machine Dialog: The Reversible Semantic Grammar
    Lehuen, Jerome
    Lemeunier, Thierry
    NLPCS 2010: NATURAL LANGUAGE PROCESSING AND COGNITIVE SCIENCE, 2010, : 17 - 26