Human-Guided Data Augmentation via Diffusion Model for Surface Defect Recognition Under Limited Data

被引:0
|
作者
Fang, Tiyu [1 ]
Zhang, Mingxin [1 ]
Song, Ran [1 ]
Li, Xiaolei [1 ]
Wei, Zhiyuan [1 ]
Zhang, Wei [1 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
基金
中国国家自然科学基金;
关键词
Diffusion models; Generative adversarial networks; Training; Data augmentation; Image segmentation; Training data; Reinforcement learning; Image synthesis; Data models; Tires; Diffusion model; reinforcement learning (RL) from human feedback; surface defect recognition (SDR); INSPECTION;
D O I
10.1109/TIM.2025.3541684
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Surface defect recognition (SDR) with limited data is a common challenge in industrial production. Recent methods generally utilize generative adversarial networks (GANs) to generate defect samples as training data for improving the performance of SDR. However, the instability of GAN training often results in uncontrollable and low-quality samples under severe data constraints, making it difficult for the existing methods to effectively handle SDR tasks with different granularities. To address the issue, this article proposes a human-guided data augmentation method under extremely limited data. Its core idea is to introduce human feedback into a diffusion model for synthesizing controllable and high-quality defect samples by reinforcement learning (RL), aiming to improve various-granularity SDR tasks such as defect classification and segmentation. First, a conditional diffusion model (CDM) is constructed to generate controllable defect samples using semantic labels, which learn defect distribution from a small number of annotated defect samples. Then, a reward model is designed to evaluate the outcome of the CDM by human feedback. Next, based on the trained reward model, the CDM is further optimized by proximal policy optimization (PPO). Finally, the refined CDM is used to generate high-quality defect samples as training data for enhancing defect classification and segmentation. Extensive experiments on NEU-Seg, magnetic-tile (MT), and the collected Tire datasets demonstrate that our method outperforms the state-of-the-art generative methods in terms of generated image quality. Furthermore, the performance of defect classification and segmentation has also shown significant enhancements based on the generated samples, with a maximum improvement of 16.90% in accuracy and 12.85% in mean intersection over union (mIoU) compared to results obtained without data augmentation.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Radar Target Recognition Algorithm Based on Data Augmentation and WACGAN with a Limited Training Data
    Zhuke-Fan
    Wang J.-G.
    Liu Y.-J.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (06): : 1124 - 1131
  • [22] Data Augmentation via Latent Diffusion for Saliency Prediction
    Aydemir, Bahar
    Bhattacharjee, Deblina
    Zhang, Tong
    Salzmann, Mathieu
    Susstrunk, Sabine
    COMPUTER VISION - ECCV 2024, PT LXXVIII, 2025, 15136 : 360 - 377
  • [23] Data Augmentation of Surface Electromyography for Hand Gesture Recognition
    Tsinganos, Panagiotis
    Cornelis, Bruno
    Cornelis, Jan
    Jansen, Bart
    Skodras, Athanassios
    SENSORS, 2020, 20 (17) : 1 - 23
  • [24] Enhancing human action recognition with GAN-based data augmentation
    Pulakurthi, Prasanna Reddy
    de Melo, Celso M.
    Rao, Raghuveer
    Rabbani, Majid
    SYNTHETIC DATA FOR ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING: TOOLS, TECHNIQUES, AND APPLICATIONS II, 2024, 13035
  • [25] Data augmentation for solving industrial recognition tasks with underrepresented defect classes
    Wunsch, Lennard
    Anding, Katharina
    Polte, Galina
    Liu, Kun
    Notni, Gunther
    ACTA IMEKO, 2023, 12 (04):
  • [26] Enhancing plant health classification via diffusion model-based data augmentation
    Lee, Younghoon
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [27] Label-Guided Data Augmentation for Chinese Named Entity Recognition
    Jiang, Miao
    Chen, Honghui
    APPLIED SCIENCES-BASEL, 2025, 15 (05):
  • [28] Phased Data Augmentation for Training a Likelihood-Based Generative Model with Limited Data
    Mimura, Yuta
    ITE TRANSACTIONS ON MEDIA TECHNOLOGY AND APPLICATIONS, 2025, 13 (01): : 126 - 135
  • [29] Industrial Process Soft Sensing Based on Bidirectional Optimization Learning of Data Augmentation and Prediction Models Under Limited Data
    Li, He
    Wang, Zhaojing
    Li, Li
    Yan, Xiaoyun
    Hu, Xinrong
    Li, Lijun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [30] Human Activity Recognition Based on Multichannel Convolutional Neural Network With Data Augmentation
    Shi, Wenbing
    Fang, Xianjin
    Yang, Gaoming
    Huang, Ji
    IEEE ACCESS, 2022, 10 : 76596 - 76606