Human-Guided Data Augmentation via Diffusion Model for Surface Defect Recognition Under Limited Data

被引：0

作者：

Fang, Tiyu ^{[1
]}

Zhang, Mingxin ^{[1
]}

Song, Ran ^{[1
]}

Li, Xiaolei ^{[1
]}

Wei, Zhiyuan ^{[1
]}

Zhang, Wei ^{[1
]}

机构：

[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China

来源：

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2025年 / 74卷

基金：

中国国家自然科学基金;

关键词：

Diffusion models; Generative adversarial networks; Training; Data augmentation; Image segmentation; Training data; Reinforcement learning; Image synthesis; Data models; Tires; Diffusion model; reinforcement learning (RL) from human feedback; surface defect recognition (SDR); INSPECTION;

D O I：

10.1109/TIM.2025.3541684

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Surface defect recognition (SDR) with limited data is a common challenge in industrial production. Recent methods generally utilize generative adversarial networks (GANs) to generate defect samples as training data for improving the performance of SDR. However, the instability of GAN training often results in uncontrollable and low-quality samples under severe data constraints, making it difficult for the existing methods to effectively handle SDR tasks with different granularities. To address the issue, this article proposes a human-guided data augmentation method under extremely limited data. Its core idea is to introduce human feedback into a diffusion model for synthesizing controllable and high-quality defect samples by reinforcement learning (RL), aiming to improve various-granularity SDR tasks such as defect classification and segmentation. First, a conditional diffusion model (CDM) is constructed to generate controllable defect samples using semantic labels, which learn defect distribution from a small number of annotated defect samples. Then, a reward model is designed to evaluate the outcome of the CDM by human feedback. Next, based on the trained reward model, the CDM is further optimized by proximal policy optimization (PPO). Finally, the refined CDM is used to generate high-quality defect samples as training data for enhancing defect classification and segmentation. Extensive experiments on NEU-Seg, magnetic-tile (MT), and the collected Tire datasets demonstrate that our method outperforms the state-of-the-art generative methods in terms of generated image quality. Furthermore, the performance of defect classification and segmentation has also shown significant enhancements based on the generated samples, with a maximum improvement of 16.90% in accuracy and 12.85% in mean intersection over union (mIoU) compared to results obtained without data augmentation.

引用

页数：16

共 50 条

[21] Radar Target Recognition Algorithm Based on Data Augmentation and WACGAN with a Limited Training Data
Zhuke-Fan
Wang J.-G.
Liu Y.-J.
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (06): : 1124 - 1131
[22] Data Augmentation via Latent Diffusion for Saliency Prediction
Aydemir, Bahar
Bhattacharjee, Deblina
Zhang, Tong
Salzmann, Mathieu
Susstrunk, Sabine
COMPUTER VISION - ECCV 2024, PT LXXVIII, 2025, 15136 : 360 - 377
[23] Data Augmentation of Surface Electromyography for Hand Gesture Recognition
Tsinganos, Panagiotis
Cornelis, Bruno
Cornelis, Jan
Jansen, Bart
Skodras, Athanassios
SENSORS, 2020, 20 (17) : 1 - 23
[24] Enhancing human action recognition with GAN-based data augmentation
Pulakurthi, Prasanna Reddy
de Melo, Celso M.
Rao, Raghuveer
Rabbani, Majid
SYNTHETIC DATA FOR ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING: TOOLS, TECHNIQUES, AND APPLICATIONS II, 2024, 13035
[25] Data augmentation for solving industrial recognition tasks with underrepresented defect classes
Wunsch, Lennard
Anding, Katharina
Polte, Galina
Liu, Kun
Notni, Gunther
ACTA IMEKO, 2023, 12 (04):
[26] Enhancing plant health classification via diffusion model-based data augmentation
Lee, Younghoon
MULTIMEDIA SYSTEMS, 2025, 31 (02)
[27] Label-Guided Data Augmentation for Chinese Named Entity Recognition
Jiang, Miao
Chen, Honghui
APPLIED SCIENCES-BASEL, 2025, 15 (05):
[28] Phased Data Augmentation for Training a Likelihood-Based Generative Model with Limited Data
Mimura, Yuta
ITE TRANSACTIONS ON MEDIA TECHNOLOGY AND APPLICATIONS, 2025, 13 (01): : 126 - 135
[29] Industrial Process Soft Sensing Based on Bidirectional Optimization Learning of Data Augmentation and Prediction Models Under Limited Data
Li, He
Wang, Zhaojing
Li, Li
Yan, Xiaoyun
Hu, Xinrong
Li, Lijun
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
[30] Human Activity Recognition Based on Multichannel Convolutional Neural Network With Data Augmentation
Shi, Wenbing
Fang, Xianjin
Yang, Gaoming
Huang, Ji
IEEE ACCESS, 2022, 10 : 76596 - 76606

← 1 2 3 4 5 →