Data augmentation in material images using the improved HP-VAE-GAN

被引:19
作者
Han, Yuexing [1 ,2 ,3 ]
Liu, Yuhong [1 ]
Chen, Qiaochuan [1 ,4 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Key Lab Silicate Cultural Rel Conservat, Minist Educ, Shanghai 200444, Peoples R China
[3] Zhejiang Lab, Hangzhou 311100, Peoples R China
[4] Shanghai Univ, Sch Comp Engn & Sci, 99 Shangda Rd, Shanghai 200444, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Data augmentation; Image generation; HP-VAE-GAN; CBAM; Material image classification; MICROSTRUCTURE;
D O I
10.1016/j.commatsci.2023.112250
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Since the rapid development of computer vision relies heavily on large-scale labeled data and high-performance computing equipment, therefore, image recognition in small sample datasets faces several challenges, such as difficult to implement model training. In the field of materials research, the cost of collecting image data is relatively high. In order to solve the problem of insufficient image samples in material research, an improved HP-VAE-GAN is proposed to generate material images to achieve data augmentation. HP-VAE-GAN is a single sample generation model that consists of Patch-VAE and Patch-GAN. The improved HP-VAE-GAN introduces the attention mechanism into model. By adding CBAM (Convolutional Block Attention Module) to the encoder of Patch-VAE, the feature extraction and representation capabilities of the network are further improved. Use this model to train a single image, and then generate a certain number of samples to achieve the expansion of the training set. For the classification of ultrahigh carbon steel microstructure images, experiments show that the accuracy of classification model (MobileNet, ResNet50 and VGG16) trained with real images plus generated images is improved obviously. In addition, the effectiveness of the improved HP-VAE-GAN is verified by ex-periments on texture images similar to material images.
引用
收藏
页数:10
相关论文
共 41 条
[1]  
Brock A, 2019, Arxiv, DOI arXiv:1809.11096
[2]   Machine learning for molecular and materials science [J].
Butler, Keith T. ;
Davies, Daniel W. ;
Cartwright, Hugh ;
Isayev, Olexandr ;
Walsh, Aron .
NATURE, 2018, 559 (7715) :547-555
[3]  
Camastra F., 2007, BOOK REV IMAGE PROCE
[4]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[5]  
Chen Pengfei., 2018, Log hyperbolic cosine loss improves variational auto-encoder
[6]   UHCSDB: UltraHigh Carbon Steel Micrograph DataBase Tools for Exploring Large Heterogeneous Microstructure Datasets [J].
DeCost, Brian L. ;
Hecht, Matthew D. ;
Francis, Toby ;
Webler, Bryan A. ;
Picard, Yoosuf N. ;
Holm, Elizabeth A. .
INTEGRATING MATERIALS AND MANUFACTURING INNOVATION, 2017, 6 (02) :197-205
[7]  
Demir U., 2018, Patch-based image inpainting with generative adversarial networks
[8]  
Howard AG, 2017, Arxiv, DOI arXiv:1704.04861
[9]  
Gadkari Dhanashree., 2004, Image Quality Analysis Using GLCM
[10]   Generative Adversarial Networks [J].
Goodfellow, Ian ;
Pouget-Abadie, Jean ;
Mirza, Mehdi ;
Xu, Bing ;
Warde-Farley, David ;
Ozair, Sherjil ;
Courville, Aaron ;
Bengio, Yoshua .
COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144