Using synthetic dataset for semantic segmentation of the human body in the problem of extracting anthropometric data

Authors
Absadyk, Azat [1]
Turar, Olzhas [1]
Akhmed-Zaki, Darkhan [1]
Affiliations
[1] Astana IT Univ, Dept Sci & Innovat, Astana, Kazakhstan
Keywords
synthetic data; human segmentation; anthropometry; CNN; NVIDIA replicator; human body
DOI
10.3389/frai.2024.1336320
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Background: The COVID-19 pandemic highlighted the need for accurate virtual sizing in e-commerce to reduce returns and waste. Existing methods for extracting anthropometric data from images have limitations. This study aims to develop a semantic segmentation model, trained on synthetic data, that can accurately determine body shape from real images while accounting for clothing.

Methods: A synthetic dataset of over 22,000 images was created using NVIDIA Omniverse Replicator, featuring human models in various poses, clothing, and environments. Popular CNN architectures (U-Net, SegNet, DeepLabV3, PSPNet) with different backbones were trained on this dataset for semantic segmentation. Models were evaluated on accuracy, precision, recall, and IoU. The best-performing model was tested on real human subjects, and the resulting estimates were compared with actual measurements.

Results: U-Net with an EfficientNet backbone performed best, with 99.83% training accuracy and an IoU score of 0.977. When tested on real images, it accurately segmented body shape while accounting for clothing. Comparison with actual measurements on 9 subjects showed average deviations of -0.24 cm for neck, -0.1 cm for shoulder, 1.15 cm for chest, -0.22 cm for waist, and 0.17 cm for hip measurements.

Discussion: The synthetic dataset and trained models enable accurate extraction of anthropometric data from real images while accounting for clothing. This approach has significant potential for improving virtual fitting and reducing returns in e-commerce. Future work will focus on refining the algorithm, particularly for waist and hip measurements, which showed higher variability.
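The abstract names accuracy, precision, recall, and IoU as evaluation metrics but does not say how they were computed. The following is a minimal sketch of the standard pixel-wise definitions for a binary body/background mask, not the authors' evaluation code. The function name `segmentation_metrics` and the toy masks are hypothetical and chosen only for illustration.

```python
from typing import Dict, Sequence

Mask = Sequence[Sequence[int]]


def segmentation_metrics(pred: Mask, target: Mask) -> Dict[str, float]:
    """Pixel-wise metrics for two binary masks of equal shape.

    Assumption: 1 marks a body pixel and 0 marks background.
    """
    tp = fp = fn = tn = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                tp += 1  # body predicted, body in ground truth
            elif p and not t:
                fp += 1  # body predicted, background in ground truth
            elif not p and t:
                fn += 1  # background predicted, body in ground truth
            else:
                tn += 1  # background in both

    def ratio(num: int, den: int) -> float:
        # Return 0.0 for an empty denominator instead of raising.
        return num / den if den else 0.0

    return {
        "accuracy": ratio(tp + tn, tp + fp + fn + tn),
        "precision": ratio(tp, tp + fp),
        "recall": ratio(tp, tp + fn),
        "iou": ratio(tp, tp + fp + fn),  # intersection over union
    }


# Toy 2x3 masks (hypothetical data, not from the study).
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
metrics = segmentation_metrics(pred, target)
```

IoU ignores true negatives, so it is less inflated than accuracy when background pixels dominate the image, which is one reason the two are usually reported together.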
Pages: 15