Video segmentation of industrial smoke based on dynamic fully convolutional network-Gaussian mixture model and multi-scale fusion attention module

被引:0
作者
Ding Wenyu [1 ,2 ]
Liu Hui [1 ,2 ]
Chen Fugang [3 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming, Yunnan, Peoples R China
[2] Kunming Univ Sci & Technol, Yunnan Key Lab Artificial Intelligence, Kunming, Yunnan, Peoples R China
[3] Yunnan Kungang Elect & Inforrmat Sci Ltd, Kunming, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
industrial smoke; spatiotemporal feature; convolutional network; Gaussian mixture model; multi-scale fusion; attention module; RECALL;
D O I
10.1117/1.JEI.32.3.033038
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Accurately segmenting industrial smoke in videos plays a crucial role in assessing pollution levels based on smoke image evaluation. However, existing fully convolutional networks (FCNs) face challenges in precisely segmenting the edges of industrial smoke and exhibit low extraction and segmentation accuracy for small target smoke. To address this issue, we propose a video segmentation method specifically designed for industrial smoke. This method utilizes the dynamic FCN-Gaussian mixture model (GMM) along with a multi-scale fusion module and an attention module. The FCN-GMM effectively extracts dynamic feature information from spatiotemporal data, capturing motion in video or image sequences while preserving spatial details. The key innovation of FCN-GMM lies in integrating dynamic and static networks through a neural network, enabling the capture of features in both the temporal and spatial domains. Our approach begins by constructing a dynamic feature extraction network that captures spatial and temporal feature information separately during the training process, thereby enhancing the extraction of smoke edges. Additionally, we introduce a mechanism for multi-scale feature fusion and an attention module to effectively extract information related to small target smoke. Our experimental results demonstrate that our network accurately segments significant target smoke compared with FCNs. Furthermore, the network prioritizes the consideration of smoke edge information and improves the extraction of small target smoke, thereby enhancing the overall accuracy of smoke image segmentation with an increase of up to 10% in the intersection over union index. (C) 2023 SPIE and IS&T
引用
收藏
页数:27
相关论文
共 46 条
  • [1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [2] Bak C, 2017, Arxiv, DOI arXiv:1607.04730
  • [3] Survey on deep learning applications in digital image security
    Bao, Zhenjie
    Xue, Ru
    [J]. OPTICAL ENGINEERING, 2021, 60 (12)
  • [4] BUCKLAND M, 1994, J AM SOC INFORM SCI, V45, P12, DOI 10.1002/(SICI)1097-4571(199401)45:1<12::AID-ASI2>3.0.CO
  • [5] 2-L
  • [6] Celik Turgay, 2007, 2007 15th European Signal Processing Conference (EUSIPCO), P1794
  • [7] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [8] Cheng Jingchun, 2017, Learning to segment instances in videos with spatial propagation network
  • [9] A Motion Image Detection Method Based on the Inter-frame Difference Method
    Cheng, Yuanhang
    Wang, Jing
    [J]. MECHANICAL DESIGN AND POWER ENGINEERING, PTS 1 AND 2, 2014, 490-491 : 1283 - 1286
  • [10] Robust background subtraction with foreground validation for urban traffic video
    Cheung, SCS
    Kamath, C
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (14) : 2330 - 2340