Algorithm for segmentation of smoke using the improved DeeplabV3 network

被引:0
|
作者
Wang Z. [1 ]
Su Y. [1 ]
Liu Y. [2 ]
Zhang W. [3 ]
机构
[1] School of Electrical Automation and Information Engineering, Tianjin University, Tianjin
[2] School of Electronic Information and Optical Engineering, Nankai University, Tianjin
[3] School of Microelectronics, Tianjin University, Tianjin
关键词
Attention mechanism; Deep learning; Deformable convolution; Image processing; Semantic segmentation; Smoke detection;
D O I
10.19665/j.issn1001-2400.2019.06.008
中图分类号
学科分类号
摘要
Existing smoke detection methods depend mostly on features which are selected manually and the smoke areas in video images often cannot be segmented accurately. This paper proposes an improved DeeplabV3 smoke segmentation algorithm based on this. A feature refinement module is added after the basic encoder network to weaken the gridding effects caused by dilated convolutions. For the non-rigid objects such as smoke with variable scales and postures, the Atrous Spatial Pyramid Pooling module is combined with the deformable convolution to better adapt to the smoke deformation. And a channel attention decoder module is proposed to further restore the spatial details of smoke images. According to the test of the smoke image data set, the proposed model has a faster average prediction speed of 71.73ms per image. Besides, compared with the DeeplabV3 network, this algorithm can lead to a higher MPA (Mean Pixel Accuracy) of 97.78% and a higher MIoU (Mean Intersection over Union) of 91.21%, thus improving the performance by 0.56% and 2.17% respectively, and making it more suitable for smoke segmentation. Public smoke video test results show that this model outperforms other video-based smoke detection methods for the detection rate, and that it is of certain research significance and practical value. © 2019, The Editorial Board of Journal of Xidian University. All right reserved.
引用
收藏
页码:52 / 59
页数:7
相关论文
共 15 条
  • [1] Muhammad K., Ahmad J., Mehmood I., Et al., Convolutional Neural Networks Based Fire Detection in Surveillance Videos, IEEE Access, 6, pp. 18174-18183, (2018)
  • [2] Zhang Q.X., Lin G.H., Zhang Y.M., Et al., Wildland Forest Fire Smoke Detection Based on Faster R-CNN using Synthetic Smoke Images, Procedia Engineering: 211, pp. 441-446, (2018)
  • [3] Yuan F., Zhang L., Xia X., Et al., Deep Smoke Segmentation, Neurocomputing, 357, pp. 248-260, (2019)
  • [4] Long J., Shelhamer E., Darrell T., Fully Convolutional Networks for Semantic Segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 4, pp. 640-651, (2014)
  • [5] Chen L.C., Papandreou G., Kokkinos I., Et al., Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs, Conference Track Proceedings of the 2015 3rd International Conference on Learning Representations, (2015)
  • [6] Chen L.C., Papandreou G., Kokkinos I., Et al., DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFS, IEEE Transactions on Pattern Analysis and Machine Intelligence, 40, 4, pp. 834-848, (2018)
  • [7] Chen L.C., Papandreou G., Schroff F., Et al., Rethinking Atrous Convolution for Semantic Image Segmentation
  • [8] Yu F., Koltun V., Funkhouser T., Dilated Residual Networks, Proceedings of the 2017 30th IEEE Conference on Computer Vision and Pattern Recognition, pp. 636-644, (2017)
  • [9] Dai J., Qi H., Xiong Y., Et al., Deformable Convolutional Networks, Proceedings of the 2017 IEEE International Conference on Computer Vision, pp. 764-773, (2017)
  • [10] Hu J., Shen L., Sun G., Squeeze-and-excitation Networks, Proceedings of the 2018 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 7132-7141, (2018)