High precision real-time semantic segmentation algorithm: Multi-channel deep weighted aggregation network

被引:0
作者
Qi Y.-S. [1 ,2 ,3 ]
Chen P.-L. [1 ,2 ,3 ]
Gao X.-J. [4 ]
Dong C.-Y. [1 ,2 ,3 ]
Wei S.-J. [1 ,2 ,3 ]
机构
[1] School of Electric Power, Inner Mongolia University of Technology, Hohhot
[2] Engineering Research Center of Large Energy Storage Technology of Ministry of Education, Hohhot
[3] Center for Intelligent Energy Technology and Equipment Engineering, Inner Mongolia University, Hohhot
[4] Faculty of Information Technology, Beijing University of Technology, Beijing
来源
Kongzhi yu Juece/Control and Decision | 2024年 / 39卷 / 05期
关键词
context information; deep learning; depth fusion; semantic feature; semantic segmentation;
D O I
10.13195/j.kzyjc.2022.1699
中图分类号
学科分类号
摘要
In recent years, with the continuous development of deep learning technology, various semantic segmentation algorithms based on deep learning have emerged, but most of the segmentation algorithms cannot achieve high speed and high accuracy at the same time, and a real-time semantic segmentation framework for multi-channel depth-weighted aggregation networks (MCDWA_Net) is proposed to solve this problem. Firstly, the multi-channel idea is introduced to construct a three-channel semantic representation model, which is used to extract three types of complementary semantic information of the image: 1) Low-level semantic channel outputs the local features such as the edge, color, and structure of the object in the image; 2) Auxiliary semantic channel extracts the transition information between low-level semantics and high-level semantics, and realizes multi-layer feedback to the high-level semantic channel; 3) Advanced semantic channel obtains context logical relationships and category semantic information in images. Then, a three-class semantic feature weighted aggregation module is designed to output a more complete global semantic description. Finally, an enhancement training mechanism is introduced to realize the feature enhancement in the training stage, thereby improving the training speed. Experimental results show that the proposed method not only has fast inference speed, but also has high segmentation accuracy in complex scenes, which can achieve the balance of semantic segmentation speed and accuracy. © 2024 Northeast University. All rights reserved.
引用
收藏
页码:1450 / 1460
页数:10
相关论文
共 26 条
[1]  
Huang T H, Nie Z Y, Wang Q G, Et al., Real-time image semantic segmentation based on block adaptive feature fusion, Acta Automatica Sinica, 47, 5, pp. 1137-1148, (2021)
[2]  
Zhou B L, Zhao H, Puig X, Et al., Semantic understanding of scenes through the ADE20K dataset, International Journal of Computer Vision, 127, 3, pp. 302-321, (2019)
[3]  
Geiger A, Lenz P, Urtasun R., Are we ready for autonomous driving? The KITTI vision benchmark suite, 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp. 3354-3361, (2012)
[4]  
Yu Y, Fu Y X, Yang C D, Et al., Fine-grained car model recognition based on FR-ResNet, Acta Automatica Sinica, 47, 5, pp. 1125-1136, (2021)
[5]  
Wu Z F, Shen C H, Hengel A V D., Real-time semantic image segmentation via spatial sparsity, (2017)
[6]  
Paszke A, Chaurasia A, Kim S, Et al., ENet: A deep neural network architecture for real-time semantic segmentation, (2016)
[7]  
Ronneberger O, Fischer P, Brox T., U-net: Convolutional networks for biomedical image segmentation, International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 234-241, (2015)
[8]  
Han C, Wang J L, Wu Y X, Et al., A review of deep learning models based on neuroevolution, Acta Electronica Sinica, 49, 2, pp. 372-379, (2021)
[9]  
Evans B, Al-Sahaf H, Xue B, Et al., Evolutionary deep learning: A genetic programming approach to image classification, 2018 IEEE Congress on Evolutionary Computation, pp. 1-6, (2018)
[10]  
Wang C Y, Xu C, Yao X, Et al., Evolutionary generative adversarial networks, IEEE Transactions on Evolutionary Computation, 23, 6, pp. 921-934, (2019)