Convolutional Neural Network-Based Multiscale Feature Selection and Evaluation in Image Segmentation

被引:0
作者
Cao, Di [1 ]
Cao, Jian-Nong [2 ,3 ]
Deng, Liang [4 ]
Lou, Li-Ping [5 ]
机构
[1] Changan Univ, Sch Earth Sci & Resources, Xian 710054, Peoples R China
[2] Minist Land & Resources, Key Lab Degraded & Unused Land Consolidat Engn, Xian 710054, Peoples R China
[3] Changan Univ, Sch Geol Engn & Geomat, Xian 710054, Peoples R China
[4] Xian Res Inst Co Ltd, China Coal Technol & Engn Grp, Xian 710054, Peoples R China
[5] Yellow River Engn Consulting Co Ltd, Zhengzhou 450000, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation; remote sensing; multiscale feature pyramid; pooling model; convolutional neural networks; MULTIRESOLUTION;
D O I
10.1109/ACCESS.2024.3400026
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multiscale image segmentation based on artificial neural networks is a hot topic in research on remote sensing image processing. However, the establishment and evaluation of pooling models and selection of feature operators lack clear standards. Based on the biological visual multiscale perception mechanism, this study combines classical wavelet theory with convolutional neural network theory to establish 10 sets of geometric operators and construct the corresponding multiscale image feature pyramids. Statistical analysis shows that the 10 sets of operators exhibit two types of information transmission characteristics, that is, balanced and growth. The obtained image features become more fragmented as operator complexity increases. After excluding the two operator groups with high complexities, the remaining eight groups were applied to the convolutional neural network image-segmentation algorithm. Eight pooling models were established to obtain the corresponding multiscale image features, perform convolution operations, and generate multiscale segmentation results for remote sensing images. The evaluation results reveal that the high complexity of the feature operators is unfavorable for feature transmission and preservation, and compared with operators having the information transmission characteristics of growth, those with balanced information transmission characteristics show better performance in convolutional neural network image segmentation. The segmentation accuracy was improved by 1.5%-2%. The conformity of the segmentation results was improved by 1%-1.5%. Finally, the degree of interclass chaos is reduced by 4.1%-10%.
引用
收藏
页码:68003 / 68014
页数:12
相关论文
共 38 条
[1]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[2]   The principled design of large-scale recursive neural network architectures-DAG-RNNs and the protein structure prediction problem [J].
Baldi, P ;
Pollastri, G .
JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4 (04) :575-602
[3]  
Byeon W, 2014, IEEE IMAGE PROC, P4373, DOI 10.1109/ICIP.2014.7025887
[4]  
Candes D. L., 1999, Curvelets-A surprisingly effectivenonadaptive representation for objects with edges,''Curve and SurfaceFitting: Saint-Malo, P105
[5]   Deep Learning-Based Multiscale Pyramid Sieve and Analysis Module in Image Segmentation [J].
Cao, Di ;
Cao, Jian-Nong ;
Zhu, Qian ;
Lou, Li-Jiao ;
Xiao, Nian-Zhong .
IEEE ACCESS, 2023, 11 :22307-22319
[6]  
Cao Jiannong, 2014, Geomatics and Information Science of Wuhan University, V39, P831, DOI 10.13203/j.whugis20130611
[7]   Performance measure characterization for evaluating neuroimage segmentation algorithms [J].
Chang, Herng-Hua ;
Zhuang, Audrey H. ;
Valentino, Daniel J. ;
Chu, Woei-Chyn .
NEUROIMAGE, 2009, 47 (01) :122-135
[8]  
[陈恩庆 Chen Enqing], 2015, [计算机应用研究, Application Research of Computers], V32, P3801
[9]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[10]  
Cohen A., 1992, Rev. Math. Iberoamer, V9, P51, DOI DOI 10.4171/RMI/133