An Optimized Segmentation Scheme for Ambiguous Pixels Based on Improved FCN and DenseNet

被引：1

作者：

Chen, Bolin ^{[1
]}

Zhao, Tiesong ^{[1
,2
]}

Zhou, Liping ^{[1
]}

Yang, Jing ^{[1
]}

Liu, Jiahui ^{[1
]}

Lin, Liqun ^{[1
]}

机构：

[1] Fuzhou Univ, Coll Phys & Informat Engn, Fujian Key Lab Intelligent Proc & Wireless Transm, Fuzhou, Fujian, Peoples R China

[2] Peng Cheng Lab, Shenzhen, Guangdong, Peoples R China

来源：

CIRCUITS SYSTEMS AND SIGNAL PROCESSING | 2022年 / 41卷 / 01期

基金：

中国国家自然科学基金; 美国国家科学基金会;

关键词：

Semantic segmentation; Image classification; Region of interest; Ambiguous pixels; Atrous-ResFCN; MFR-DenseNet; IMAGE SEGMENTATION;

D O I：

10.1007/s00034-021-01784-9

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

These past six years have witnessed that segmentation algorithms make a breakthrough due to the development of deep-based semantic segmentation networks, which can realize classification from image-level to pixel-level. However, when segmenting a small-batch, complex-features and obvious-gap database, these semantic segmentation networks will lead to segmentation ambiguities in some boundaries between foreground (i.e. the main segmentation body) and its background with limited image detail expression. To solve such problem, this paper proposes an optimized segmentation scheme for ambiguous pixels based on Atrous-ResFCN and MFR-DenseNet, namely ARMD. Firstly, masking algorithm and adaptive box mechanism are applied in pre-processed KITTI segmentation database to construct the classification database, which will be used to train ambiguities judgment model. In addition, Atrous-ResFCN-8s and Atrous-ResFCN-16s are proposed and further combined to determine segmentation ambiguities with different-level segmentation abilities. Finally, MFR-DenseNet is further migrated to optimize these ambiguous pixels with effective threshold selection. Experimental results demonstrate that our proposed ARMD algorithm is beneficial to improve segmentation accuracy, where the highest MIoU and PA are able to reach 88.18% and 95.88%, respectively.

引用

页码：372 / 394

页数：23

共 34 条

[1] SLIC Superpixels Compared to State-of-the-Art Superpixel Methods
Achanta, Radhakrishna
Shaji, Appu
Smith, Kevin
Lucchi, Aurelien
Fua, Pascal
Suesstrunk, Sabine
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) : 2274 - 2281
[2] SEEDED REGION GROWING
ADAMS, R
BISCHOF, L
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1994, 16 (06) : 641 - 647
[3] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[4] Large-Scale Machine Learning with Stochastic Gradient Descent
Bottou, Leon
[J]. COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 177 - 186
[5] Fast approximate energy minimization via graph cuts
Boykov, Y
Veksler, O
Zabih, R
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (11) : 1222 - 1239
[6] A COMPUTATIONAL APPROACH TO EDGE-DETECTION
CANNY, J
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1986, 8 (06) : 679 - 698
[7] Multipath feature recalibration DenseNet for image classification
Chen, Bolin
Zhao, Tiesong
Liu, Jiahui
Lin, Liqun
[J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (03) : 651 - 660
[8] Multi-modal fusion network with multi-scale multi-path and cross-modal interactions for RGB-D salient object detection
Chen, Hao
Li, Youfu
Su, Dan
[J]. PATTERN RECOGNITION, 2019, 86 : 376 - 385
[9] Chen L.-C., 2017, RETHINKING ATROUS CO
[10] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Chen, Liang-Chieh
Zhu, Yukun
Papandreou, George
Schroff, Florian
Adam, Hartwig
[J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851

← 1 2 3 4 →