An Optimized Segmentation Scheme for Ambiguous Pixels Based on Improved FCN and DenseNet

被引:1
作者
Chen, Bolin [1 ]
Zhao, Tiesong [1 ,2 ]
Zhou, Liping [1 ]
Yang, Jing [1 ]
Liu, Jiahui [1 ]
Lin, Liqun [1 ]
机构
[1] Fuzhou Univ, Coll Phys & Informat Engn, Fujian Key Lab Intelligent Proc & Wireless Transm, Fuzhou, Fujian, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Guangdong, Peoples R China
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Semantic segmentation; Image classification; Region of interest; Ambiguous pixels; Atrous-ResFCN; MFR-DenseNet; IMAGE SEGMENTATION;
D O I
10.1007/s00034-021-01784-9
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
These past six years have witnessed that segmentation algorithms make a breakthrough due to the development of deep-based semantic segmentation networks, which can realize classification from image-level to pixel-level. However, when segmenting a small-batch, complex-features and obvious-gap database, these semantic segmentation networks will lead to segmentation ambiguities in some boundaries between foreground (i.e. the main segmentation body) and its background with limited image detail expression. To solve such problem, this paper proposes an optimized segmentation scheme for ambiguous pixels based on Atrous-ResFCN and MFR-DenseNet, namely ARMD. Firstly, masking algorithm and adaptive box mechanism are applied in pre-processed KITTI segmentation database to construct the classification database, which will be used to train ambiguities judgment model. In addition, Atrous-ResFCN-8s and Atrous-ResFCN-16s are proposed and further combined to determine segmentation ambiguities with different-level segmentation abilities. Finally, MFR-DenseNet is further migrated to optimize these ambiguous pixels with effective threshold selection. Experimental results demonstrate that our proposed ARMD algorithm is beneficial to improve segmentation accuracy, where the highest MIoU and PA are able to reach 88.18% and 95.88%, respectively.
引用
收藏
页码:372 / 394
页数:23
相关论文
共 34 条
  • [1] SLIC Superpixels Compared to State-of-the-Art Superpixel Methods
    Achanta, Radhakrishna
    Shaji, Appu
    Smith, Kevin
    Lucchi, Aurelien
    Fua, Pascal
    Suesstrunk, Sabine
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) : 2274 - 2281
  • [2] SEEDED REGION GROWING
    ADAMS, R
    BISCHOF, L
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1994, 16 (06) : 641 - 647
  • [3] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [4] Large-Scale Machine Learning with Stochastic Gradient Descent
    Bottou, Leon
    [J]. COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 177 - 186
  • [5] Fast approximate energy minimization via graph cuts
    Boykov, Y
    Veksler, O
    Zabih, R
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (11) : 1222 - 1239
  • [7] Multipath feature recalibration DenseNet for image classification
    Chen, Bolin
    Zhao, Tiesong
    Liu, Jiahui
    Lin, Liqun
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (03) : 651 - 660
  • [8] Multi-modal fusion network with multi-scale multi-path and cross-modal interactions for RGB-D salient object detection
    Chen, Hao
    Li, Youfu
    Su, Dan
    [J]. PATTERN RECOGNITION, 2019, 86 : 376 - 385
  • [9] Chen L.-C., 2017, RETHINKING ATROUS CO
  • [10] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851