Multi-scale Autoencoders in Autoencoder for Semantic Image Segmentation

被引:0
|
作者
Yusiong, John Paul T. [1 ,2 ]
Naval, Prospero C., Jr. [1 ]
机构
[1] Univ Philippines, Coll Engn, Dept Comp Sci, Comp Vis & Machine Intelligence Grp, Quezon City, Philippines
[2] Univ Philippines, Visayas Tacloban Coll, Div Nat Sci & Math, Tacloban City, Leyte, Philippines
关键词
Autoencoders in autoencoder; Semantic image segmentation; Stacked autoencoders;
D O I
10.1007/978-3-030-14799-0_51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic image segmentation is essential for scene understanding. Several state-of-the-art deep learning-based approaches achieved remarkable results by increasing the network depth to improve performance. Using this principle, we introduce a novel encoder-decoder network architecture for semantic image segmentation of outdoor scenes called SAsiANet. SAsiANet utilizes multi-scale cascaded autoencoders at the decoder section of an autoencoder to achieve high accuracy pixel-wise prediction and involves exploiting features across multiple scales when upsampling the output of the encoder to obtain better spatial and contextual information effectively. The proposed network architecture is trained using the cross-entropy loss function but without incorporating any class balancing technique to the loss function. Our experimental results on two challenging outdoor scenes: the CamVid urban scenes dataset and the Freiburg forest dataset demonstrate that SAsiANet provides an effective way of producing accurate segmentation maps since it achieved state-of-the-art results on the test set of both datasets, 72.40% mIoU, and 89.90% mIoU, respectively.
引用
收藏
页码:587 / 599
页数:13
相关论文
共 50 条
  • [1] Multi-scale image semantic segmentation based on ASPP and improved HRNet
    Shi Jian-feng
    Gao Zhi-ming
    Wang A-chuan
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2021, 36 (11) : 1497 - 1505
  • [2] Multi-scale Convolutional Neural Network for SAR Image Semantic Segmentation
    Duan, Yiping
    Tao, Xiaoming
    Han, Chaoyi
    Qin, Xiaowei
    Lu, Jianhua
    2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [3] Enhanced multi-scale networks for semantic segmentation
    Li, Tianping
    Cui, Zhaotong
    Han, Yu
    Li, Guanxing
    Li, Meng
    Wei, Dongmei
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 2557 - 2568
  • [4] Multi-scale Context Intertwining for Semantic Segmentation
    Lin, Di
    Ji, Yuanfeng
    Lischinski, Dani
    Cohen-Or, Daniel
    Huang, Hui
    COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 622 - 638
  • [5] Dynamic Multi-scale Filters for Semantic Segmentation
    He, Junjun
    Deng, Zhongying
    Qiao, Yu
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3561 - 3571
  • [6] Enhanced multi-scale networks for semantic segmentation
    Tianping Li
    Zhaotong Cui
    Yu Han
    Guanxing Li
    Meng Li
    Dongmei Wei
    Complex & Intelligent Systems, 2024, 10 : 2557 - 2568
  • [7] Semantic image segmentation using fully convolutional neural networks with multi-scale images and multi-scale dilated convolutions
    Duc My Vo
    Sang-Woong Lee
    Multimedia Tools and Applications, 2018, 77 : 18689 - 18707
  • [8] Semantic image segmentation using fully convolutional neural networks with multi-scale images and multi-scale dilated convolutions
    Duc My Vo
    Lee, Sang-Woong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (14) : 18689 - 18707
  • [9] Encoder-Decoder with Multi-scale Information Fusion for Semantic Image Segmentation
    Ma, Xinxin
    Liu, Kai
    Ding, Chongyang
    Yan, Lin
    Duan, Meiyu
    ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
  • [10] Semantic Segmentation of Remote Sensing Image Based on Multi-Scale Semantic Encoder-Decoder Network
    Liang Y.
    Yi C.-X.
    Wang G.-Y.
    Hu Y.-H.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (11): : 3199 - 3214