Incremental and Multi-Task Learning Strategies for Coarse-To-Fine Semantic Segmentation

被引:8
作者
Mel, Mazen [1 ,2 ]
Michieli, Umberto [1 ]
Zanuttigh, Pietro [1 ]
机构
[1] Univ Padua, Dept Informat Engn, I-35131 Padua, Italy
[2] Higher Sch Commun Tunis SupCom, Ariana 2083, Tunisia
关键词
semantic segmentation; deep learning; hierarchical learning; incremental learning; multi-task learning;
D O I
10.3390/technologies8010001
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The semantic understanding of a scene is a key problem in the computer vision field. In this work, we address the multi-level semantic segmentation task where a deep neural network is first trained to recognize an initial, coarse, set of a few classes. Then, in an incremental-like approach, it is adapted to segment and label new objects' categories hierarchically derived from subdividing the classes of the initial set. We propose a set of strategies where the output of coarse classifiers is fed to the architectures performing the finer classification. Furthermore, we investigate the possibility to predict the different levels of semantic understanding together, which also helps achieve higher accuracy. Experimental results on the New York University Depth v2 (NYUDv2) dataset show promising insights on the multi-level scene understanding.
引用
收藏
页数:16
相关论文
共 43 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] [Anonymous], P INT C LEARN REPR I
  • [3] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [4] A model of inductive bias learning
    Baxter, J
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 12 : 149 - 198
  • [5] Bommana MM, 2019, NANOPARTICLES IN PHARMACOTHERAPY, P23, DOI 10.1016/B978-0-12-816504-1.00001-6
  • [6] Bucilua C., 2006, P 12 ACM SIGKDD INT, P535, DOI DOI 10.1145/1150402.1150464
  • [7] Semantic parsing for priming object detection in indoors RGB-D scenes
    Cadena, Cesar
    Kosecka, Jana
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2015, 34 (4-5) : 582 - 597
  • [8] Multitask learning
    Caruana, R
    [J]. MACHINE LEARNING, 1997, 28 (01) : 41 - 75
  • [9] Chen L.C., 2019, P EUR C COMP VIS ECC, P833
  • [10] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848