Incremental and Multi-Task Learning Strategies for Coarse-To-Fine Semantic Segmentation

被引：9

作者：

Mel, Mazen ^{[1
,2
]}

Michieli, Umberto ^{[1
]}

Zanuttigh, Pietro ^{[1
]}

机构：

[1] Univ Padua, Dept Informat Engn, I-35131 Padua, Italy

[2] Higher Sch Commun Tunis SupCom, Ariana 2083, Tunisia

来源：

TECHNOLOGIES | 2020年 / 8卷 / 01期

关键词：

semantic segmentation; deep learning; hierarchical learning; incremental learning; multi-task learning;

D O I：

10.3390/technologies8010001

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

The semantic understanding of a scene is a key problem in the computer vision field. In this work, we address the multi-level semantic segmentation task where a deep neural network is first trained to recognize an initial, coarse, set of a few classes. Then, in an incremental-like approach, it is adapted to segment and label new objects' categories hierarchically derived from subdividing the classes of the initial set. We propose a set of strategies where the output of coarse classifiers is fed to the architectures performing the finer classification. Furthermore, we investigate the possibility to predict the different levels of semantic understanding together, which also helps achieve higher accuracy. Experimental results on the New York University Depth v2 (NYUDv2) dataset show promising insights on the multi-level scene understanding.

引用

页数：16

共 43 条

[1]

Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265

[2]

[Anonymous], P IEEE C COMP VIS PA

[3] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

[4] A model of inductive bias learning [J].

Baxter, J .

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 12 :149-198

[5]

Bommana MM, 2019, NANOPARTICLES IN PHARMACOTHERAPY, P23, DOI 10.1016/B978-0-12-816504-1.00001-6

[6]

Bucilu C., 2006, P ACM INT C KNOWL DI, P535, DOI DOI 10.1145/1150402.1150464

[7] Semantic parsing for priming object detection in indoors RGB-D scenes [J].

Cadena, Cesar ;

Kosecka, Jana .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2015, 34 (4-5) :582-597

[8] Multitask learning [J].

Caruana, R .

MACHINE LEARNING, 1997, 28 (01) :41-75

[9]

Chen L.C., 2019, P EUR C COMP VIS ECC, P833

[10] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

← 1 2 3 4 5 →