Multi-view 3D Entangled Forest For Semantic Segmentation and Mapping

被引:0
作者
Antonello, Morris [1 ]
Wolf, Daniel [2 ]
Prankl, Johann [2 ]
Ghidoni, Stefano [1 ]
Menegatti, Emanuele [1 ]
Vincze, Markus [2 ]
机构
[1] Univ Padua, Dept Informat Engn, Padua, Italy
[2] Vienna Univ Technol, Automat & Control Inst, Vienna, Austria
来源
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2018年
关键词
SEARCH;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Applications that provide location related services need to understand the environment in which humans live such that verbal references and human interaction are possible. We formulate this semantic labelling task as the problem of learning the semantic labels from the perceived 3D structure. In this contribution we propose a batch approach and a novel multi-view frame fusion technique to exploit multiple views for improving the semantic labelling results. The batch approach works offline and is the direct application of an existing single-view method to scene reconstructions with multiple views. The multi-view frame fusion works in an incremental fashion accumulating the single-view results, hence allowing the online multi-view semantic segmentation of single frames and the offline reconstruction of semantic maps. Our experiments show the superiority of the approaches based on our fusion scheme, which leads to a more accurate semantic labelling.
引用
收藏
页码:1855 / 1862
页数:8
相关论文
共 33 条
[1]   Contextually guided semantic labeling and search for three-dimensional point clouds [J].
Anand, Abhishek ;
Koppula, Hema Swetha ;
Joachims, Thorsten ;
Saxena, Ashutosh .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (01) :19-34
[2]  
[Anonymous], ARXIV160905130
[3]  
[Anonymous], 2015, CVPR
[4]  
[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.442
[5]  
[Anonymous], 2017, Mobilenets: Efficient Convolutional Neural Networks for Mobile Vision Applications
[6]  
Armeni I., 2017, ARXIV E PRINTS
[7]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[8]   MULTIDIMENSIONAL BINARY SEARCH TREES USED FOR ASSOCIATIVE SEARCHING [J].
BENTLEY, JL .
COMMUNICATIONS OF THE ACM, 1975, 18 (09) :509-517
[9]  
Couprie C., 2013, P INT C LEARN REPR
[10]   Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture [J].
Eigen, David ;
Fergus, Rob .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2650-2658