Indoor Objects and Outdoor Urban Scenes Recognition by 3D Visual Primitives

Cited by: 1
Authors
Fu, Junsheng [1, 3]
Kamarainen, Joni-Kristian [1 ]
Buch, Anders Glent [2 ]
Kruger, Norbert [2 ]
Affiliations
[1] Tampere Univ Technol, Vis Grp, FIN-33101 Tampere, Finland
[2] Univ Southern Denmark, CARO Grp, Odense, Denmark
[3] Nokia Res Ctr, Tampere, Finland
Source
COMPUTER VISION - ACCV 2014 WORKSHOPS, PT I | 2015 / Vol. 9008
Keywords
FEATURES;
DOI
10.1007/978-3-319-16628-5_20
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Object detection, recognition and pose estimation in 3D images have gained momentum due to the availability of 3D sensors (RGB-D) and the growth of large-scale 3D data, such as city maps. The most popular approach is to extract and match 3D shape descriptors that encode local scene structure but omit visual appearance. Visual appearance can be problematic due to imaging distortions, but the assumption that local shape structures are sufficient to recognise objects and scenes is largely invalid in practice, since objects may have similar shape but different texture (e.g., grocery packages). In this work, we propose an alternative appearance-driven approach which first extracts 2D primitives justified by Marr's primal sketch; these are "accumulated" over multiple views, and the most stable ones are "promoted" to 3D visual primitives. The promoted 3D primitives represent both structure and appearance. For recognition, we propose a fast and effective correspondence matching using random sampling. For quantitative evaluation, we construct a semisynthetic benchmark dataset using a public 3D model dataset of 119 kitchen objects and another benchmark of challenging street-view images from 4 different cities. In the experiments, our method utilises only a stereo view for training. As a result, on the kitchen objects dataset our method achieved an almost perfect recognition rate for +/- 10 degrees of camera viewpoint change and nearly 80% for +/- 20 degrees; on the street-view benchmarks it achieved 75% accuracy for 160 street-view image pairs, 80% for 96 image pairs, and 92% for 48 image pairs.
Pages: 270 - 285
Number of pages: 16